Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single Image Denoising

3citations

arXiv:2502.06432 PDF Project

citations

#1222

in AAAI 2025

of 3028 papers

Top Authors

Data Points

Top Authors

Huaqiu Li Wang Zhang Xiaowan Hu Tao Jiang Zikang Chen Haoqian Wang

Topics

single image denoising self-supervised learning latent diffusion process structural representation learning prompt learning transformer architecture attention mechanism scale replay training

Abstract

Many studies have concentrated on constructing supervised models utilizing paired datasets for image denoising, which proves to be expensive and time-consuming. Current self-supervised and unsupervised approaches typically rely on blind-spot networks or sub-image pairs sampling, resulting in pixel information loss and destruction of detailed structural information, thereby significantly constraining the efficacy of such methods. In this paper, we introduce Prompt-SID, a prompt-learning-based single image denoising framework that emphasizes preserving of structural details. This approach is trained in a self-supervised manner using downsampled image pairs. It captures original-scale image information through structural encoding and integrates this prompt into the denoiser. To achieve this, we propose a structural representation generation model based on the latent diffusion process and design a structural attention module within the transformer-based denoiser architecture to decode the prompt. Additionally, we introduce a scale replay training mechanism, which effectively mitigates the scale gap from images of different resolutions. We conduct comprehensive experiments on synthetic, real-world, and fluorescence imaging datasets, showcasing the remarkable effectiveness of Prompt-SID. Our code will be released at https://github.com/huaqlili/Prompt-SID.

Citation History

Jan 27, 2026

Feb 7, 2026

3+3

Feb 13, 2026