基于提示学习的多尺度图像去模糊新方法

谢斌; 黎彦先; 邵祥; 戴邦强

发布时间： 2024-08-15
摘要点击次数： 299
全文下载次数： 321
DOI: :10.11834/jig.240315
| Volume | Number

基于提示学习的多尺度图像去模糊新方法

谢斌, 黎彦先, 邵祥, 戴邦强(江西理工大学)

摘要

目的针对传统基于深度学习的去模糊方法存在的伪影明显、细节模糊和噪声残留等问题,文中提出了一种基于提示学习的多尺度图像去模糊新方法。方法首先,在详细地分析了传统去模糊方法的基础上,文中引入了基于提示学习的特定退化信息编码模块,利用退化信息中包含的上下文信息来动态地引导深度网络以更有效地完成去模糊任务。其次,设计了新的门控前馈网络,通过控制各个层级的信息流动构建更为丰富和更具层次结构的特征表示,从而进一步提高对复杂数据的理解和处理能力,以更好地保持结果图像的几何结构。另外,新方法引入了经典的总变差正则来抑制去模糊过程中的噪声残留,以提高结果图像的视觉表现。结果大量基于GoPro和REDS数据集的实验结果表明,与其他先进的基于深度学习的去模糊方法相比,文中所提新方法在图像去模糊方面取得了更好的效果。在峰值信噪比(Peak Signal-to-Noise Ratio, PSNR)和结构相似性(Structural Similarity, SSIM)指标上,文中提出的新方法在GoPro数据集上分别达到了33.04dB和0.962的最优结果。在REDS数据集上分别达到了28.70dB和0.859的结果,并且,相比SAM-deblur方法,PSNR提升了1.77dB。结论相较于其他的去模糊方法,文中所提出的新方法不仅能够较好的保持结果图像的细节信息,而且还能够有效地克服伪影明显和噪声残留的问题,所得结果图像在PSNR和SSIM等客观评价指标方面均有更好的表现。

关键词

图像去模糊提示学习多尺度门控前馈网络深度卷积

Multi-scale image deblurring method based on prompt learning

Xie Bin, Li Yanxian, Shao Xiang, Dai Bangqiang(College of Information Engineering,Jiangxi University of Science and Technology)

Abstract

Objective Image deblurring is to restore a clean image from blurry image. It aims to maintain the structure and details of the original image during the restoration. With the rapid development of Internet technology, the way people get images becomes more diversified. However, the image is often blurred or distorted by various factors in the process of acquisition, so it is necessary to deblur the image. Image deblurring is of great significance to improve image quality, and plays a key role in many fields such as medical imaging, satellite image processing, security monitoring, which has attracted the attention of many researchers. Due to the ill-posed image deblurring task, more prior knowledge is needed to recover image with high quality. At present, the existing deblurring methods include traditional methods and deep learning based methods. In the traditional methods, although the filter based deblurring method is simple and convenient, the recovered images often have artifacts, content loss and other problems, which can not meet the needs of various applications. And the deblurring method based on the idea of regularity has been widely concerned by researchers for a long time, various methods of constructing regular terms have been proposed to solve this kind of ill-posed problems. Although these traditional methods can achieve the purpose of deblurring to a certain extent, these methods rely on the prior information of images, which is difficult to obtain accurately in practical applications, so this kind of methods can not be well promoted in a wide range. With the wide application of deep learning technology, more and more researchers begin to use this technology to solve the ill-posed problem. The image deblurring methods as a whole fall into three main categories: Convolutional Neural Network (CNN) based method; Generative Adversarial Networks(GAN) based method and Transformer based method. In the CNN based method, with the powerful feature extraction capability of CNN, the model can learn the complex mapping relationship, and by minimizing the loss function, it can guide the model convergence to get the best output images. However, such methods lack of the features on multi-scales, and can produce artifacts and loss of image details. In order to make up for the deficiency of the appeal methods, the researchers propose a new framework named GAN. In such method, by alternating training the generator and discriminator to continuously improve the performance of the generator and then get better quality resulting images. Due to the success of Transformer in natural language processing, researchers begin to introduce it into the field of image processing. The advantage of the Transformer based method is that the model can better capture local context information for better image deblurring. However, using the Transformer block will inevitably increase the computational complexity of the model. Aiming at the problems of obvious artifacts, fuzzy details and residual noise in previous image deblurring methods, a novel method of multi-scale image deblurring based on prompt learning is proposed. Method In this paper, three improvements are made. Firstly, the degraded information coding module based on Prompt learning can use the context information contained in the degraded image to dynamically guide the deep network to complete different image deblurring tasks. Next, a Gated Feed-Forward Network (GFFN) is designed to control the flow of information at each level to build a richer and more hierarchical feature representation. Based on this, Prompt U-shaped Block(PUBlock) is designed. In addition, on the basis of the original loss function, the adaptive total variation regularization is added to effectively suppress the noise residue in the process of image restoration and improve the visual performance of the result image. In general, through the introduction of gating mechanism, the network can dynamically control the flow of information, so as to capture complex feature relationships more effectively. Using deep convolution can improve the efficiency of the model while ensuring the performance of the model. Prompt learning can better help model utilize degraded images and adaptive regularization can selectively smooth the image, which not only removes the noise, but also prevents the image from being over-smooth. Result To demonstrate the effectiveness of the proposed method, we performed deblurring experiments on the GoPro and REDS datasets and compared them with other advanced methods. In addition, Peak Signal-to-Noise Ratio(PSNR) and Structural Similarity(SSIM) are used as objective evaluation metrics. The experimental results show that the proposed method outperforms all other methods in GoPro and REDS datasets and achieves 33.04dB and 0.962 respectively on the GoPro dataset and 28.70dB and 0.859 respectively on the REDS dataset under the two metrics, which are better than the PSNR and SSIM values of the conventional image deblurring method. The comparison results with SAM-deblur algorithm show that PSNR improves by 1.77dB on REDS dataset. And the comparison results with DFFA-Net(deep feature fusion attention) based on the GoPro dataset show that the proposed method improve the PSNR and SSIM by 0.49dB and 0.005, respectively. In addition, the visual results also show that the images recovered by our model are closest to the original real image, maintaining the original structure and features, and has a finer edge. Conclusion In this paper, aiming at the problems of existing image deblurring methods, we propose a novel method of multi-scale image deblurring based on prompt learning. The experimental results show that the new method can not only preserve the details of the result image, but also effectively overcome the problems of obvious artifacts and noise residue, and the result image has better performance in the objective evaluation metrics on PSNR and SSIM.

Keywords

image deblurring prompt learning multi-scale gated feed-forward network depthwise convolution

在线采编平台

论文出版

年度会议

下载中心

年度信息