基于深度残差生成对抗网络的超分辨率重建算法研究
Research on Super-Resolution Reconstruction Algorithms Based on Deep Residual Generative Adversarial Networks
DOI: 10.12677/csa.2024.145112, PDF,    科研立项经费支持
作者: 王刘胜, 由从哲*:江苏理工学院计算机工程学院,江苏 常州
关键词: 超分辨率重建生成对抗网络深度残差网络Super-Resolution Reconstruction Generative Adversarial Networks Deep Residual Networks
摘要: 图像超分辨重建是计算机视觉领域中的重要研究方向之一。本文主要针对图像超分辨率在重建过程中信息恢复不充分、特征提取不全面、高频细节不明显等问题,在SRGAN的基础上提出一种基于深度残差生成对抗网络的图像超分辨率算法。该算法实现了一种深度残差的结构(Depth-ResNet),即残差中的残差,来形成非常深的网络。该结构由几个具有长跳跃连接的残差组组成,每个残差组中也包含一些具有短跳跃连接的残差块。除此之外,Depth-ResNet允许通过多个跳跃连接绕过丰富的低频信息,使主网络专注于学习高频信息,并且随着Depth-ResNet的数量与深度的调整变化,图像重建效果将取得更好的准确性与视觉改进。此外,为了平衡Depth-ResNet在生成器上的卓越性能,本文在生成器的损失函数上采用了Charbonnier损失函数与对抗损失函数,并优化了判别器的结构。根据大量实验表明,重建的图像在清晰度、高频细节等方面都有一定的提高。
Abstract: Image super-resolution reconstruction is one of the significant research directions within the field of computer vision. This paper particularly addresses issues such as insufficient information recovery, incomplete feature extraction, and indistinct high-frequency details during the image super-resolution process by proposing, on the basis of SRGAN, a novel deep residual generative adversarial network-based image super-resolution algorithm. The proposed algorithm implements a Depth-Residual Network (Depth-ResNet) architecture, characterized by residuals within residuals, to construct an extremely deep network. This architecture comprises several residual groups interconnected with long skip connections, where each residual group further consists of several residual blocks linked via short skip connections. Significantly, the Depth-ResNet design allows for multiple skip connections bypassing rich low-frequency information, enabling the main network to focus on learning high-frequency information effectively. As the number and depth of the Depth-ResNet modules are varied and adjusted, the reconstructed images exhibit improved accuracy and visually enhanced outcomes. Moreover, in order to balance the outstanding performance of the Depth-ResNet in the generator, this study adopts the Charbonnier loss function along with the adversarial loss function in the generator’s objective function, while also optimizing the structure of the discriminator. Extensive experimentation demonstrates that the reconstructed images exhibit noticeable improvements in terms of clarity and high-frequency detail preservation.
文章引用:王刘胜, 由从哲. 基于深度残差生成对抗网络的超分辨率重建算法研究[J]. 计算机科学与应用, 2024, 14(5): 33-47. https://doi.org/10.12677/csa.2024.145112

参考文献

[1] Greenspn, H. (2008) Super-Resolution in Medical Imaging. The Computer Journal, 52, 43-63. [Google Scholar] [CrossRef
[2] Su, J.M. and Yang, L.X. (2019) Super Resolution of Single Frame Remote Sensing Image Based on Generative Adversarial Nets. Computer Engineering and Applications, 55, 202-207.
[3] Zou, W.W. and Yuen, P.C. (2012) Very Low Resolution Face Recognition Problem. IEEE Transactions on Image Processing, 21, 327-340. [Google Scholar] [CrossRef
[4] Sajjadi, M.S., Schölkopf, B. and Hirsch, M. (2017) EnhanceNet: Single Image Super-Resolution through Automated Texture Synthesis. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 4501-4510. [Google Scholar] [CrossRef
[5] Dai, D., Wang, Y., Chen, Y. and Van Gool, L. (2016) Is Image Super-Resolution Helpful for Other Vision Tasks? 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, 7-10 March 2016, 1-9. [Google Scholar] [CrossRef
[6] Keys, R. (1981) Cubic Convolution Interpolation for Digital Image Processing. IEEE Transactions on Acoustics, Speech, and Signal Processing, 29, 1153-1160. [Google Scholar] [CrossRef
[7] Freedman, G. and Fattal, R. (2011) Image and Video Upscaling from Local Self-Examples. ACM Transactions on Graphics, 30, Article No. 12. [Google Scholar] [CrossRef
[8] Freeman, W.T., Jones, T.R. and Pasztor, E.C. (2002) Example-Based Super-Resolution. IEEE Computer Graphics and Applications, 22, 56-65. [Google Scholar] [CrossRef
[9] Yang, J., Wright, J., Huang, T.S. and Ma, Y. (2010) Image Super-Resolution via Sparse Representation. IEEE Transactions on Image Processing, 19, 2861-2873. [Google Scholar] [CrossRef
[10] Kim, K.I. and Kwon, Y. (2010) Single-Image Super-Resolution Using Sparse Regression and Natural Image Prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32, 1127-1133. [Google Scholar] [CrossRef
[11] Dong, C., Loy, C.C., He, K. and Tang, X. (2014) Learning a Deep Convolutional Network for Image Super-Resolution. 13th European Conference Computer Vision-ECCV 2014, Zurich, 6-12 September 2014, 184-199. [Google Scholar] [CrossRef
[12] Kim, J., Kwon Lee, J. and Mu Lee, K. (2016) Accurate Image Super-Resolution Using Very Deep Convolutional Networks. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 1646-1654. [Google Scholar] [CrossRef
[13] Kim, J., Kwon Lee, J. and Mu Lee, K. (2016) Deeply-Recursive Convolutional Network for Image Super-Resolution. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 1637-1645. [Google Scholar] [CrossRef
[14] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[15] Lim, B., Son, S., Kim, H., Nah, S. and Lee, K.M. (2017) Enhanced Deep Residual Networks for Single Image Super-Resolution. 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Honolulu, 21-26 July 2017, 1132-1140. [Google Scholar] [CrossRef
[16] Goodfellow, I.J., Mirza, M., et al. (2014) Generative Adversarial Nets. Advances in Neural Information Processing Systems, Vol. 27, 2672-2680.
[17] Ledig, C., Theis, L., Huszár, F., et al. (2017) Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 4681-4690. [Google Scholar] [CrossRef
[18] Nah, S., Kim, T.H. and Lee, K.M. (2016) Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 257-265. [Google Scholar] [CrossRef
[19] Lai, W.S., Huang, J.B., Ahuja, N., et al. (2017) Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 624-632. [Google Scholar] [CrossRef