基于多级轴向加性网络的轻量级单图超分辨率
Lightweight Single Image Super-Resolution with Multi-Level Axial Additive Network
摘要: 信息技术发展日新月异,视觉信息的质量广受重视,图像超分辨率技术正因此经过了长久的迭代。但作为一个不适定问题,这项技术仍将是一个长久的难题。随着自注意力机制的出现及引入,传统卷积神经网络方法逐渐在性能上落后。然而,包含自注意力的方法通常计算成本高昂,或是只能为节约计算成本在性能上妥协。因此,本文提出了一种多级轴向加性网络,很好地平衡了性能与成本。具体来说,我们首先设计了一种多级轴向注意力模块,在注意力机制内实现了轴向窗口的模式。然后,我们提出了一种高效的加性注意力,使注意力计算免于矩阵乘法运算。同时,我们还构建了一个轻量级的超分辨率网络MLAAN。最后,我们在五个基准数据集上评估了所提出的MLAAN的效果。在与SOTA方法的对比中,MLAAN在参数量较少的前提下体现了优越的超分辨率性能。
Abstract: The importance of visual data has been increasingly emphasized due to the swift advancement of information technology nowadays. As an ill-posed problem, Single Image Super-Resolution continues to present an enduring challenge even after years of progression. Massive self-attention based methods proposed have shown performance exceeding traditional Convolutional Neural Networks based methods. However, methods including self-attention either suffer from large computational cost, or have to compromise on the weakened ability on capturing information thanks to modification on attention. We propose a Multi-Level Axial Additive Network with well-balanced trade-off in this work. Specifically, we first elaborate a Multi-Level Axial Attention Block enabling axial window patterns within attention. Then we present an effective additive attention that eliminates the need for expensive matrix multiplication operations in attention. We also construct a Feature Extraction Module base on shift-convolution to extract local features. We evaluate the efficacy of our proposed MLAAN on five benchmark datasets and show that it significantly enhances the super-resolution performance of the network. Our experimental results demonstrate state-of-the-art performance in lightweight SISR while using a low number of parameters.
文章引用:邹观哲, 黄可言. 基于多级轴向加性网络的轻量级单图超分辨率[J]. 应用数学进展, 2024, 13(4): 1842-1852. https://doi.org/10.12677/aam.2024.134173

参考文献

[1] 周登文, 李文斌, 李金新, 黄志勇. 一种轻量级的多尺度通道注意图像超分辨率重建网络[J]. 电子学报, 2022(10): 2336-2346.
[2] 江明. 基于深度学习的图像超分辨率轻量级算法研究[D]: [硕士学位论文]. 赣州: 江西理工大学, 2022.
https://link.cnki.net/doi/10.27176/d.cnki.gnfyc.2022.000740
[3] Dong, C., Loy, C.C., He, K. and Tang, X. (2015) Image Super-Resolution Using Deep Convolutional Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, 295-307. [Google Scholar] [CrossRef
[4] Zhang, Y., Tian, Y., Kong, Y., Zhong, B. and Fu, Y. (2018) Residual Dense Network for Image Super-Resolution. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 2472-2481. [Google Scholar] [CrossRef
[5] Tai, Y., Yang, J. and Liu, X. (2017) Image Super-Resolution via Deep Recursive Residual Network. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 21-26 July 2017, 3147-3155. [Google Scholar] [CrossRef
[6] Ahn, N., Kang, B. and Sohn, K.A. (2018) Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network. Computer Vision -ECCV 2018, Munich, 256-272. [Google Scholar] [CrossRef
[7] Hui, Z., Gao, X., Yang, Y. and Wang, X. (2019) Lightweight Image Super-Resolution with Information Multi-Distillation Network. Proceedings of the 27th ACM International Conference on Multimedia, Nice, 21-25 October 2019, 2024-2032. [Google Scholar] [CrossRef
[8] Liu, J., Tang, J. and Wu, G. (2020) Residual Feature Distillation Network for Lightweight Image Super-Resolution. Computer Vision-ECCV 2020 Workshops, Glasgow, UK, 23-28 August 2020, 41-55. [Google Scholar] [CrossRef
[9] Li, Z., Liu, Y., Chen, X., Cai, H., Gu, J., Qiao, Y. and Dong, C. (2022) Blueprint Separable Residual Network for Efficient Image Super-Resolution. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, 19-20 June 2022, 833-843. [Google Scholar] [CrossRef
[10] Liang, J., Cao, J., Sun, G., Zhang, K., Van Gool, L. and Timofte, R. (2021) Swinir: Image Restoration Using Swin Transformer. 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, 11-17 October 2021, 1833-1844. [Google Scholar] [CrossRef
[11] Lu, Z., Li, J., Liu, H., Huang, C., Zhang, L. and Zeng, T. (2022) Transformer for Single Image Super-Resolution. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, 19-20 June 2022, 457-466. [Google Scholar] [CrossRef
[12] Zhang, X., Zeng, H., Guo, S. and Zhang, L. (2022) Efficient Long-Range Attention Network for Image Super-Resolution. Computer Vision - ECCV 2022, Tel Aviv, 649-667. [Google Scholar] [CrossRef