基于深度学习的无线图像跨层优化传输方法综述
Review of Deep Learning-Driven Cross-Layer Optimization Methods Wireless Image Transmission
DOI: 10.12677/csa.2026.164129, PDF,   
作者: 叶兴涛, 宋佳铭:赣南师范大学数学与计算机科学学院,江西 赣州;王 敏*:赣南师范大学智能制造与未来能源学院,江西 赣州;江西省教育厅数据科学与人工智能重点实验室,江西 赣州
关键词: 无线图像传输跨层优化深度学习自适应机制Wireless Image Transmission Cross-Layer Optimization Deep Learning Adaptive Transmission
摘要: 随着5G/6G技术的快速发展,高清图像、虚拟现实等新兴应用对无线图像传输的速率、实时性与可靠性提出了更高要求。然而,无线信道的时变特性使传统分离式的信源与信道编码方法在恶劣环境下性能急剧下降。为此,文章综述了基于深度学习的无线图像跨层优化传输方法的研究进展,系统回顾了从传统分离传输到端到端跨层联合优化的演变过程,重点介绍了自适应机制在无线图像跨层优化传输方法中的应用。同时,也指出了当前研究在更加复杂信道、实际部署以及其他传输场景等方面面临的挑战。
Abstract: With the rapid development of 5G/6G technologies, emerging applications such as high-definition images and virtual reality have imposed higher requirements for the rate, real-time performance, and reliability of wireless image transmission. However, the time-varying characteristics of wireless channels cause the performance of traditional separated source and channel coding methods to drop sharply in harsh environments. Therefore, this article reviews the research progress of wireless image cross-layer optimization transmission methods based on deep learning, systematically examines the evolution process from traditional separated transmission to end-to-end joint optimization, and focuses on introducing the application of adaptive mechanisms in wireless image cross-layer optimization transmission methods. At the same time, it also points out the challenges faced by current research in more complex channels, actual deployment, and other transmission scenarios.
文章引用:叶兴涛, 宋佳铭, 王敏. 基于深度学习的无线图像跨层优化传输方法综述[J]. 计算机科学与应用, 2026, 16(4): 274-286. https://doi.org/10.12677/csa.2026.164129

参考文献

[1] Wang, J., Al-Khalidi, A., Ahearne, S. and Wasige, E. (2021) 1080P HD Video Transmission Using RTD Transmitter. 2021 14th UK-Europe-China Workshop on Millimetre-Waves and Terahertz Technologies (UCMMT), Lancaster, 13-15 September 2021, 1-3. [Google Scholar] [CrossRef
[2] Chaccour, C., Soorki, M.N., Saad, W., Bennis, M. and Popovski, P. (2022) Can Terahertz Provide High-Rate Reliable Low-Latency Communications for Wireless VR? IEEE Internet of Things Journal, 9, 9712-9729. [Google Scholar] [CrossRef
[3] Zeng, T., Semiari, O., Saad, W. and Bennis, M. (2019) Joint Communication and Control for Wireless Autonomous Vehicular Platoon Systems. IEEE Transactions on Communications, 67, 7907-7922. [Google Scholar] [CrossRef
[4] Wang, M., Wu, J., Shi, S.F., Luo, C. and Wu, F. (2012) Fast Decoding and Hardware Design for Binary-Input Compressive Sensing. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2, 591-603. [Google Scholar] [CrossRef
[5] Wallace, G.K. (1992) The JPEG Still Picture Compression Standard. IEEE Transactions on Consumer Electronics, 38, 18-36. [Google Scholar] [CrossRef
[6] Wiegand, T., Sullivan, G.J., Bjontegaard, G. and Luthra, A. (2003) Overview of the H.264/AVC Video Coding Standard. IEEE Transactions on Circuits and Systems for Video Technology, 13, 560-576. [Google Scholar] [CrossRef
[7] Sullivan, G.J., Ohm, J., Han, W. and Wiegand, T. (2012) Overview of the High Efficiency Video Coding (HEVC) Standard. IEEE Transactions on Circuits and Systems for Video Technology, 22, 1649-1668. [Google Scholar] [CrossRef
[8] Shannon, C.E. (2001) A Mathematical Theory of Communication. ACM SIGMOBILE Mobile Computing and Communications Review, 5, 3-55. [Google Scholar] [CrossRef
[9] Ninkovic, V., Vukobratovic, D., Hager, C., et al. (2021) Autoencoder-Based Unequal Error Protection Codes. IEEE Communications Letters, 25, 3575-3579. [Google Scholar] [CrossRef
[10] 朱勇刚, 陈亮, 朱义勇. 一种基于鲁棒矢量量化的信源-信道联合编码算法[J]. 解放军理工大学学报(自然科学版), 2005, 6(2): 118-122.
[11] Cui, H., Xiong, R.Q., Luo, C., Song, Z.H. and Wu, F. (2015) Denoising and Resource Allocation in Uncoded Video Transmission. IEEE Journal of Selected Topics in Signal Processing, 9, 102-112. [Google Scholar] [CrossRef
[12] Bourtsoulatze, E., Burth Kurka, D. and Gunduz, D. (2019) Deep Joint Source-Channel Coding for Wireless Image Transmission. IEEE Transactions on Cognitive Communications and Networking, 5, 567-579. [Google Scholar] [CrossRef
[13] Ding, M., Li, J., Ma, M. and Fan, X. (2021) SNR-Adaptive Deep Joint Source-Channel Coding for Wireless Image Transmission. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, 06-11 June 2021, 1555-1559. [Google Scholar] [CrossRef
[14] Xu, J., Ai, B., Chen, W., Yang, A., Sun, P. and Rodrigues, M. (2022) Wireless Image Transmission Using Deep Source Channel Coding with Attention Modules. IEEE Transactions on Circuits and Systems for Video Technology, 32, 2315-2328. [Google Scholar] [CrossRef
[15] Chen, W., Chen, Y., Yang, Q., Huang, C., Wang, Q. and Zhang, Z. (2023) Deep Joint Source-Channel Coding for Wireless Image Transmission with Entropy-Aware Adaptive Rate Control. 2023 IEEE Global Communications Conference, Kuala Lumpur, 4-8 December 2023, 2239-2244. [Google Scholar] [CrossRef
[16] Kurka, D.B. and Gunduz, D. (2021) Bandwidth-Agile Image Transmission with Deep Joint Source-Channel Coding. IEEE Transactions on Wireless Communications, 20, 8081-8095. [Google Scholar] [CrossRef
[17] Yuan, H., Xu, W., Wang, Y. and Wang, X. (2024) Channel-Blind Joint Source-Channel Coding for Wireless Image Transmission. Sensors, 24, Article 4005. [Google Scholar] [CrossRef] [PubMed]
[18] Li, H., Xia, G., Yu, C., Zhang, Y. and Li, H. (2024) Adaptive Wireless Image Transmission Transformer Architecture for Image Transmission and Reconstruction. Sensors, 24, Article 6772. [Google Scholar] [CrossRef] [PubMed]
[19] Yang, K., Wang, S., Dai, J., Qin, X., Niu, K. and Zhang, P. (2025) SwinJSCC: Taming Swin Transformer for Deep Joint Source-Channel Coding. IEEE Transactions on Cognitive Communications and Networking, 11, 90-104. [Google Scholar] [CrossRef
[20] Raha, A.D., Adhikary, A., Gain, M., Park, Y., Saad, W. and Hong, C.S. (2025). DD-JSCC: Dynamic Deep Joint Source-Channel Coding for Semantic Communications. IEEE International Conference on Communications, Montreal, 8-12 June 2025, 3754-3759.[CrossRef
[21] Li, H., Zhang, G., Zhou, K., et al. (2024) Coarse-to-Fine: A Dual-Phase Channel-Adaptive Method for Wireless Image Transmission.
https://arxiv.org/abs/2412.08211
[22] Salim, M.M., Abdalzaher, M.S., Muqaibel, A.H., et al. (2025) SNR-Aware Semantic Image Transmission with Deep Learning-Based Channel Estimation in Fading Channels.
https://arxiv.org/abs/2504.20557
[23] Wu, H., Shao, Y., Mikolajczyk, K. and Gunduz, D. (2022) Channel-Adaptive Wireless Image Transmission with OFDM. IEEE Wireless Communications Letters, 11, 2400-2404. [Google Scholar] [CrossRef
[24] Guo, Y., Chen, W., Xu, J., Li, L. and Ai, B. (2025) Deep Joint CSI Feedback and Multiuser Precoding for MIMO OFDM Systems. IEEE Transactions on Vehicular Technology, 74, 1730-1735. [Google Scholar] [CrossRef
[25] Chen, W., Yang, Q., Chen, Y., et al. (2025) Semantic Communication with Entropy-and-Channel-Adaptive Rate Control over Multi-User MIMO Fading Channels.
https://arxiv.org/abs/2501.15414
[26] Zhang, G., Hu, Q., Cai, Y., et al. (2023) Adaptive CSI Feedback for Deep Learning-Enabled Image Transmission. IEEE International Conference on Communications, 74, 1730-1735.
[27] 穆天杰, 陈晓辉, 汪逸云, 等. 基于深度学习的信源信道联合编码方法综述[J]. 电信科学, 2020, 36(10): 56-66.
[28] Secker, A. and Taubman, D. (2003) Lifting-Based Invertible Motion Adaptive Transform (LIMAT) Framework for Highly Scalable Video Compression. IEEE Transactions on Image Processing, 12, 1530-1542. [Google Scholar] [CrossRef] [PubMed]
[29] Song, Y., Zhu, X., Fang, Y. and Zhang, H. (2010) Threshold Optimization for Rate Adaptation Algorithms in IEEE 802.11 WLANs. IEEE Transactions on Wireless Communications, 9, 318-327. [Google Scholar] [CrossRef
[30] Ballé, J., Laparra, V. and Simoncelli, E.P. (2025) End-to-End Optimized Image Compression.
https://arxiv.org/abs/1611.01704
[31] Ballé, J., Minnen, D., Singh, S., et al. (2018) Variational Image Compression with a Scale Hyperprior. arXiv:1802.01436.
https://arxiv.org/abs/1802.01436
[32] Minnen, D., Balle, J. and Toderici, G. (2018) Joint Autoregressive and Hierarchical Priors for Learned Image Compression.
https://arxiv.org/abs/1809.02736
[33] Toderici, G., O’malley, S.M., Hwang, S.J., et al. (2016) Variable Rate Image Compression with Recurrent Neural Networks.
https://arxiv.org/abs/1809.02736
[34] Toderici, G., Vincent, D., Johnston, N., et al. (2017) Full Resolution Image Compression with Recurrent Neural Networks.
https://arxiv.org/abs/1511.06085
[35] Agustsson, E., Tschannen, M., Mentzer, F., Timofte, R. and Van Gool, L. (2019) Generative Adversarial Networks for Extreme Learned Image Compression. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 221-231. [Google Scholar] [CrossRef
[36] Lu, G., Ouyang, W., Xu, D., et al. (2018) DVC: An End-to-End Deep Video Compression Framework. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 4569-4585.
[37] Felix, A., Cammerer, S., Dorner, S., Hoydis, J. and Ten Brink, S. (2018) OFDM-Autoencoder for End-to-End Learning of Communications Systems. 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Kalamata, 25-28 June 2018, 1-5. [Google Scholar] [CrossRef
[38] Tung, T. and Gunduz, D. (2022) Deepwive: Deep-Learning-Aided Wireless Video Transmission. IEEE Journal on Selected Areas in Communications, 40, 2570-2583. [Google Scholar] [CrossRef
[39] Gündüz, D., Wigger, M.A., Tung, T., Zhang, P. and Xiao, Y. (2025) Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs. Proceedings of the IEEE, 113, 888-919. [Google Scholar] [CrossRef
[40] Kurka, D.B. and Gunduz, D. (2020) DeepJSCC-f: Deep Joint Source-Channel Coding of Images with Feedback. IEEE Journal on Selected Areas in Information Theory, 1, 178-193. [Google Scholar] [CrossRef
[41] Yang, M., Bian, C. and Kim, H. (2021) Deep Joint Source Channel Coding for Wireless Image Transmission with OFDM. 2021 IEEE International Conference on Communications, Montreal, 14-23 June 2021, 1-6. [Google Scholar] [CrossRef
[42] Wu, H., Shao, Y., Bian, C., Mikolajczyk, K. and Gündüz, D. (2024) Deep Joint Source-Channel Coding for Adaptive Image Transmission over MIMO Channels. IEEE Transactions on Wireless Communications, 23, 15002-15017. [Google Scholar] [CrossRef
[43] Jiang, W., Chen, W. and Ai, B. (2024) Deep Joint Source Channel Coding with Attention Modules over MIMO Channels. 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore, 24-27 June 2024, 1-5. [Google Scholar] [CrossRef
[44] Bian, C., Shao, Y. and Gündüz, D. (2023). Deepjscc-1++: Robust and Bandwidth-Adaptive Wireless Image Transmission. 2023 IEEE Global Communications Conference, Kuala Lumpur, 4-8 December 2023, 3148-3154.[CrossRef
[45] Yang, K., Wang, S., Dai, J., Tan, K., Niu, K. and Zhang, P. (2023) WITT: A Wireless Image Transmission Transformer for Semantic Communications. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 4-10 June 2023, 1-5. [Google Scholar] [CrossRef
[46] Wu, T., Chen, Z., Tao, M., Xu, X., Zhang, W. and Zhang, P. (2024) Mambajscc: Deep Joint Source-Channel Coding with Visual State Space Model. 2024 IEEE Global Communications Conference, Cape Town, 8-12 December 2024, 1677-1682. [Google Scholar] [CrossRef
[47] Wu, H., Shao, Y., Ozfatura, E., Mikolajczyk, K. and Gündüz, D. (2024) Transformer-Aided Wireless Image Transmission with Channel Feedback. IEEE Transactions on Wireless Communications, 23, 11904-11919. [Google Scholar] [CrossRef