面向复杂退化场景的视频逆色调映射算法

doi:10.12677/csa.2026.165194

期刊菜单

面向复杂退化场景的视频逆色调映射算法
Video Inverse Tone Mapping for Complex Degradation Scenarios

DOI: 10.12677/csa.2026.165194, PDF,
作者: 张文友, 段盼君：合肥工业大学计算机与信息学院，安徽合肥
关键词: 逆色调映射；动态范围；递归神经网络；自适应学习；Inverse Tone Mapping； Dynamic Range； Recurrent Neural Network； Adaptive Learning

摘要: 针对现有视频逆色调映射方法对降质过程先验依赖较强且泛化能力不足的问题，文章提出一种降质类型自适应的视频逆色调映射方法，以提升复杂退化场景下的高动态范围重建性能。该方法首先构建包含多种色调映射算子的训练数据集，并利用分类器对输入视频的降质特性进行预测，通过嵌入编码将类别信息引入映射过程。在此基础上，设计全局色彩变换模块实现初步动态范围扩展，引入曝光引导的空间注意力机制对过曝与欠曝区域进行细节恢复，同时采用时空特征协同对齐策略融合多帧上下文信息。实验结果表明，在所构建的包含11类退化形式的视频数据集上，所提方法在峰值信噪比、结构相似性及色度差异等指标上均优于现有主流方法。

Abstract: Existing inverse tone mapping methods heavily rely on prior knowledge of degradation processes, which limits their generalization across diverse scenarios. To address this issue, this paper proposes a degradation-type adaptive video inverse tone mapping network to improve performance under complex degradation conditions. Specifically, the proposed method constructs training data covering multiple tone mapping operators, employs a classifier to identify degradation characteristics of input videos, and embeds the predicted category information into the mapping process via an embedding encoding mechanism. Within the mapping pipeline, a global color transformation module is designed to perform initial dynamic range expansion, while an exposure-guided spatial attention mechanism is introduced to restore over-exposed and under-exposed regions. In addition, a spatio-temporal feature collaborative alignment strategy is adopted to aggregate multi-frame contextual information. Experimental results demonstrate that, on the proposed video dataset containing 11 degradation types, the proposed method outperforms existing state-of-the-art approaches in terms of peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and chrominance difference.

文章引用：张文友, 段盼君. 面向复杂退化场景的视频逆色调映射算法[J]. 计算机科学与应用, 2026, 16(5): 414-426. https://doi.org/10.12677/csa.2026.165194

参考文献

[1]	Guo, C., Fan, L., Xue, Z. and Jiang, X. (2023) Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 18-22 June 2023, 22231-22241. [Google Scholar] [CrossRef]
[2]	Banterle, F., Ledda, P., Debattista, K. and Chalmers, A. (2006) Inverse Tone Mapping. Proceedings of the 4th International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia, Kuala Lumpur, 29 November-2 December 2006, 349-356. [Google Scholar] [CrossRef]
[3]	Chen, X., Zhang, Z., Ren, J.S., Tian, L., Qiao, Y. and Dong, C. (2021) A New Journey from SDRTV to HDRTV. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 4500-4509. [Google Scholar] [CrossRef]
[4]	Chen, X., Li, Z., Zhang, Z., Ren, J.S., Liu, Y., He, J., et al. (2025) Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation. IEEE Transactions on Multimedia, 27, 8340-8354. [Google Scholar] [CrossRef]
[5]	Xu, G., Hou, Q., Zhang, L. and Cheng, M. (2022) FMNet: Frequency-Aware Modulation Network for SDR-to-HDR Translation. Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, 10-14 October 2022, 6425-6435. [Google Scholar] [CrossRef]
[6]	Xu, G., Hou, Q. and Cheng, M. (2024) Dual Frequency Transformer for Efficient SDR-to-HDR Translation. Machine Intelligence Research, 21, 538-548. [Google Scholar] [CrossRef]
[7]	He, G., Xu, K., Xu, L., Wu, C., Sun, M., Wen, X., et al. (2022) SDRTV-to-HDRTV via Hierarchical Dynamic Context Feature Mapping. Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, 10-14 October 2022, 2890-2898. [Google Scholar] [CrossRef]
[8]	Shao, T., Zhai, D., Jiang, J. and Liu, X. (2022) Hybrid Conditional Deep Inverse Tone Mapping. Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, 10-14 October 2022, 1016-1024. [Google Scholar] [CrossRef]
[9]	Huang, P., Cao, G., Zhou, F. and Qiu, G. (2023) Video Inverse Tone Mapping Network with Luma and Chroma Mapping. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, 29 October-3 November 2023, 1383-1391. [Google Scholar] [CrossRef]
[10]	Cao, G., Zhou, F., Yan, H., Wang, A. and Fan, L. (2022) KPN-MFI: A Kernel Prediction Network with Multi-Frame Interaction for Video Inverse Tone Mapping. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, 23-29 July 2022, 806-812. [Google Scholar] [CrossRef]
[11]	Yan, H., Zhang, H., Zhao, M., et al. (2025) Video Inverse Tone Mapping Network with GlobalColor Mapping and Multi-Frame Interaction. IEEE Transactions on Consumer Electronics, 71, 2930-2943.
[12]	Zhang, Y., Ni, Z., Yang, W., et al. (2026) Wavelet-Domain Masked Image Modeling for Color-Consistent HDR Video Reconstruction.
[13]	He, G., Xu, K., Xu, L., Yu, W. and Wu, X. (2025) Beyond Feature Mapping GAP: Integrating Real HDRTV Priors for Superior SDRTV-to-HDRTV Conversion. Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, Montreal, 16-22 August 2025, 1089-1097. [Google Scholar] [CrossRef]
[14]	Xu, L., Wang, S., Xu, K., Zhang, L., He, G., Wang, W., et al. (2026) RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40, 11305-11313. [Google Scholar] [CrossRef]
[15]	Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L. (2018) MobileNetV2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 4510-4520. [Google Scholar] [CrossRef]
[16]	ITU-R (2018) Image Parameter Values for High Dynamic Range Television for Use in Production and International Programme Exchange: BT.2100-2.
[17]	Hable, J. (2010) Uncharted 2: HDR Lighting. Proceedings of the Game Developers Conference, San Francisco, 56.
[18]	Reinhard, E., Stark, M., Shirley, P. and Ferwerda, J. (2023) Photographic Tone Reproduction for Digital Images. In: Seminal Graphics Papers: Pushing the Boundaries, Volume 2, ACM, 661-670. [Google Scholar] [CrossRef]
[19]	Zhang, Y., Li, Q., Qi, M., Liu, D., Kong, J. and Wang, J. (2023) Multi-Scale Frequency Separation Network for Image Deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33, 5525-5537. [Google Scholar] [CrossRef]
[20]	Zamir, S.W., Arora, A., Khan, S., Hayat, M., Khan, F.S. and Yang, M. (2022) Restormer: Efficient Transformer for High-Resolution Image Restoration. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 19-20 June 2022, 5728-5739. [Google Scholar] [CrossRef]
[21]	ITU-R (2021) Methods for Conversion of High Dynamic Range Content to Standard Dynamic Range Content and Vice-versa: BT.2446-1.
[22]	Banterle, F., Artusi, A., Debattista, K., et al. (2017) Advanced High Dynamic Range Imaging. 2nd Edition, AK Peters (CRC Press).
[23]	Drago, F., Myszkowski, K., Annen, T. and Chiba, N. (2003) Adaptive Logarithmic Mapping for Displaying High Contrast Scenes. Computer Graphics Forum, 22, 419-426. [Google Scholar] [CrossRef]
[24]	Ferwerda, J.A., Pattanaik, S.N., Shirley, P. and Greenberg, D.P. (1996) A Model of Visual Adaptation for Realistic Image Synthesis. Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, New Orleans, 4-9 August 1996, 249-258. [Google Scholar] [CrossRef]
[25]	Lischinski, D., Farbman, Z., Uyttendaele, M. and Szeliski, R. (2006) Interactive Local Adjustment of Tonal Values. ACM Transactions on Graphics, 25, 646-653. [Google Scholar] [CrossRef]
[26]	Reinhard, E., Pouli, T., Kunkel, T., Long, B., Ballestad, A. and Damberg, G. (2012) Calibrated Image Appearance Reproduction. ACM Transactions on Graphics, 31, 1-11. [Google Scholar] [CrossRef]
[27]	Shan, Q., DeRose, T. and Anderson, J. (2012) Tone Mapping High Dynamic Range Videos Using Wavelets. Pixar Technical Memo, 1, 1-20.
[28]	Larson, G.W., Rushmeier, H. and Piatko, C. (1997) A Visibility Matching Tone Reproduction Operator for High Dynamic Range Scenes. IEEE Transactions on Visualization and Computer Graphics, 3, 291-306. [Google Scholar] [CrossRef]
[29]	Kingma, D.P. and Ba, J. (2014) Adam: A Method for Stochastic Optimization.
[30]	Zhang, L. and Li, H. (2012) SR-SIM: A Fast and High Performance IQA Index Based on Spectral Residual. 2012 19th IEEE International Conference on Image Processing, Orlando, 30 September-3 October 2012, 1473-1476. [Google Scholar] [CrossRef]
[31]	ITU-R (2019) Objective Metric for the Assessment of the Potential Visibility of Colour Differences in Television: BT.2124-0.
[32]	Mantiuk, R.K., Hammou, D. and Hanji, P. (2023) HDR-VDP-3: A Multi-Metric for Predicting Image Differences, Quality and Contrast Distortions in High Dynamic Range and Regular Content.

为你推荐

友情链接