基于双分支Transformer与动态图增强的多曝光图像融合方法
A Multi-Exposure Image Fusion Method Based on Dual-Branch Transformer and Dynamic Graph Enhancement
摘要: 多曝光图像融合(MEF)的目的是将不同曝光度的图像融合为一张细节清晰且亮度均衡的结果图像。针对现有方法在亮度调节不足、结构保持有限及多尺度特征建模不充分等问题,本文提出一种基于双分支Transformer与动态图增强的多曝光图像融合方法。通过双分支Transformer架构实现局部与全局特征的互补建模,有效兼顾细节清晰度与结构一致性的同时引入动态图增强机制,自适应捕捉跨曝光图像间的动态依赖关系,克服了传统静态融合的局限,进一步结合亮度调节与边缘保持策略,使得结果图像在视觉自然性与感知层次方面显著提升。在SICE、MEFB等公开数据集上的实验结果表明,本方法在主观感知与客观指标方面均优于现有方法,展现出良好的鲁棒性与视觉表现。
Abstract: Multi-exposure image fusion (MEF) aims to merge images with different exposure levels into a single result image featuring clear details and balanced brightness. Addressing limitations in existing methods—such as inadequate brightness adjustment, limited structural preservation, and insufficient multi-scale feature modeling—this paper proposes a multi-exposure image fusion approach based on a dual-branch Transformer and dynamic image enhancement. The dual-branch Transformer architecture enables complementary modeling of local and global features, effectively balancing detail clarity and structural consistency. Simultaneously, the introduction of a dynamic image enhancement mechanism adaptively captures dynamic dependencies across exposure images, overcoming the limitations of traditional static fusion. Furthermore, the integration of brightness adjustment and edge preservation strategies significantly enhances the visual naturalness and perceptual quality of the resulting images. Experimental results on public datasets such as SICE and MEFB demonstrate that our method outperforms existing approaches in both subjective perception and objective metrics, exhibiting robust performance and superior visual quality.
文章引用:张志超, 李玉, 祁艳杰. 基于双分支Transformer与动态图增强的多曝光图像融合方法[J]. 图像与信号处理, 2026, 15(1): 1-14. https://doi.org/10.12677/jisp.2026.151001

参考文献

[1] Mertens, T., Kautz, J. and Van Reeth, F. (2007) Exposure Fusion. 15th Pacific Conference on Computer Graphics and Applications (PG’07), Maui, 29 October-2 November 2007, 382-390. [Google Scholar] [CrossRef
[2] Liu, Y., Chen, X. and Ward, R.K. (2011) Image Fusion with Convolutional Sparse Representation. IEEE Signal Processing Letters, 20, 449-452.
[3] Ma, Y., Ma, J., Zong, H., Huang, J. and Mei, X. (2021) Multi-Exposure Image Fusion via a Deep Sparse Denoising Network. Information Fusion, 76, 48-64.
[4] Wang, X., Zhang, Y., Yang, J. and Zhang, L. (2022) MEF-Net: A Memory-Enhanced Fusion Network for Multi-Exposure Image Fusion. IEEE Transactions on Circuits and Systems for Video Technology, 33, 763-777.
[5] Qu, L., Liu, S., Wang, M. and Song, Z. (2022) Transmef: A Transformer-Based Multi-Exposure Image Fusion Framework Using Self-Supervised Multi-Task Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36, 2126-2134. [Google Scholar] [CrossRef
[6] Xu, H., Haochen, L. and Ma, J. (2023) Unsupervised Multi-Exposure Image Fusion Breaking Exposure Limits via Contrastive Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 37, 3010-3017. [Google Scholar] [CrossRef
[7] Mu, P., Du, Z., Liu, J. and Bai, C. (2023) Little Strokes Fell Great Oaks: Boosting the Hierarchical Features for Multi-Exposure Image Fusion. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, 29 October-3 November 2023, 2985-2993. [Google Scholar] [CrossRef
[8] Xu, H., Liu, H., He, X., Chen, Y. and Li, C. (2022) Deep Multiscale Fusion Network for Multi-Exposure Image Fusion. Neurocomputing, 507, 91-104.
[9] Xu, Y., Liu, Z., Wu, X., Chen, W., Wen, C. and Li, Z. (2022) Deep Joint Demosaicing and High Dynamic Range Imaging within a Single Shot. IEEE Transactions on Circuits and Systems for Video Technology, 32, 4255-4270. [Google Scholar] [CrossRef
[10] Wang, J., Wei, Z., Zhang, T., et al. (2016) Deeply-Fused Nets.
https://arxiv.org/abs/1605.07716
[11] Ma, K., Li, H., Yong, H., Wang, Z., Meng, D. and Zhang, L. (2017) Robust Multi-Exposure Image Fusion: A Structural Patch Decomposition Approach. IEEE Transactions on Image Processing, 26, 2519-2532. [Google Scholar] [CrossRef] [PubMed]
[12] Ma, K., Duanmu, Z., Zhu, H., Fang, Y. and Wang, Z. (2019) Deep Guided Learning for Fast Multi-Exposure Image Fusion. IEEE Transactions on Image Processing, 29, 2808-2819. [Google Scholar] [CrossRef] [PubMed]
[13] Xu, H., Ma, J., Jiang, J., Guo, X. and Ling, H. (2019) U2Fusion: A Unified Unsupervised Image Fusion Network. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 502-518. [Google Scholar] [CrossRef] [PubMed]
[14] Zhang, Y., Liu, Y., Sun, P., Yan, H., Zhao, X. and Zhang, L. (2020) IFCNN: A General Image Fusion Framework Based on Convolutional Neural Network. Information Fusion, 54, 99-118. [Google Scholar] [CrossRef
[15] Liu, J., Shang, J., Liu, R. and Fan, X. (2022) Attention-Guided Global-Local Adversarial Learning for Detail-Preserving Multi-Exposure Image Fusion. IEEE Transactions on Circuits and Systems for Video Technology, 32, 5026-5040. [Google Scholar] [CrossRef
[16] Li, H., Ma, K., Yong, H. and Zhang, L. (2020) Fast Multi-Scale Structural Patch Decomposition for Multi-Exposure Image Fusion. IEEE Transactions on Image Processing, 29, 5805-5816. [Google Scholar] [CrossRef] [PubMed]
[17] Han, D., Li, L., Guo, X. and Ma, J. (2022) Multi-Exposure Image Fusion via Deep Perceptual Enhancement. Information Fusion, 79, 248-262. [Google Scholar] [CrossRef