|
[1]
|
Ha, Q., Watanabe, K., Karasawa, T., Ushiku, Y. and Harada, T. (2017) MFNet: Towards Real-Time Semantic Segmentation for Autonomous Vehicles with Multi-Spectral Scenes. 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, 24-28 September 2017, 5108-5115. [Google Scholar] [CrossRef]
|
|
[2]
|
Deng, F., Feng, H., Liang, M., Wang, H., Yang, Y., Gao, Y., et al. (2021) FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-Time Semantic Segmentation. 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, 27 September-1 October 2021, 4467-4473. [Google Scholar] [CrossRef]
|
|
[3]
|
Liang, M., Hu, J., Bao, C., Feng, H., Deng, F. and Lam, T.L. (2023) Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks. IEEE Robotics and Automation Letters, 8, 4060-4067. [Google Scholar] [CrossRef]
|
|
[4]
|
Tang, L., Yuan, J., Zhang, H., Jiang, X. and Ma, J. (2022) PIAFusion: A Progressive Infrared and Visible Image Fusion Network Based on Illumination Aware. Information Fusion, 83, 79-92. [Google Scholar] [CrossRef]
|
|
[5]
|
Sun, Y., Zuo, W. and Liu, M. (2019) RTFNet: RGB-Thermal Fusion Network for Semantic Segmentation of Urban Scenes. IEEE Robotics and Automation Letters, 4, 2576-2583. [Google Scholar] [CrossRef]
|
|
[6]
|
Sun, Y., Zuo, W., Yun, P., Wang, H. and Liu, M. (2021) FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion. IEEE Transactions on Automation Science and Engineering, 18, 1000-1011. [Google Scholar] [CrossRef]
|
|
[7]
|
Zhang, Q., Zhao, S., Luo, Y., Zhang, D., Huang, N. and Han, J. (2021) ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 2633-2642. [Google Scholar] [CrossRef]
|
|
[8]
|
Dong, S., Zhou, W., Xu, C. and Yan, W. (2024) EGFNet: Edge-Aware Guidance Fusion Network for RGB-Thermal Urban Scene Parsing. IEEE Transactions on Intelligent Transportation Systems, 25, 657-669. [Google Scholar] [CrossRef]
|
|
[9]
|
Zhou, W., Liu, J., Lei, J., Yu, L. and Hwang, J. (2021) GMNet: Graded-Feature Multilabel-Learning Network for RGB-Thermal Urban Scene Semantic Segmentation. IEEE Transactions on Image Processing, 30, 7790-7802. [Google Scholar] [CrossRef] [PubMed]
|
|
[10]
|
Zhou, W., Lin, X., Lei, J., Yu, L. and Hwang, J. (2022) MFFENet: Multiscale Feature Fusion and Enhancement Network for Rgb-Thermal Urban Road Scene Parsing. IEEE Transactions on Multimedia, 24, 2526-2538. [Google Scholar] [CrossRef]
|
|
[11]
|
Li, J., Yun, P., Chen, Q. and Fan, R. (2024) HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion. arXiv: 2404.03527.
|
|
[12]
|
Zhang, J., Liu, H., Yang, K., Hu, X., Liu, R. and Stiefelhagen, R. (2023) CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers. IEEE Transactions on Intelligent Transportation Systems, 24, 14679-14694. [Google Scholar] [CrossRef]
|
|
[13]
|
Zhang, J., Liu, R., Shi, H., Yang, K., Reiß, S., Peng, K., et al. (2023) Delivering Arbitrary-Modal Semantic Segmentation. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 1136-1147. [Google Scholar] [CrossRef]
|
|
[14]
|
Li, B., Zhang, D., Zhao, Z., Gao, J. and Li, X. (2025) StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation. Proceedings of the 33rd ACM International Conference on Multimedia, Dublin, 27-31 October 2025, 1308-1317. [Google Scholar] [CrossRef]
|
|
[15]
|
Shin, U., Lee, K., Kweon, I.S. and Oh, J. (2024) Complementary Random Masking for RGB-Thermal Semantic Segmentation. 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, 13-17 May 2024, 11110-11117. [Google Scholar] [CrossRef]
|
|
[16]
|
Zheng, X., Xue, H., Chen, J., Yan, Y., Jiang, L., Lyu, Y., Yang, K., Zhang, L. and Hu, X. (2024) Learning Robust Anymodal Segmentor with Unimodal and Cross-Modal Distillation. arXiv: 2411.17141.
|
|
[17]
|
Reza, M.K., Prater-Bennette, A. and Asif, M.S. (2025) Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47, 742-754. [Google Scholar] [CrossRef] [PubMed]
|
|
[18]
|
Liu, J., Liu, Z., Wu, G., Ma, L., Liu, R., Zhong, W., et al. (2023) Multi-Interactive Feature Learning and a Full-Time Multi-Modality Benchmark for Image Fusion and Segmentation. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 1-6 October 2023, 8081-8090. [Google Scholar] [CrossRef]
|
|
[19]
|
Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M. and Luo, P. (2021) SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. Advances in Neural Information Processing Systems, 34, 12077-12090.
|
|
[20]
|
Hazirbas, C., Ma, L., Domokos, C. and Cremers, D. (2017) FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-Based CNN Architecture. In: Lai, S.H., Lepetit, V., Nishino, K. and Sato, Y., Eds., Computer Vision—ACCV 2016, Springer, 213-228. [Google Scholar] [CrossRef]
|
|
[21]
|
Sun, Y., Dong, W., Wang, S., Wu, P., Feng, M., Li, X., et al. (2025) Distilling Hierarchical Knowledge from Multimodal Fusion for Unimodal Image Segmentation. IEEE Transactions on Circuits and Systems for Video Technology, 35, 11797-11809. [Google Scholar] [CrossRef]
|
|
[22]
|
Lin, B., Lin, Z., Guo, Y., Zhang, Y., Zou, J. and Fan, S. (2023) Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation. arXiv: 2307.08536.
|