多模态影像融合在神经系统疾病诊断中的 应用进展——基于人工智能的技术范式与临床转化
Advances in the Application of Multi-Modal Image Fusion in the Diagnosis of Neurological Diseases—AI-Based Technical Paradigms and Clinical Translation
DOI: 10.12677/acm.2026.1641356, PDF,   
作者: 刘子毅:延安大学延安医学院,陕西 延安;延安大学附属医院放射科,陕西 延安
关键词: 多模态融合神经影像人工智能深度学习阿尔茨海默病可解释性Multi-Modal Fusion Neuroimaging Artificial Intelligence Deep Learning Alzheimer’s Disease Explainability
摘要: 神经系统疾病具有病理机制复杂、临床异质性显著、早期隐匿性强等特点,传统单一模态影像学检查难以全面反映疾病的整体病理生理过程。多模态影像融合技术通过整合结构、功能、分子等多维度成像信息,突破单一模态的局限性,实现从“形态学描述”向“系统生物学表征”的跨越式发展。近年来,随着大规模多模态神经影像数据库的建立和人工智能算法的快速发展,基于深度学习的多模态融合策略在阿尔茨海默病、帕金森病、脑卒中、自闭症谱系障碍等疾病的早期诊断、进展预测和个体化治疗决策中展现出显著优势。融合范式从传统的早期、中期、晚期融合向基于正交补偿、跨层引导、注意力机制等高级交互模式演进;可解释人工智能技术的引入为破解“黑箱”困境、提升临床信任度提供了有效路径。本文系统综述多模态影像融合技术的最新进展,阐述MRI-PET、结构–功能成像、影像–电生理等多模态组合在神经系统疾病评估中的应用现状,分析数据异质性、模态缺失、模型泛化能力等核心挑战,并展望多模态人工智能在神经疾病精准医疗中的发展前景。
Abstract: Neurological diseases are characterized by complex pathological mechanisms, significant clinical heterogeneity, and strong early concealment. Traditional single-modal imaging examinations are difficult to fully reflect the overall pathophysiological process of the diseases. Multi-modal image fusion technology breaks through the limitations of single modality by integrating multi-dimensional imaging information such as structure, function and molecule, and realizes a leaping development from “morphological description” to “systems biology characterization”. In recent years, with the establishment of large-scale multi-modal neuroimaging databases and the rapid development of artificial intelligence algorithms, deep learning-based multi-modal fusion strategies have shown significant advantages in the early diagnosis, progression prediction and individualized treatment decision-making of Alzheimer’s disease, Parkinson’s disease, stroke, autism spectrum disorder and other diseases. The fusion paradigm has evolved from the traditional early, middle and late fusion to advanced interaction modes based on orthogonal compensation, cross-layer guidance and attention mechanism; the introduction of explainable artificial intelligence technology provides an effective path to break the “black box” dilemma and improve clinical trust. This paper systematically reviews the latest progress of multi-modal image fusion technology, expounds the application status of multi-modal combinations such as MRI-PET, structural-functional imaging and imaging-electrophysiology in the evaluation of neurological diseases, analyzes the core challenges such as data heterogeneity, modality missing and model generalization ability, and prospects the development prospect of multi-modal artificial intelligence in precision medicine of neurological diseases.
文章引用:刘子毅. 多模态影像融合在神经系统疾病诊断中的 应用进展——基于人工智能的技术范式与临床转化[J]. 临床医学进展, 2026, 16(4): 1233-1242. https://doi.org/10.12677/acm.2026.1641356

参考文献

[1] Zhu, Z., Zhang, X., Xu, C. and Shen, Y. (2026) Construction and Interpretability of a Multimodal Deep Learning Model of Electronystagmography-Optical Coherence Tomography Angiography for Early Screening of Alzheimer’s Disease. American Journal of Alzheimers Disease & Other Dementias, 41. [Google Scholar] [CrossRef
[2] Zhang, R., Sheng, J., Zhang, Q., Wang, J. and Wang, B. (2025) A Review of Multimodal Fusion-Based Deep Learning for Alzheimer’s Disease. Neuroscience, 576, 80-95. [Google Scholar] [CrossRef] [PubMed]
[3] Guan, Y., Wang, W., Chen, J., Yang, P., Xu, J. and Qi, J. (2026) A Survey of Multimodal Fusion for Alzheimer’s Disease Prediction: A New Taxonomy and Trends. Information Fusion, 131, 104098. [Google Scholar] [CrossRef
[4] Hemmerling, D., Dudek, M., Krzywdziak, J., Żbik, M., Szecowka, W., Daniol, M., et al. (2026) Gru-Based Deep Multimodal Fusion of Speech and Head-IMU Signals in Mixed Reality for Parkinson’s Disease Detection. Sensors, 26, Article 269. [Google Scholar] [CrossRef
[5] Huang, W. and Shu, N. (2025) AI-Powered Integration of Multimodal Imaging in Precision Medicine for Neuropsychiatric Disorders. Cell Reports Medicine, 6, Article ID: 102132. [Google Scholar] [CrossRef] [PubMed]
[6] El-Askary, N.S., Gawish, M., Morsey, M.M., Mahmoud, A.M., Aref, M. and El-Arif, T.I. (2025) Towards Explainable Multi-Modal Fusion Strategies for ASD Detection: A Review. 2025 Twelfth International Conference on Intelligent Computing and Information Systems (ICICIS), Cairo, 25-27 November 2025, 560-567. [Google Scholar] [CrossRef
[7] Viswan, V., Shaffi, N., Malathy, E., Chemmalar Selvi, G., Kavitha, B.R., Abdesselam, A., et al. (2026) Multimodal Fusion and Explainability of Artificial Intelligence Models in Alzheimer’s Disease Detection. Brain Informatics, 13, Article No. 5. [Google Scholar] [CrossRef
[8] Tuxunjiang, P., Huang, C., Zhou, Z., Zhao, W., Han, B., Tan, W., et al. (2025) Prediction of NIHSS Scores and Acute Ischemic Stroke Severity Using a Cross-Attention Vision Transformer Model with Multimodal MRI. Academic Radiology, 32, 5453-5467. [Google Scholar] [CrossRef] [PubMed]
[9] Amador, K., Winder, A.J., Fiehler, J., Barber, P.A., Wilms, M. and Forkert, N.D. (2025) A Multimodal Multitask Deep Learning Model for Predicting Stroke Lesion and Functional Outcomes Using 4D CTP Imaging and Clinical Metadata. Scientific Reports, 15, Article No. 38136. [Google Scholar] [CrossRef
[10] Xiao, L., Zheng, Q., Li, S., Wei, Y., Si, W. and Pan, Y. (2025) Integration of Spatiotemporal Dynamics and Structural Connectivity for Automated Epileptogenic Zone Localization in Temporal Lobe Epilepsy. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 33, 3065-3075. [Google Scholar] [CrossRef] [PubMed]
[11] Zhang, L., Sheng, S., Wang, X., Gao, J., Sun, Y., Xiao, K., et al. (2025) CrossConvPyramid: Deep Multimodal Fusion for Epileptic Magnetoencephalography Spike Detection. IEEE Journal of Biomedical and Health Informatics, 29, 3194-3205. [Google Scholar] [CrossRef] [PubMed]
[12] Xie, K., Sahlas, E., Ngo, A., Chen, J., Arafat, T., Royer, J., et al. (2025) Personalized Biomarkers of Multiscale Functional Alterations in Temporal Lobe Epilepsy. Nature Communications, 16, 10145. [Google Scholar] [CrossRef
[13] Rokham, H., Falakshahi, H., Pearlson, G.D. and Calhoun, V.D. (2025) Neuroimaging Data Informed Mood and Psychosis Diagnosis Using an Ensemble Deep Multimodal Framework. Human Brain Mapping, 46, e70347. [Google Scholar] [CrossRef
[14] Yoon, V., Kim, S., Park, J., Lee, H., Choi, M., Kang, D., et al. (2025) Multimodal Meta-Analytically Anchored Clustering Reveals Five Distinct Schizophrenia Spectrum Disorder Subgroups with Divergent Brain-Symptom-Genetics Profiles. medRxiv.
[15] Zhao, J., Wang, Y., Liu, M., Guo, B., Wang, Y., Gao, B., et al. (2025) Deciphering the Molecular Tapestry of Schizophrenia: Integrating Transcriptomics, Neuroimaging, and Clinical Data for Precision Medicine. Translational Psychiatry, 15, Article No. 489. [Google Scholar] [CrossRef