基于多模态特征融合的高血压危险分层预测与评估
Prediction and Evaluation of Hypertension Risk Based on Multimodal Feature Fusion
DOI: 10.12677/mos.2025.142171, PDF,    国家自然科学基金支持
作者: 翁雯浩, 杨晶东*, 罗晓烽, 李 琳:上海理工大学光电信息与计算机工程学院,上海;王斯曼, 燕海霞:上海中医药大学,中医学院,上海市健康辨识与评估重点实验室,上海
关键词: 高血压危险分层多模态特征融合长短期记忆网络DS证据理论树形结构的贝叶斯优化Hypertension Risk Stratification Multimodal Feature Fusion Long Short-Term Memory Network DS Evidence Theory Tree-Structured Bayesian Optimization
摘要: 机器学习应用于高血压危险分层预测,经常由于特征提取困难且异常值多、模型调优成本高,而导致模型预测精度低、泛化性能差。本文提出了一种多模态融合模型。该模型构建了多长短期记忆网络串联结构,实现无超参数的脉搏波特征提取,降低特征提取成本,提高特征区分度。使用三种不同的机器学习方法分别对脉搏波、理化、证素特征进行分类,采用基于树结构估计的贝叶斯优化算法,动态优化各机器学习模型超参数,降低超参数优化成本。采用DS证据理论冲突消解策略,减少各模型预测冲突,实现多模态特征决策级融合。本文采用上海中医药大学附属中西医结合医院体检中心等医院提供的临床数据。5-Fold交叉验证后分类模型评估指标F1-score、Accuracy、Recall、Specificity、AUC值分别为:89.1%、90.9%、89.3%、94.9%、97.7%。与典型模型相比,本文方法具有较高的分类精度和泛化性能。此外,本文分别基于最近邻、支持向量机、极限梯度提升树算法,研究了脉搏波脉图特征与高血压危险分层的相关性,深入挖掘潜在的风险因素,为高血压临床诊断提供有效参考。
Abstract: The application of machine learning in hypertension risk prediction is often hindered by difficulties in feature extraction, the presence of outliers, and high tuning costs, which result in low prediction accuracy and poor generalization performance. This paper proposes a multimodal fusion model. The model constructs a multi-long short-term memory (LSTM) network concatenation structure to achieve hyperparameter-free pulse wave feature extraction, reducing feature extraction costs and improving feature discrimination. Three different machine learning methods are used to classify pulse wave, physicochemical, and syndromic features, while a tree-structured Bayesian optimization algorithm is employed to dynamically optimize the hyperparameters of each machine learning model, thus reducing the cost of hyperparameter optimization. A DS evidence theory conflict resolution strategy is used to reduce prediction conflicts between models, enabling decision-level fusion of multimodal features. The clinical data used in this study were provided by the Health Check-up Center of the Affiliated Hospital of Shanghai University of Traditional Chinese Medicine, among others. After 5-fold cross-validation, the evaluation metrics are as follows: F1-score 89.1%, Accuracy 90.9%, Recall 89.3%, Specificity 94.9%, and AUC 97.7%. Compared with typical models, the proposed method demonstrates higher classification accuracy and generalization. In addition, based on k-nearest neighbors, support vector machines, and extreme gradient boosting tree algorithms, the paper explores the correlation between pulse wave morphology features, and hypertension risk, uncovering potential risk factors, and providing effective references for clinical hypertension diagnosis.
文章引用:翁雯浩, 杨晶东, 罗晓烽, 李琳, 王斯曼, 燕海霞. 基于多模态特征融合的高血压危险分层预测与评估[J]. 建模与仿真, 2025, 14(2): 506-520. https://doi.org/10.12677/mos.2025.142171

参考文献

[1] Kario, K., Okura, A., Hoshide, S. and Mogi, M. (2024) The WHO Global Report 2023 on Hypertension Warning the Emerging Hypertension Burden in Globe and Its Treatment Strategy. Hypertension Research, 47, 1099-1102. [Google Scholar] [CrossRef] [PubMed]
[2] Zhang, W. (2019) Cardiovascular Risk Stratification in Chinese Patients with Hypertension. Clinical Hypertension, 21, 1607-1608.
[3] 徐月贞. 乌鲁木齐市南山牧区哈萨克族高血压发病风险预测模型构建及风险管理研究[D]: [博士学位论文]. 乌鲁木齐: 新疆医科大学, 2020.
[4] 由淑萍, 徐月贞, 陶宁, 等. Framingham及其改良模型在新疆牧区哈萨克族牧民中的应用研究[J]. 医学信息, 2021, 34(11): 1-4, 8.
[5] 樊星, 顾军, 郭建锋, 等. 颈动脉内中膜厚度结合Framingham风险评分用于高血压患者危险分级的评估[J]. 江苏大学学报(医学版), 2016, 26(5): 418-421.
[6] 吴雪怡, 宋雷. 人工智能技术在高血压领域的应用[J]. 中国分子心脏病学杂志, 2020, 20(6): 3598-3601.
[7] 胡静. 基于机器学习方法和基因型数据的高血压风险预测[D]: [硕士学位论文]. 西安: 西安电子科技大学, 2020.
[8] 陈娇. 我国老年人高血压现状及其影响因素分析——基于CHARLS2015数据[D]: [硕士学位论文]. 南京: 南京邮电大学, 2020.
[9] 崔伟锋, 林萍, 刘萧萧, 等. 基于机器学习的原发性高血压心血管风险预后模型[J]. 中国老年学杂志, 2022, 42(15): 3625-3629.
[10] 方艳, 王群华, 陈红波. 妊娠期高血压疾病的早期预测模型构建[J]. 现代妇产科进展, 2024, 33(7): 506-511.
[11] 关雨婷, 祝丙华, 马建新, 等. 不同机器学习算法对老年原发性高血压发病风险的预测价值[J]. 中国医药导报, 2024, 21(18): 49-52.
[12] 逯鹏, 王汉章, 毛晓波, 等. 基于卷积自编码器网络的脉搏波分类模型[J]. 郑州大学学报(工学版), 2021, 42(5): 56-61.
[13] Lim, J., Li, J., Feng, X., Feng, L., Xiao, X., Xia, Y., et al. (2023) Machine Learning-Based Evaluation of Application Value of Traditional Chinese Medicine Clinical Index and Pulse Wave Parameters in the Diagnosis of Polycystic Ovary Syndrome. European Journal of Integrative Medicine, 64, Article 102311. [Google Scholar] [CrossRef
[14] Bialonczyk, U., Debowska, M., Dai, L., Qureshi, A.R., Söderberg, M., Lindholm, B., et al. (2024) Detection of Medial Vascular Calcification in Chronic Kidney Disease Based on Pulse Wave Analysis in the Frequency Domain. Biomedical Signal Processing and Control, 94, Article 106250. [Google Scholar] [CrossRef
[15] 谈冰冰, 高国伟. 基于机器学习的脉搏信号辅助中医预测血压[J]. 智能计算机与应用, 2023, 13(12): 149-153, 157.
[16] 韩红娟, 陈杜荣, 秦瑶, 等. 基于Stacking集成策略的阿尔茨海默病诊断模型研究[J]. 现代预防医学, 2022, 49(22): 4045-4051, 4089.
[17] 莫有桦, 徐婷, 孟诗迪, 等. 基于特征选择结合Boosting算法模型在预测矿工非致命性职业伤害严重等级中的适用性[J]. 环境与职业医学, 2023, 40(10): 1115-1120.
[18] Fei, L., Li, T. and Ding, W. (2024) Dempster-Shafer Theory-Based Information Fusion for Natural Disaster Emergency Management: A Systematic Literature Review. Information Fusion, 112, Article 102585. [Google Scholar] [CrossRef
[19] Maharjan, R., Kim, K.H., Lee, K., Han, H. and Jeong, S.H. (2024) Machine Learning-Driven Optimization of mRNA-Lipid Nanoparticle Vaccine Quality with XGBoost/Bayesian Method and Ensemble Model Approaches. Journal of Pharmaceutical Analysis, 14, Article 100996. [Google Scholar] [CrossRef] [PubMed]
[20] Nikolaidis, P. and Chatzis, S. (2021) Gaussian Process-Based Bayesian Optimization for Data-Driven Unit Commitment. International Journal of Electrical Power & Energy Systems, 130, Article 106930. [Google Scholar] [CrossRef
[21] Dong, H., He, D. and Wang, F. (2020) SMOTE-XGBoost Using Tree Parzen Estimator Optimization for Copper Flotation Method Classification. Powder Technology, 375, 174-181. [Google Scholar] [CrossRef
[22] 胡文锐. 基于深度学习的心电脉搏特征识别与应用[D]: [硕士学位论文]. 银川: 北方民族大学, 2021.
[23] He, Y., Chen, H., Liu, D. and Zhang, L. (2021) A Framework of Structural Damage Detection for Civil Structures Using Fast Fourier Transform and Deep Convolutional Neural Networks. Applied Sciences, 11, Article 9345. [Google Scholar] [CrossRef
[24] Kilik, R. (2021) Histogram-Based Weighted Median Filtering Used for Noise Reduction of Digital Elevation Model Data. Acta Geodaetica et Geophysica, 56, 743-764. [Google Scholar] [CrossRef
[25] Wang, G. and Wang, K. (2013) Study and Design of Exponential and Butterworth Low-Pass Filters Used for Digital Speckle Interference Fringe Filtering. Optik, 124, 6713-6717. [Google Scholar] [CrossRef
[26] Puri, D.V., Gawande, J.P., Rajput, J.L. and Nalbalwar, S.L. (2023) A Novel Optimal Wavelet Filter Banks for Automated Diagnosis of Alzheimer’s Disease and Mild Cognitive Impairment Using Electroencephalogram Signals. Decision Analytics Journal, 9, Article 100336. [Google Scholar] [CrossRef
[27] Yousefi, M., Oskoei, V., Esmaeli, H.R. and Baziar, M. (2024) An Innovative Combination of Extra Trees within Adaboost for Accurate Prediction of Agricultural Water Quality Indices. Results in Engineering, 24, Article 103534. [Google Scholar] [CrossRef
[28] Zeng, T., Chen, Y., Zhu, D., Huang, Y., Huang, Y., Chen, Y., et al. (2024) AI Diagnostics in Bone Oncology for Predicting Bone Metastasis in Lung Cancer Patients Using DenseNet-264 Deep Learning Model and Radiomics. Journal of Bone Oncology, 48, Article 100640. [Google Scholar] [CrossRef] [PubMed]
[29] Basith Ali Khan, M.A. and Reddy, E.S. (2024) Post-COVID Effect on Heart after Recovery Based on Hybrid EfficientNet-DBN with Multilevel Classification Using ECG Images. EngMedicine, 1, Article 100021. [Google Scholar] [CrossRef
[30] 丁晓东. 基于中医问诊和脉诊信息的冠心病风险事件预警模型研究[D]: [硕士学位论文]. 上海: 上海中医药大学, 2021.
[31] 袁浩博, 吴云. 静息心率与高血压的研究进展[J]. 内蒙古医学杂志, 2021, 53(9): 1078-1080.
[32] 施仲伟. 心率与高血压[J]. 中华心脏与心律电子杂志, 2013, 1(1): 7-9.