基于卷积神经网络的糖尿病预测
Prediction of Diabetes Based on Convolutional Neural Network
DOI: 10.12677/CSA.2020.105094, PDF,  被引量    科研立项经费支持
作者: 王 莹, 张 衡*, 左 健:西南大学计算机与信息科学学院,重庆;徐匡一:西南大学计算机与信息科学学院,重庆;北京新能源汽车技术创新中心有限公司,北京
关键词: 糖尿病辅助诊断与治疗数据分析机器学习Diabetes Assisted Diagnosis and Treatment Data Analysis Machine Learning
摘要: 生物技术和医学的显著进步导致了生物医学数据的大量产生。糖尿病(Diabetes mellitus, DM)作为一种常见的慢性病,在诊断和治疗过程中也产生了大量的医学数据。因此,医学数据的探索成为了一个热点。本研究旨在探讨出院病人的短期入院率。根据30天内再次入院的概率,我们可以判断这种治疗的效果。从而协助医生为患者提供更有效的治疗,以便提高患者的生活质量。在本研究中,对数据进行了处理,利用改进的卷积神经网络算法(Convolutional neural network algorithm, CNN-EI)对糖尿病病例数据集进行数据挖掘。实验结果表明,改进后的算法能够很好地处理高维、大样本的医学数据。将该方法的结果与其他先进方法的结果进行比较,其准确率为83.7%。
Abstract: The remarkable progress of biotechnology and medical science has led to the significant production of biomedical data. For diabetes mellitus (DM), a common chronic disease, a large number of medi-cal data has also been generated in the process of diagnosis and treatment. So the exploration of medical data has become a hotspot. The purpose of this study was to explore the short-term admission probability of discharged patients. According to the probability of re-admission within 30 days, we can judge the effect of this treatment, thus assisting doctors to provide more efficient treatment for patients, so as to improve the quality of life of patients. In this study, the data has been processed, and the improved convolutional neural network algorithm (CNN-EI) was used to perform data mining on the dataset of diabetic case data. The experimental results show that the improved algorithm can well perform high-dimensional and large-sample medical data. Accuracy is 83.7%. The result is compared to the result of other state-of-art methods.
文章引用:王莹, 张衡, 左健, 徐匡一. 基于卷积神经网络的糖尿病预测[J]. 计算机科学与应用, 2020, 10(5): 914-926. https://doi.org/10.12677/CSA.2020.105094

参考文献

[1] World Health Organization (2017) World Health Statistics 2017: Monitoring Health for the SDGs.
[2] World Health Organization (2016) Global Report on Diabetes: Executive Summary.
[3] Russell, S. and Norvig, P. (2003) Artificial Intelligence: A Modern Approach. 2nd Edition, Prentice Hall, Upper Saddle River.
[4] Russell, S. and Norvig, P. (2009) Artificial Intelligence: A Modern Approach. 3rd Edition, Prentice Hall, Upper Saddle River.
[5] Sattar, N., Wannamethee, G., Sarwar, N., Tchernova, J., Cherry, L., Wallace, A.M., Danesh, J. and Whincup, P.H. (2006) Adi-ponectin and Coronary Heart Disease: A Prospective Study and Meta-Analysis. Circulation, 114, 623-629. [Google Scholar] [CrossRef
[6] Huang, G.-M., Huang, K.-Y., Lee, T.-Y. and Weng, J. (2015) An Interpretable Rule-Based Diagnostic Classification of Diabetic Nephropathy among Type 2 Diabetes Patients. BMC Bio-Information, 16, S5. [Google Scholar] [CrossRef
[7] Li, J.X., et al. (2019) Body Surface Feature-Based Multi-Modal Learning for Diabetes Mellitus Detection. Information Sciences, 472, 1-14. [Google Scholar] [CrossRef
[8] Kavakiotis, I., Tsave, O., Salifoglou, A., et al. (2017) Machine Learning and Data Mining Methods in Diabetes Research. Computational and Structural Biotechnology Journal, 15, 104-116. [Google Scholar] [CrossRef] [PubMed]
[9] Vijayanv, V. and Ravikumar, A. (2014) Study of Data Mining Algorithms for Prediction and Diagnosis of Diabetes Mellitus. International Journal of Computer Applications, 95, 12-16. [Google Scholar] [CrossRef
[10] Sneha, N. and Gangil, T. (2019) Analysis of Diabetes Mellitus for Early Prediction Using Optimal Features Selection. Journal of Big Data, 6, 13. [Google Scholar] [CrossRef
[11] Ye, L.L., Lee, T.-S. and Chi, R. (2018) A Hybrid Machine Learning Scheme to Analyze the Risk Factors of Breast Cancer Outcome in Patients with Diabetes Mellitus. Journal of Universal Computer Science, 24, 665-681.
[12] Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning (Vol. 1). MIT Press, Cambridge.
[13] LeCun, Y., Bengio, Y. and Hinton, G. (2015) Deep Learning. Nature, 521, 436. [Google Scholar] [CrossRef] [PubMed]
[14] 周志华. 机器学习[M]. 北京: 清华大学出版社, 2015: 114-115.
[15] Alhassan, Z., McGough, A.S., Alshammari, R., Daghstani, T., Budgen, D. and Al Moubayed, N. (2018) Type 2 Diabetes Mellitus Diagnosis from Time Series Clinical Data Using Deep Learning Models. 27th International Conference on Artificial Neural Networks, Rhodes, Greece, 4-7 October 2018, Vol. 3, 468-478. [Google Scholar] [CrossRef
[16] Kogias, K., Andreadis, I., Dalakleidi, K. and Nikita, K.S. (2018) A Two-Level Food Classification System for People with Diabetes Mellitus Using Convolutional Neural Net-works. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Honolulu, 2603-2606. [Google Scholar] [CrossRef
[17] Umpierrez, G.E., Isaacs, S.D., Bazargan, N., You, X., Thaler, L.M. and Kitabchi, A.E. (2002) Hyperglycemia: An Independent Marker of In-Hospital Mortality in Patients with Undiagnosed Diabetes. Journal of Clinical Endocrinology and Metabolism, 87, 978-982. [Google Scholar] [CrossRef] [PubMed]
[18] Levetan, C.S., Passaro, M., Jablonski, K., Kass, M. and Ratner, R.E. (1998) Unrecognized Diabetes among Hospitalized Patients. Diabetes Care, 21, 246-249. [Google Scholar] [CrossRef] [PubMed]
[19] Frank, A. and Asuncion, A. (2010) UCI Machine Learning Reposi-tory. University of California, School of Information and Computer Science, San Diego.
[20] Maratea, A., Petrosino, A. and Manzo, M. (2014) Adjusted F-Measure and Kernel Scaling for Imbalanced Data Learning. Information Sciences, 257, 331-341. [Google Scholar] [CrossRef
[21] Barua, S., Islam, M.M., Yao, X., et al. (2014) MWMOTE—Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning. IEEE Transac-tions on Knowledge and Data Engineering, 26, 405-425. [Google Scholar] [CrossRef
[22] Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boost-ing System. [Google Scholar] [CrossRef
[23] Lin, K., Lin, Y. and Kong, G. (2018) A XGBoost Algorithm-Based In-Hospital Mortality Prediction Model for Patients with Sepsis in ICU. Chinese Journal of Health Informatics and Management, 15, 536-540+563.
[24] Lecun, Y., Bottou, L., Bengio, Y., et al. (1998) Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE, 86, 2278-2324. [Google Scholar] [CrossRef
[25] Zhou, Z.H., Wu, J. and Tang, W. (2002) Ensembling Neural Networks: Many Could Be Better than All. Artificial Intelligence, 137, 239-263. [Google Scholar] [CrossRef
[26] Ujjwal Maulik, S.B. (2002) Performance Evaluation of Some Clustering Algorithms and Validity Indices. IEEE Transactions on Pattern Analysis & Machine In-telligence, 24, 1650-1654. [Google Scholar] [CrossRef
[27] Strack, B., Deshazo, J.P., Gennings, C., et al. (2014) Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records. BioMed Research International, 2014, Article ID: 781670. [Google Scholar] [CrossRef] [PubMed]
[28] Cotha, N.K.P. and Sokolova, M. (2015) Multi-Labeled Classification of Demographic Attributes of Patients: A Case Study of Diabetics Patients.