Diabetes Prediction Based on PCA and BP Neural Network
Abstract:
The incidence of diabetes is rising year by year and is shifting toward younger age groups, with serious consequences for public health in China and worldwide, so research on diabetes prediction is needed. This paper classifies the Pima Indians diabetes dataset. Principal component analysis (PCA) is first used to reduce the data from 8 dimensions to 3, and a BP (backpropagation) neural network model is then built on the 3-dimensional data. The model based on PCA and the BP neural network is compared with a BP neural network model alone. The results show that the PCA-based model clearly outperforms the plain BP neural network on five performance metrics: precision, recall, F-value, accuracy, and the Matthews correlation coefficient (MCC), and it can serve as an effective method for diabetes prediction.
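The five metrics named in the abstract can all be derived from the four counts of a binary confusion matrix. The following is a minimal sketch of those standard definitions, not the authors' evaluation code: the function name `binary_metrics`, the choice of F1 as the F-value, and the example counts are assumptions made for illustration and are not results from the paper.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Compute five binary-classification metrics from confusion-matrix counts.

    tp, fp, fn, tn are the numbers of true positives, false positives,
    false negatives and true negatives. Ratios with a zero denominator
    are reported as 0.0 (a convention assumed here).
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-value taken as F1, the harmonic mean of precision and recall (assumption).
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # Matthews correlation coefficient: ranges from -1 to 1, 0 means chance level.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Hypothetical counts, chosen only to illustrate the call.
metrics = binary_metrics(tp=40, fp=10, fn=14, tn=90)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Unlike accuracy, the MCC uses all four counts symmetrically, which makes it a useful complement when the two classes are imbalanced.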