基于机器学习的前列腺癌患者分类研究
Research on Prostate Patients Classification Based on Machine Learning
摘要: 基于机器学习算法,本文对前列腺癌患者构建分类模型,目的是区分出不同患者,从而采取不同的治疗措施。基于随机森林、Adaboost、GradienBoosting、XGBoost、LightBGM和Stacking模型融合算法构建前列腺癌分类预测模型,调整模型参数,并验证模型效能。在单一的机器学习算法中,每一种机器学习算法都能对第一类(前列腺增生)患者进行识别,一部分模型在第二类(前列腺癌)患者和第三类(同时有前列腺癌和前列腺增生)患者中预测错误率较高;Adaboost算法的性能最优,对每一类都能够进行有效识别;Stacking融合算法优于所有单一的机器学习算法,在测试集上的准确率达到了96%。在前列腺癌分类预测模型中,Stacking融合算法效果明显优于单一的机器学习算法。
Abstract:
This paper aims to distinguish different kinds of prostate cancer patients. We constructed a classification model for patients based on machine learning algorithms, so that we can take different treatment measures. Based on Random Forest, Adaboost, GradienBoosting, XGBoost, LightGBM and Stacking fusion algorithm, we constructed a prostate cancer classification model. The parameters were adjusted, and the effectiveness of models was evaluated. In all single machine learning algorithms, they can identify patients in the first category (prostatic hyperplasia). Some models have a high error rate of classification in the second category (prostate cancer) and the third category (both prostatic hyperplasia and prostate cancer); Adaboost algorithm has the best performance and it can identify each category effectively. The accuracy of Stacking fusion algorithm is up to 96%, better than all single models. Stacking fusion algorithm is better than each single machine learning algorithm significantly in the prostate cancer classification.
参考文献
|
[1]
|
Berríos-Torres, S.I., Umscheid, C.A., Bratzler, D.W., et al. (2017) Centers for Disease Control and Prevention Guideline for the Prevention of Surgical Site Infection, 2017. JAMA Surgery, 152, 784-791.
[Google Scholar] [CrossRef] [PubMed]
|
|
[2]
|
方孙福, 周晓燕, 罗朝军, 等. 动态增强MRI结合DWI对前列腺癌的诊断价值分析[J]. 医学影像学杂志, 2017, 27(1): 186-188.
|
|
[3]
|
王逸飞, 吴欢, 薛万国, 等. 前列腺癌与前列腺增生的分类预测及癌症风险因素分析[J]. 解放军医学院学报, 2021, 42(3): 277-281+305.
|
|
[4]
|
Lundberg, S.M., Erion, G.G. and Lee, S.I. (2018) Consistent Individualized Feature Attribution for Tree Ensembles.
https://arxiv.org/abs/1802.03888
|
|
[5]
|
中华医学会泌尿外科分会. 前列腺癌诊断治疗指南[J]. 继续医学教育, 2007, 21(6): 30-39.
|
|
[6]
|
张洪侠, 郭贺, 王金霞, 等. 基于XGBoost算法的2型糖尿病精准预测模型研究[J]. 中国实验诊断学, 2018, 22(3): 408-412.
|
|
[7]
|
Jiang, J.X., Xiao, Z.B., Tang, Z.H., Zhong, Y.F. and Qiang, J.W. (2018) Differentiating between Benign and Malignant Sinonasal Lesions Using Dynamic Contrast-Enhanced MRI and Intravoxel Incoherent Motion. European Journal of Radiology, 98, 7-13. [Google Scholar] [CrossRef] [PubMed]
|
|
[8]
|
王华, 段青, 威晋, 等. 3.0T MRI动态增强在鉴别前列腺增生及前列腺癌中的价值[J]. 福建医科大学学报, 2010,44(3): 223-226.
|
|
[9]
|
法慧. 基于结构方程模型的前列腺癌与前列腺增生鉴别诊断的影响因素研究[D]: [硕士学位论文]. 长春: 吉林大学, 2020.
|