基于改进随机森林的肝硬化诊断预测研究
Diagnostic Prediction of Liver Cirrhosis Based on Improved Random Forest
DOI: 10.12677/CSA.2019.910216, PDF,  被引量    科研立项经费支持
作者: 刘佳星, 张宏烈, 刘艳菊*, 张惠玉, 刘彦忠:齐齐哈尔大学计算机与控制工程学院,黑龙江 齐齐哈尔
关键词: 肝硬化诊断改进随机森林算法深度限制数据预测Diagnosis of Liver Cirrhosis Improved Random Forest Algorithm Depth LimitationData Prediction
摘要: 当前机器学习在医疗诊断领域得到广泛应用。本文基于改进的随机森林算法,利用患者进行各项检查获得的大量数据,对照肝硬化指标,对患者数据进行分析处理,提出一种基于患者检查数据的肝硬化预测方法。该方法改进传统诊断技术,采用随机森林算法,利用其使用随机因子来控制数据多样性的特点,引入深度限制指标,提高算法对数据的判断和识别能力,增强预测的准确性。本文采用人体测量学组成的数据集进行实验,结果表明该方法预测准确率达到90%以上。
Abstract: Machine learning is widely applied in the field of medical diagnosis currently. Based on the improved random forest algorithm, a prediction method for liver cirrhosis diagnosis is proposed, in which the patients’ data with liver cirrhosis indicators is analyzed and processed by means of the large amount of data obtained by patients for each examination and liver cirrhosis indicators. The method of the paper has improved the traditional diagnosis technology, adopted the random forest algorithm, used its random factor to control the characteristics of data diversity, and introduced the depth limit index. And it has improved the judgment and recognition ability of the data, and enhanced the prediction accuracy. In this paper, the data set composed of anthropometrics is used for experiments. The results show that the prediction accuracy of this method is over 90%.
文章引用:刘佳星, 张宏烈, 刘艳菊, 张惠玉, 刘彦忠. 基于改进随机森林的肝硬化诊断预测研究[J]. 计算机科学与应用, 2019, 9(10): 1928-1938. https://doi.org/10.12677/CSA.2019.910216

参考文献

[1] 左颖婷. 遗传算法BP神经网络在肝硬化分期诊断中的应用[D]: [硕士学位论文]. 太原: 山西医科大学, 2017.
[2] 孙振球. 医学统计学[M]. 北京: 人民卫生出版社, 2007: 333-341.
[3] 张宁, 周双男, 宫嫚, 等. Fi-broScan评价复方鳖甲软肝片抗纤维化的疗效[J]. 临床肝胆病杂志, 2013, 29(10): 760-763.
[4] 窦智丽. 肝炎肝硬化患者症状、证候要素与瞬时弹性成像检测值的相关性研究[D]: [硕士学位论文]. 北京: 北京中医药大学, 2019.
[5] 范宏. 贝叶斯在医疗诊断系统中的应用研究[D]: [硕士学位论文]. 成都: 电子科技大学, 2013.
[6] 霍东雪, 刘辉, 尚振宏, 等. 一种异构集成学习的儿科疾病诊断方法研究[J]. 计算机应用与软件, 2018, 35(6): 54-57+157.
[7] Singh, B.K., Verma, K. and Thoke, A.S. (2015) A Dual Feature Selection Approach for Classification of Breast Tumors in Ultrasound Images Using ANN and SVM. Artificial Intelligent Systems & Machine Learning, 7, 78-84.
[8] Singh, B.K., Verma, K. and Thoke, A.S. (2016) Fuzzy Cluster Based Neural Classifier for Classifying Breast Tumors in Ultrasound Images. Expert Systems with Applications, 66, 114-123. [Google Scholar] [CrossRef
[9] Bikesh, K.S. (2019) Determining Relevant Biomarkers for Predic-tion of Breast Cancer Using Anthropometric and Clinical Features: A Comparative Investigation in Machine Learning Paradigm. Biocybernetics and Biomedical Engineering, 39, 393-409. [Google Scholar] [CrossRef
[10] Angshuman, P. and Dipti, P.M. (2019) Reinforced Quasi-Random Forest. Pattern Recognition, 94, 13-24. [Google Scholar] [CrossRef
[11] Feng, W., Dauphin, G., Huang, W.J., Quan, Y. and Liao, W. (2019) New Margin-Based Subsampling Iterative Technique in Modified Random Forests for Classification. Knowledge-Based Systems, 182, Article ID: 104845. [Google Scholar] [CrossRef
[12] 王宇燕. 基于决策树集成学习的癌症存活性预测分析[D]: [硕士学位论文]. 大连: 大连理工大学, 2018.
[13] Bedi, J. and Toshniwal, D. (2019) PP-NFR: An Improved Hybrid Learning Approach for Porosity Prediction from Seismic Attributes Using Non-Linear Feature Reduction. Journal of Applied Geophysics, 166, 22-32. [Google Scholar] [CrossRef
[14] Breiman, L. (2011) Random Forests. Machine Learning, 45, 5-32. [Google Scholar] [CrossRef
[15] Cutler, A., Cutler, D.R. and Stevens, J.R. (2012) Random-forests. In: Zhang, C. and Ma, Y., Eds., Ensemble Machine Learning, Springer, Boston, MA, 157-175. [Google Scholar] [CrossRef
[16] Nadi, A. and Moradi, H. (2019) Increasing the Views and Re-ducing the Depth in Random Forest. Expert Systems with Applications, 138, 112801. [Google Scholar] [CrossRef
[17] 普事业, 刘三阳, 白艺光. 网络拓扑特征的不平衡数据分类[J/OL]. 智能系统学报, 2019(5): 1-9.
http://kns.cnki.net/kcms/detail/23.1538.TP.20190527.0921.002.html, 2019-08-11.