基于混合深度学习算法的疾病预测模型
Disease Prediction Models Based on Hybrid Deep Learning Strategy
DOI: 10.12677/AIRR.2020.91003, PDF,  被引量    国家自然科学基金支持
作者: 梁 敏, 莫毓昌*, 陆 迁, 李宁宁:华侨大学数学科学学院,计算科学福建省高校重点实验室,福建 泉州;林 栋:福建中医药大学针灸学院,福建 福州
关键词: 电子健康档案长短时记忆网络卷积神经网络混合深度学习Electronic Health Record Long Short Term Memory Neural Network Convolutional Neural Network Hybrid Deep Learning
摘要: 利用电子健康档案中时间序列数据建立的预测模型在改善疾病管理方面发挥着重要作用。由于时态数据的序列相关性和特征空间维度大等特点,机器学习和非深度神经网络等传统方法难以提供疾病的准确预测。最新工作表明,长短时记忆(long short term memory, LSTM)神经网络性能优于大多数传统的疾病预测方法。为了进一步提高预测精度,本文提出了一种将卷积神经网络(convolutional neural network, CNN)与LSTM相结合的混合深度学习神经网络框架。使用电子健康档案中真实数据集的研究结果表明,相比传统SVM,CNN和LSTM模型,该算法的预测性能得到显著提高。
Abstract: Predictive models built using temporal data in electronic health records (EHRs) can potentially play a major role in improving management of diseases. Due to the sequence correlation and large feature space dimensions, traditional methods such as machine learning and non-deep neural networks are difficult to provide accurate predictions of disease. Recent works show that the long short term memory (LSTM) neural network outperforms most of those traditional methods for disease prediction problems. In this study, a hybrid deep learning neural network framework that combines convolutional neural network (CNN) with LSTM is proposed to further improve the pre-diction accuracy. Empirical studies using the real-world datasets in electronic health records have shown that using the proposed hybrid deep learning neural network for disease prediction signif-icantly improves predictive performance compared to the use of support vector machine (SVM) model, CNN and LSTM alone.
文章引用:梁敏, 莫毓昌, 林栋, 陆迁, 李宁宁. 基于混合深度学习算法的疾病预测模型[J]. 人工智能与机器人研究, 2020, 9(1): 16-23. https://doi.org/10.12677/AIRR.2020.91003

参考文献

[1] Wei, W.-Q., Teixeira, P.L., Mo, H., Cronin, R.M., Warner, J.L. and Denny, J.C. (2015) Combining Billing Codes, Clini-cal Notes, and Medications from Electronic Health Records Provides Superior Phenotyping Performance. Journal of the American Medical Informatics Association, 23, e20-e27.
[Google Scholar] [CrossRef] [PubMed]
[2] Henriksson, A., Zhao, J., Bostr?m, H. and Dalianis, H. (2015) Modeling Heterogeneous Clinical Sequence Data in Semantic Space for Adverse Drug Event Detection. IEEE International Conference on Data Science and Advanced Analytics, Paris, 19-21 October 2015, 1-8.
[Google Scholar] [CrossRef
[3] Hsieh, T.J., Hsiao, H.F. and Yeh, W.C. (2011) Forecasting Stock Markets Using Wavelet Transforms and Recurrent Neural Networks: An Integrated System Based on Artificial Bee Colony Algorithm. Applied Soft Computing, 11, 2510-2525.
[Google Scholar] [CrossRef
[4] Socher, R., Lin, C.C., Manning, C. and Ng, A.Y. (2011) Parsing Natural Scenes and Natural Language with Recursive Neural Networks. Proceedings of the 28th International Confe-rence on Machine Learning, Bellevue, 28 June-2 July 2011, 129-136.
[5] Kong, W., Dong, Z.Y., Hill, D.J., Luo, F. and Xu, Y. (2018) Short-Term Residential Load Forecasting Based on Resident Behaviour Learning. IEEE Transactions on Power Systems, 33, 1087-1088.
[Google Scholar] [CrossRef
[6] Yan, K., Du, Y. and Ren, Z. (2018) MPPT Perturbation Op-timization of Photovoltaic Power Systems Based on Solar Irradiance Data Classification. IEEE Transactions on Sus-tainable Energy, 10, 514-521.
[Google Scholar] [CrossRef
[7] Du, Y., Yan, K., Ren, Z. and Xiao, W. (2018) Designing Lo-calized MPPT for PV Systems Using Fuzzy-Weighted Extreme Learning Machine. Energies, 11, 2615.
[Google Scholar] [CrossRef
[8] Funahashi, K.I. and Nakamura, Y. (1993) Approximation of Dynamical Systems by Continuous Time Recurrent Neural Networks. Neural Networks, 6, 801-806.
[Google Scholar] [CrossRef
[9] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) Imagenet Classification with Deep Convolutional Neural Networks. Proceedings of the 25th International Conference on Neural Information Processing Systems, Lake Tahoe, 3-6 December 2012, 1097-1105.
[10] Almalaq, A. and Edwards, G.A. (2017) Review of Deep Learning Methods Applied on Load Forecasting. Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, Cancun, 18-21 December 2017, 511-516.
[Google Scholar] [CrossRef
[11] Wang, J., Yu, L.C., Lai, K.R. and Zhang, X. (2016) Dimensional Sentiment Analysis Using a Regional CNN-LSTM Model. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, 7-12 August 2016, 225-230.
[Google Scholar] [CrossRef
[12] Kumar, U. and Jain, V. (2010) Time Series Models (Grey-Markov, Grey Model with Rolling Mechanism and Singular Spectrum Analysis) to Forecast Energy Consumption in India. Energy, 35, 1709-1716.
[Google Scholar] [CrossRef
[13] Wang, Y.-W., Shen, Z.-Z. and Jiang, Y. (2018) Comparison of ARIMA and GM(1,1) Models for Prediction of Hepatitis B in China. PLoS ONE, 13, e0201987.
[Google Scholar] [CrossRef] [PubMed]
[14] 马晓梅, 史鲁斌, 其木格. 基于ARIMA乘积季节模型和Holt-Winters季节模型的梅毒月发病率预测[J]. 郑州大学学报(医学版), 2018, 53(1): 79-84.
[15] Zhang, Y.M., Luo, L. and Yang, J.C. (2019) A Hybrid ARIMA-SVR Approach for Forecasting Emergency Patient Flow. Journal of Ambient Intelligence and Humanized Computing, 10, 3315-3323.
[Google Scholar] [CrossRef
[16] Kann, B.H., Aneja, S. and Loganadane, G.V. (2018) Pretreatment Identification of Head and Neck Cancer Nodal Metastasis and Extranodal Extension Using Deep Learning Neural Networks. Scientific Reports, 8, Article No. 14036.
[Google Scholar] [CrossRef] [PubMed]
[17] Gu, J.Y., Liang, L.Z. and Song, H.Q. (2019) A Method for Hand-Foot-Mouth Disease Prediction Using Geo Detector and LSTM Model in Guangxi, China. Scientific Reports, 9, Article No. 17928.
[Google Scholar] [CrossRef] [PubMed]
[18] Chae, S., Kwon, S. and Lee, D. (2018) Predicting Infectious Disease Using Deep Learning and Big Data. International Journal of Environmental Research and Public Health, 15, 1596.
[Google Scholar] [CrossRef] [PubMed]
[19] Zeiler, M.D. (2012) ADADELTA: An Adaptive Learning Rate Method.
[20] Jozefowicz, R., Zaremba, W. and Sutskever, I. (2015) An Empirical Exploration of Recurrent Network Architectures. Proceedings of the 32nd International Conference on International Conference on Machine Learning, Lille, 6-11 July 2015, 2342-2350.
[21] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780.
[Google Scholar] [CrossRef] [PubMed]
[22] Werbos, P.J. (1990) Backpropagation through Time: What It Does and How to Do It. Proceedings of the IEEE, 78, 1550-1560.
[Google Scholar] [CrossRef
[23] Ketkar, N. (2017) Convolutional Neural Networks. In: Deep Learning with Python, Springer, Berlin, 63-78.
[Google Scholar] [CrossRef
[24] Goodfellow, I., Bengio, Y., Courville, A. and Bengio, Y. (2016) Deep Learning. MIT Press, Cambridge, 1.