基于LightGBM的心电信号分类算法
ECG Signal’s Classification Algorithm Based on LightGBM
DOI: 10.12677/JISP.2020.93020, PDF,   
作者: 洪宇光, 王 波, 潘湖迪:宁波海关技术中心,浙江 宁波;钱升谊*:杭州电子科技大学电子信息学院,浙江 杭州
关键词: 心电图机器学习信号处理LightGBMElectrocardiogram Machine Learning Signal Processing LightGBM
摘要: 心电图(Electrocardiogram, ECG)被广泛应用于窦性心动过速、室性早搏和心房颤动等心律失常诊断中,进而在心脏疾病诊断分析方面展现巨大的临床应用价值。为提升计算机辅助诊断心电图的分类效果,本文提出一种基于LightGBM的心电信号分类算法。该算法从心电图中提取单心拍特征、心律波动性特征以及全波形特征建立混合特征集,并采用LightGBM实现正常心拍、心房颤动、其他心律不齐、噪声四个类别的分类。最终该算法的性能指标在PhysioNet/CinC Challenge数据集上达到0.824,优于CART和CatBoost算法。同时为了加快心电图特征提取的速度,本文根据特征重要性筛选关键特征来减少分类所需的特征数量,在保持分类性能的同时将特征提取时间降为原来的17.8%。
Abstract: Electrocardiogram (ECG) has been widely used in the diagnosis of arrhythmia such as sinus tachycardia, ventricular premature beats and atrial fibrillation, and has shown great clinical application value in the diagnosis and analysis of heart diseases. In order to improve the classification performance of ECG computer-aided diagnosis, an ECG signal classification algorithm based on LightGBM was proposed. The algorithm extracts single heart beat features, heart rhythm volatility features and full waveform features from ECGs to establish a mixed feature set, and uses LightGBM to classify four categories: normal heart beat, atrial fibrillation, other arrhythmia, and noise. Finally, the performance parameter  of the algorithm reached 0.824 on PhysioNet/CinC Challenge dataset, which was better than CART and CatBoost. At the same time, in order to accelerate the speed of ECG feature extraction, this paper screens the key features according to the importance of features to reduce the number of features required for classification, and the feature extraction time is reduced to 17.8% while maintaining the performance of classification.
文章引用:洪宇光, 王波, 潘湖迪, 钱升谊. 基于LightGBM的心电信号分类算法[J]. 图像与信号处理, 2020, 9(3): 165-171. https://doi.org/10.12677/JISP.2020.93020

参考文献

[1] Rajpurkar, P., Hannun, A.Y., Haghpanahi, M., et al. (2017) Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks.
[2] 卢喜烈. 现代心电图诊断大全[M]. 北京: 科学技术文献出版社, 1997.
[3] Tateno, K. and Glass, L. (2001) Automatic Detection of Atrial Fibrillation Using the Coefficient of Variation and Density Histograms of RR and deltaRR Intervals. Medical & Biological Engineering & Computing, 39, 664-671.
[Google Scholar] [CrossRef
[4] 宋莉, 孟庆建, 张光玉, 等. 基于波形特征和SVM的心电信号自动分类方法研究[J]. 中国医学物理学杂志, 2010(4): 2043-2046.
[5] Elhaj, F.A., Salim, N., Harris, A.R., et al. (2016) Arrhythmia Recognition and Classification Using Combined Linear and Nonlinear Features of ECG Signals. Computer Methods & Programs in Biomedicine, 127, 52-63.
[Google Scholar] [CrossRef] [PubMed]
[6] 颜昊霖, 安勇, 王宏飞, 等. 基于卷积神经网络的心电特征提取[J]. 计算机工程与设计, 2017(4): 1024-1028.
[7] Kiranyaz, S., Ince, T. and Gabbouj, M. (2016) Real-Time Patient-Specific ECG Classification by 1D Convolutional Neural Networks. IEEE Transactions on Biomedical Engineering, 63, 664-675.
[Google Scholar] [CrossRef
[8] 王润. 心电信号的预处理算法分析[J]. 现代计算机(专业版), 2018(7): 33-36.
[9] Barhatte, A.S., Ghongade, R. and Tekale, S.V. (2016) Noise Analysis of ECG Signal Using Fast ICA. 2016 Conference on Advances in Signal Processing (CASP) Cummins College of Engineering for Women, Pune, 9-11 Jun 2016, 1-5.
[Google Scholar] [CrossRef
[10] Ke, G., Meng, Q., Finley, T.W., et al. (2017) LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Neural Information Processing Systems, Long Beach, CA, 4-9 December 2017, 3149-3157.
[11] Zhou, Z.-H. (2012) Ensemble Methods: Foundations and Algorithms. CRC Press, New York, 99-118.
[Google Scholar] [CrossRef