基于并行神经网络的疾病特征实体识别方法
Disease Feature Entity Recognition Method Based on Parallel Neural Network
DOI: 10.12677/csa.2024.1410202, PDF,   
作者: 杨建兴:河南财经政法大学计算机与信息工程学院,河南 郑州;浙江医院信息中心,浙江 杭州;卢照敢, 赵柴学正:河南财经政法大学计算机与信息工程学院,河南 郑州
关键词: 神经网络RoBERTa实体识别静脉血栓Neural Network RoBERTa Entity Recognition Venous Thromboembolism
摘要: 针对静脉血栓栓塞症电子病历文本语义复杂,疾病信息多维性导致的疾病特征学习不彻底、实体识别不准确的问题,本文提出了一种并行神经网络的疾病特征实体识别方法。首先,通过RoBERTa模型,更好地学习到病历实体中的特征信息。然后,通过双向长短期记忆网络,提取病历中的全局特征,再经过并行的迭代膨胀卷积神经网络提取病历中的局部特征。最后,利用CRF推理层修正神经网络输出的疾病特征标签。在医院提供的2000份静脉血栓电子病历上,本方法的平均准确率为85.26%,相对于单纯的卷积神经网络,该方案的识别准确率提高了13.52%。
Abstract: In response to the intricate semantics of electronic medical records for venous thromboembolism and the multi-dimensionality of disease information that gives rise to incomplete acquisition of disease features and inaccurate entity recognition, this paper presents a parallel neural network-based disease feature and entity recognition approach. Firstly, the language representation RoBERTa model is employed to more effectively acquire the feature information of medical record entities. Subsequently, a bidirectional long short-term memory network is utilized to extract global features from the medical record, followed by a parallel iterative dilated convolutional neural network for extracting local features from the medical record. Eventually, the CRF inference layer is adopted to rectify the disease feature labels output by the neural network. On the 2000 venous thromboembolism electronic medical records provided by the hospital, the average accuracy of the proposed method is 85.26%, which is 13.52% higher than that of the pure convolutional neural network.
文章引用:杨建兴, 卢照敢, 赵柴学正. 基于并行神经网络的疾病特征实体识别方法[J]. 计算机科学与应用, 2024, 14(10): 58-66. https://doi.org/10.12677/csa.2024.1410202

参考文献

[1] 中华人民共和国国家卫生和计划生育委员会. 电子病历应用管理规范(试行) [J]. 中国实用乡村医生杂志, 2017, 24(6): 1-2.
[2] Syachrul, M.A.K., Bijaksana, M.A. and Huda, A.F. (2019) Person Entity Recognition for the Indonesian Qur’an Translation with the Approach Hidden Markov Model-Viterbi. Procedia Computer Science, 157, 214-220. [Google Scholar] [CrossRef
[3] Glabal Tone Communication Technology Co. Ltd. (2020) Korean Named-Entity Recognition Method Based on Maximum Entropy Model and Neural Network Model. Patent Application Approval Process (USPTO 20200302118).
[4] Imam, A.T., Alhroob, A. and Jumah, W. (2021) SVM Machine Learning Classifier to Automate the Extraction of SRS Elements. International Journal of Advanced Computer Science and Applications, 12, 174-185. [Google Scholar] [CrossRef
[5] Liu, S., He, T. and Dai, J. (2021) A Survey of CRF Algorithm Based Knowledge Extraction of Elementary Mathematics in Chinese. Mobile Networks and Applications, 26, 1891-1903. [Google Scholar] [CrossRef
[6] Goyal, A., Gupta, V. and Kumar, M. (2021) Recurrent Neural Network-Based Model for Named Entity Recognition with Improved Word Embeddings. IETE Journal of Research, 69, 6970-6976. [Google Scholar] [CrossRef
[7] Chen, J.-Q., Zhu, Z.-C., Zhang, F., Zeng, K., Jiang, H.-Z. and Cheng, Z.-N. (2023) A BIGRU-Based Stacked Attention Network for Biomedical Named Entity Recognition with Chinese EMRs. Studies in Health Technology and Informatics, 308, 757-767. [Google Scholar] [CrossRef
[8] 叶恩光, 张晓如, 张再跃, 等. 基于BERT和领域词典融合的中文电子病历命名实体识别[J]. 计算机与数字工程, 2024, 52(3): 746-750, 767.
[9] 周宇辉, 何志琴. 基于改进注意力机制的图像描述算法[J]. 智能计算机与应用, 2022, 12(2): 58-63.
[10] Yu, F. and Koltun, V. (2015) Multi-Scale Context Aggregation by Dilated Convolutions. [Google Scholar] [CrossRef
[11] 杜敏, 黄红娟, 徐西楚, 等. Wells评分联合D-二聚体水平对重症肺血栓栓塞患者近期预后不良的预测分析[J]. 中国急救复苏与灾害医学杂志, 2023, 18(12): 1610-1614.