ABS-HDL:基于BIASRU的中文医学命名实体识别模型
ABS-HDL: Chinese Medical Named Entity Recognition Model Based on BIASRU
摘要: 中文医学命名实体识别旨在从中文非结构化医学文本中提取关键实体。针对模型训练时间长、传统字符向量处理方法容易忽视词边界等问题,提出了基于多头交互注意力的中文医学命名实体识别模型:ABS-HDL (ALBERT-BIASRU-SoftAttention-CRF Hybrid Deep Learning)。该方法首先使用ALBERT预训练模型分别获得词向量表示和字向量表示。其次,将字向量和词向量结合成一个字词向量矩阵。接着,本文提出了BIASRU语义提取层,通过将多头交互注意力融入到SRU中,实现了对字词向量矩阵特征的有效学习,并通过双向建模精确捕获序列上下文间的关系。此外,在软注意力机制权重分配层中,模型能够动态调整权重分配,增强了对实体边界的识别能力。最后,使用CRF解码层来优化标签序列的预测。实验结果表明,该模型在中文糖尿病数据集上与现有模型相比表现更好。
Abstract: Chinese medical named entity recognition aims to extract key entities from unstructured Chinese medical texts. Addressing issues such as the lengthy training time for models and the traditional character vector methods’ tendency to overlook word boundaries, a Chinese medical named entity recognition model based on multi-head interactive attention is proposed: ABS-HDL (ALBERT-BIASRU- SoftAttention-CRF Hybrid Deep Learning). This method initially employs the ALBERT pre-trained model to obtain separate word vector and character vector representations. Subsequently, it combines these vectors into a unified character-word vector matrix. Furthermore, this paper introduces the BIASRU semantic extraction layer, which integrates multi-head interactive attention into the SRU, effectively learning the features of the character-word vector matrix and precisely capturing the relationships within the sequence context through bidirectional modeling. Moreover, in the soft attention mechanism weight allocation layer, the model dynamically adjusts the distribution of weights, enhancing the ability to recognize entity boundaries. Lastly, a CRF decoding layer is used to optimize the prediction of the label sequence. Experimental results demonstrate that this model performs better on a Chinese diabetes dataset compared to existing models.
文章引用:盛萱妍, 邵清. ABS-HDL:基于BIASRU的中文医学命名实体识别模型[J]. 建模与仿真, 2024, 13(4): 4075-4089. https://doi.org/10.12677/mos.2024.134370

参考文献

[1] Pearson, C., Seliya, N. and Dave, R. (2021) Named Entity Recognition in Unstructured Medical Text Documents. [Google Scholar] [CrossRef
[2] Gao, Y., Wang, Y., Wang, P., et al. (2020) Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network. International Journal of Environmental Research and Public Health, 17, Article 1614. [Google Scholar] [CrossRef] [PubMed]
[3] 郑强, 刘齐军, 王正华, 等. 生物医学命名实体识别的研究与进展[J]. 计算机应用研究, 2010, 27(3): 811-815. [Google Scholar] [CrossRef
[4] 王颖洁, 张程烨, 白凤波, 等. 中文命名实体识别研究综述[J]. 计算机科学与探索, 2023, 17(2): 18. [Google Scholar] [CrossRef
[5] Kenton, J.D., Toutanova, M.W.C. and Bert, L.K. (2019) Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of naacL-HLT, 1, 2.
[6] Kalyan, K.S., Rajasekharan, A. and Sangeetha, S. (2021) AMMU—A Survey of Transformer-Based Biomedical Pretrained Language Models. DOI:10.48550/arXiv.2105.00827.
[7] Sun, Z., Li, X., Sun, X., et al. (2021) Chinesebert: Chinese Pretraining Enhanced by Glyph and Pinyin Information. arXiv preprint arXiv: 2106.16038. [Google Scholar] [CrossRef
[8] Ramos-Vargas, R.E., Román-Godínez, I. and Torres-Ramos, S. (2021) Comparing General and Specialized Word Embeddings for Biomedical Named Entity Recognition. PeerJ Computer Science, 7, e384. [Google Scholar] [CrossRef] [PubMed]
[9] Nath, N., Lee, S.H. and Lee, I. (2023) Application of Specialized Word Embeddings and Named Entity and Attribute Recognition to the Problem of Unsupervised Automated Clinical Coding. Computers in Biology and Medicine, 165, Article ID: 107422. [Google Scholar] [CrossRef] [PubMed]
[10] Lei, S., Liu, B., Wang, Y., et al. (2023) Chinese Medical Named Entity Recognition Combined with Multi-Feature Embedding and Multi-Network Fusion. Journal of Electronics and Information Technology, 45, 3032-3039.
[11] Ding, J., Xu, W., Wang, A., et al. (2023) Joint Multi-View Character Embedding Model for Named Entity Recognition of Chinese Car Reviews. Neural Computing and Applications, 35, 14947-14962. [Google Scholar] [CrossRef
[12] Li, Y., Du, G., Xiang, Y., et al. (2020) Towards Chinese Clinical Named Entity Recognition by Dynamic Embedding Using Domain-Specific Knowledge. Journal of Biomedical Informatics, 106, Article ID: 103435. [Google Scholar] [CrossRef] [PubMed]
[13] Li, D., Yan, L., Yang, J., et al. (2022) Dependency Syntax Guided BERT-BiLSTM-GAM-CRF for Chinese NER. Expert Systems with Applications, 196, Article ID: 116682. [Google Scholar] [CrossRef
[14] 於张闲, 胡孔法. 基于 BERT-Att-biLSTM模型的医学信息分类研究[J]. 计算机时代, 2020(3): 1-4.
[15] 谢靖, 刘江峰, 王东波. 古代中国医学文献的命名实体识别研究——以Flat-lattice增强的SikuBERT预训练模型为例[J]. 图书馆论坛, 2022, 42(10): 51-60.
[16] Lan, Z., Chen, M., Goodman, S., et al. (2020) ALBERT: A Lite BERT for Self-Supervised Learning of Language Representations. arXiv preprint arxiv:1909.11942.
[17] Lei, T. (2021) When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute. arXiv preprint arXiv: 2102.12459, 2021. [Google Scholar] [CrossRef
[18] Aliyun. (year) A Labeled Chinese Dataset for Diabetes.
https://tianchi.aliyun.com/competition/entrance/231687/information
[19] Ye, W., Li, B., Xie, R., et al. (2019) Exploiting Entity BIO Tag Embeddings and Multi-task Learning for Relation Extraction with Imbalanced Data. [Google Scholar] [CrossRef
[20] Liu, L., Jiang, H., He, P., et al. (2019) On the Variance of the Adaptive Learning Rate and Beyond. arXiv preprint arXiv:1908.03265, 2019.
[21] Fan, Z., He, X., Wang, L., et al. (2020) Research on Entity Relationship Extraction for Diabetes Medical Literature. 2020 IEEE 9th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), Chongqing, 11 December 2020, 424-430.
[22] Wang, Y., Sun, Y., Ma, Z., et al. (2020) Named Entity Recognition in Chinese Medical Literature Using Pretrainingmodels. Scientific Programming, 2020, 1-9. [Google Scholar] [CrossRef
[23] 韩普, 顾亮, 叶东宇, 等. 基于多任务和迁移学习的中文医学文献实体识别研究[J]. 数据分析与知识发现, 2023, 7(9): 136-145.