基于改进CasRel模型的风力发电机故障领域实体关系抽取研究
Research on Entity Relationship Extraction in Wind Turbine Fault Domain Based on Improved CasRel Model
DOI: 10.12677/airr.2025.143066, PDF,   
作者: 张友星*:北京信息科技大学计算机学院,北京;王兴芬#:北京信息科技大学管理科学与工程学院,北京
关键词: 风力发电机故障CasRel余弦注意力SOTAWind Turbine Faults CasRel Cosine Attention SOTA
摘要: 针对风力发电机故障领域关系抽取任务存在大量的专业术语、单实体重叠和多实体重叠、单句子关系多以及句子结构复杂的问题,本文提出一种改进的CasRel二元级联标记框架CA-CasRel。由于ERNIE模型在预训练过程中引入了实体信息和知识图谱信息,本研究首先用ERNIE模型代替BERT模型进行句子编码嵌入,深层次地捕捉文本序列的实体关系信息。另外,本研究在预测头实体之后,对头实体向量用余弦注意力机制进行编码,余弦注意力具有方向敏感性和语义纯度强化,可高效提取语义的本质特征和对稀疏特征的无偏捕捉。最终,所提架构和基线CasRel模型相比,其精确率、召回率和F1值分别提升了1.71%、6.09%和4.05%,达到了领域内的SOTA效果。
Abstract: In the task of relation extraction within the wind turbine fault domain, there are numerous challenges, such as the presence of domain-specific terminology, overlapping of single and multiple entities, multiple relations in a single sentence, and complex sentence structures. This paper proposes an improved CasRel-based binary tagging framework, CA-CasRel. Since the ERNIE model incorporates entity information and knowledge graph data during pre-training, this study replaces the BERT model with ERNIE for sentence encoding and embedding, enabling a deeper capture of entity-relation information in text sequences. Furthermore, after predicting the head entity, the head entity vector is encoded using a cosine attention mechanism. Cosine attention, with its direction sensitivity and enhanced semantic purity, efficiently captures essential semantic features and unbiasedly detects sparse features. Compared to the original CasRel model, the proposed architecture improves precision, recall, and F1 score by 1.71%, 6.09%, and 4.05%, respectively, achieving state-of-the-art performance in the domain.
文章引用:张友星, 王兴芬. 基于改进CasRel模型的风力发电机故障领域实体关系抽取研究[J]. 人工智能与机器人研究, 2025, 14(3): 670-683. https://doi.org/10.12677/airr.2025.143066

参考文献

[1] Detroja, K., Bhensdadia, C.K. and Bhatt, B.S. (2023) A Survey on Relation Extraction. Intelligent Systems with Applications, 19, Article ID: 200244. [Google Scholar] [CrossRef
[2] Zhang, M., Zhou, G. and Aw, A. (2008) Exploring Syntactic Structured Features over Parse Trees for Relation Extraction Using Kernel Methods. Information Processing & Management, 44, 687-701. [Google Scholar] [CrossRef
[3] Hasegawa, T., Sekine, S. and Grishman, R. (2004) Discovering Relations among Named Entities from Large Corpora. Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, 21-26 July 2004, 415-422. [Google Scholar] [CrossRef
[4] Xu, Y., Mou, L., Li, G., Chen, Y., Peng, H. and Jin, Z. (2015) Classifying Relations via Long Short-Term Memory Networks along Shortest Dependency Paths. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, September 2015, 1785-1794. [Google Scholar] [CrossRef
[5] Zhong, Z. and Chen, D. (2021) A Frustratingly Easy Approach for Entity and Relation Extraction. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, June 2021, 50-61. [Google Scholar] [CrossRef
[6] Yang, C., Deng, J., Chen, X. and An, Y. (2023) SPBERE: Boosting Span-Based Pipeline Biomedical Entity and Relation Extraction via Entity Information. Journal of Biomedical Informatics, 145, Article ID: 104456. [Google Scholar] [CrossRef] [PubMed]
[7] Tuo, M. and Yang, W. (2023) Review of Entity Relation Extraction. Journal of Intelligent & Fuzzy Systems, 44, 7391-7405. [Google Scholar] [CrossRef
[8] Wei, Z., Su, J., Wang, Y., Tian, Y. and Chang, Y. (2020) A Novel Cascade Binary Tagging Framework for Relational Triple Extraction. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, July 2020, 1476-1488. [Google Scholar] [CrossRef
[9] Zheng, H., Wen, R., Chen, X., Yang, Y., Zhang, Y., Zhang, Z., et al. (2021) PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Online, August 2021, 6225-6235. [Google Scholar] [CrossRef
[10] Wadhwa, S., Amir, S. and Wallace, B. (2023) Revisiting Relation Extraction in the Era of Large Language Models. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Toronto, July 2023, 15566-15589. [Google Scholar] [CrossRef] [PubMed]
[11] Wei, J., Wang, X., Schuurmans, D., et al. (2022) Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. Advances in Neural Information Processing Systems, 35, 24824-24837.
[12] Brown, T.B. (2020) Language Models Are Few-Shot Learners. arXiv: 2005.14165.
[13] Chung, H.W., Hou, L., Longpre, S., et al. (2024) Scaling Instruction-Finetuned Language Models. Journal of Machine Learning Research, 25, 1-53.
[14] Wang, S., Zhang, Y., Che, W. and Liu, T. (2018) Joint Extraction of Entities and Relations Based on a Novel Graph Scheme. Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, 13-19 July 2018, 4461-4467. [Google Scholar] [CrossRef
[15] Zeng, D., Liu, K., Chen, Y. and Zhao, J. (2015) Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, September 2015, 1753-1762. [Google Scholar] [CrossRef
[16] 戴志辉, 张富泽, 张近月, 等. 基于MacBERT-BiLSTM-CRF模型的继电保护装置缺陷知识图谱构建方法[J]. 电力系统保护与控制, 2024, 52(20): 131-143.
[17] 文雯. 面向汽车故障知识图谱构建的实体关系抽取算法研究[D]: [硕士学位论文]. 西安: 西安工业大学, 2024.
[18] Zeng, X., Zeng, D., He, S., Liu, K. and Zhao, J. (2018) Extracting Relational Facts by an End-to-End Neural Model with Copy Mechanism. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, July 2018, 506-514. [Google Scholar] [CrossRef
[19] Fu, T., Li, P. and Ma, W. (2019). GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, July 2019, 1409-1418.[CrossRef