基于深度神经网络和注意力机制的实体关系抽取方法研究
Entity Relation Extraction Based on Deep Neural Network and Attention Mechanism
摘要: 关系抽取是自动构建知识图谱的关键技术之一,其根本目标是抽取实体间的语义关联关系。针对非结构化文本实体关系抽取方法中存在的上下文环境信息难以准确表征、句子间的实体关系特征未被充分利用等问题,本文提出了一种新的基于神经网络和注意力机制相结合的关系抽取模型PCNN-ATT-BiLSTM。该模型采用“RNN + CNN”网络框架,其中的RNN利用双向长短期记忆神经网络(Bi-LSTM)来捕获文本语句的上下文信息和浅层语义特征,利用分段卷积神经网络(PCNN)捕获文本语句的局部短语特征,并结合注意力机制捕获文本语句的关键信息进行关系预测。该模型在公开数据集SemEval-2010 Task8上取得了82.92%的F1值,实验结果表明,该方法在非结构化文本的实体关系抽取方面表现出了较好的性能,为实体关系的自动获取提供了新的方法支持。
Abstract: The relation extraction is one of the key technologies to automatically construct knowledge graph, and its fundamental goal is to extract semantic relation between entities. The current methods for extracting entity relation from unstructured text are difficult to accurately describe the context information, and the entity relation between sentences is not fully utilized. This paper proposes a new relation extraction model PCNN-ATT-BiLSTM based on the combination of neural network structure and attention mechanism. The model adopts the “RNN + CNN” framework, in which RNN uses the Bi-directional Long Short-Term Memory network (Bi-LSTM) to capture the context information and shallow semantic features of text data statements, uses the Piecewise Convolutional Neural Network (PCNN) to capture the local phrase features of text statements, and combines the attention mechanism to capture the key information of text statements, so as to better capture the information in sentences for relationship prediction. The model achieves 82.92% F1 value on the public dataset SemEval-2010 Task8. The experiment shows that the method shows good performance and provides a new method support for the automatic acquisition of entity relation.
文章引用:陈泽峰, 赵占芳. 基于深度神经网络和注意力机制的实体关系抽取方法研究[J]. 计算机科学与应用, 2022, 12(10): 2395-2404. https://doi.org/10.12677/CSA.2022.1210245

参考文献

[1] 庄传志, 靳小龙, 李忠, 孙智. 基于注意力机制的混合神经网络关系分类方法[J]. 计算机工程与科学, 2020, 42(1): 64-70.
[2] Hendrickx, I., Kim, S.N., Kozareva, Z., Nakov, P., Séaghdha, D.Ó., Padó, S., et al. (2009) SemEval-2010 Task 8: Multi-Way Classification of Semantic Relation between Pairs of Nominals. Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, Boulder, 4 July 2009, 94-99. [Google Scholar] [CrossRef
[3] 刘辉, 江千军, 桂前进, 张祺, 王梓豫, 王磊, 等. 实体关系抽取技术研究进展综述[J]. 计算机应用研究, 2020, 37(S2): 1-5.
[4] Lin, Y., Shen, S., Liu, Z., Luan, H. and Sun, M. (2016) Neural Relation Extraction with Selective Attention over Instances. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, 7-12 August 2016, 2124-2133. [Google Scholar] [CrossRef
[5] Xu, Y., Jia, R., Mou, L., Li, G., Chen, Y.C., Lu, Y.Y., et al. (2016) Improve Relation Classification by Deep Recurrent Neural Networks with Data Augmentation. Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, 11-16 December 2016, 1461-1470.
[6] 王红, 史金钏, 张志伟. 基于注意力机制的LSTM的语义关系抽取[J]. 计算机应用研究, 2018, 35(5): 1417-1420+1440.
[7] Kim, Y. (2014) Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, 25-19 October 2014, 1746-1751. [Google Scholar] [CrossRef
[8] Zeng, D., Liu, K., Lai, S., Zhou, G.Y. and Zhao, J. (2014) Relation Classification via Convolutional Deep Neural Network. Proceedings of COLING 2014, the 25th International Confer-ence on Computational Linguistics: Technical Papers, Dublin, 23-29 August 2014, 2335-2344.
[9] Zeng, D., Liu, K., Chen, Y. and Zhao, J. (2015) Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, 17-21 September 2015, 1753-1762. [Google Scholar] [CrossRef
[10] Socher, R., Huval, B., Manning, C.D. and Ng, A.Y. (2012) Semantic Compositionality through Recursive Matrix-Vector Spaces. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, 12-14 July 2012, 1201-1211.
[11] Zhang, D. and Wang, D. (2015) Relation Classification via Recurrent Neural Net-work. arXiv:1508.01006.
[12] Zhang, X.B., Chen, F.C., Huang, R.Y., et al. (2018) Entity Relationship Extraction Based on the Fusion of CNN and Bidirectional LSTM. Chinese Journal of Network and Information Security, 4, 48-55.
[13] Zhou, P., Shi, W., Tian, J., Qi, Z.Y., Li, B.C., Hao, H.W., et al. (2016) Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Berlin, 7-12 August 2016, 207-212. [Google Scholar] [CrossRef
[14] Pennington, J., Socher, R. and Manning, C.D. (2014) GloVe: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, 25-29 October 2014, 1532-1543. [Google Scholar] [CrossRef
[15] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[16] 石磊, 王毅, 成颖, 魏瑞斌. 自然语言处理中的注意力机制研究综述[J]. 数据分析与知识发现, 2020, 4(5): 1-14.