基于字符词组特征与原型网络融合训练的事件分类
Event Classification Based on Fusion Training of Character Phrase Features and Prototype Network
摘要: 事件检测与分类任务,包含两个步骤的子任务:识别事件触发词和将其分类为正确的事件类型。在这项任务中首要关键的就是触发词的识别,利用基于神经网络的模型来识别句子中的触发词是这些年的主流方法。然而,当涉及到由语义结构不清和语义相近的字符和词组组成的句子时,识别事件的触发词变得有些困难。本文提出一个融合字与词信息,再通过原型网络来精确事件分类的模型:输入融合字与词的信息的嵌入信息,将各个组成的嵌入信息投影到一个高维的特征空间中,对于每个维度类型的样本信息提取他们的均值作为聚类中心即原型,使用欧几里得距离作为距离度量,训练使得测试样本到自己类别原型的距离越近越好,到其他类别原型的距离越远越好,更精确地识别出句子所包含的触发词,分辨出事件类型。
Abstract: The event detection and classification task consists of two-step subtasks: identifying the event trigger word and classifying it into the correct event type. The most important thing in this task is the recognition of trigger words. Using neural network-based models to identify trigger words in sentences is the mainstream method in these years. However, when it comes to sentences composed of characters and phrases with unclear semantic structure and similar semantics, it becomes difficult to identify the trigger words of the event. This paper proposes to train an n-dimensional prototype network that integrates the embedded information of the word information: input the embedded information of the fused word and word information, and project the embedded information of each composition into a high-dimensional feature space. For each dimension type, the sample information extracts their mean value as the cluster center or prototype, and uses the Euclidean distance as the distance metric. Training makes the test sample the closer to the prototype of its own category, the better, and the farther the distance to prototypes of other categories, the better. Accurately identify the trigger words contained in the sentence and distinguish the type of event.
文章引用:赵芝茵, 程良伦, 陈光明. 基于字符词组特征与原型网络融合训练的事件分类[J]. 计算机科学与应用, 2021, 11(4): 920-927. https://doi.org/10.12677/CSA.2021.114095

参考文献

[1] Chen, Z. and Ji, H. (2009) Language Specific Issue and Feature Exploration in Chinese Event Extraction. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, Boulder, June 2009, 209-212. [Google Scholar] [CrossRef
[2] Chen, Y.B., Xu, L.H., Liu, K., Zeng, D.J. and Zhao, J. (2015) Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Pro-cessing (Volume 1: Long Papers), Beijing, July 2015, 167-176. [Google Scholar] [CrossRef
[3] Zeng, Y., Yang, H.H., Feng, Y.S., Wang, Z. and Zhao, D.Y. (2016) A Convolution BiLSTM Neural Network Model for Chinese Event Extraction. Natural Language Understanding and Intelligent Applications: 5th CCF Conference on Natural Lan-guage Processing and Chinese Computing, NLPCC 2016, and 24th International Conference on Computer Processing of Oriental Languages, ICCPOL 2016, Kunming, 2-6 December 2016, 275-287. [Google Scholar] [CrossRef
[4] Lin, H.Y., Lu, Y.J., Han, X.P. and Sun, L. (2018) Nugget Proposal Networks for Chinese Event Detection. Proceedings of the 56th Annual Meeting of the Association for Compu-tational Linguistics (Volume 1: Long Papers), Melbourne, July 2018, 1565-1574.
[5] Banerjee, A., Merugu, S., Dhil-lon, I.S. and Ghosh, J. (2005) Clustering with Bregman Divergences. Journal of Machine Learning Research, 6, 1705-1749.
[6] Ahn, D. (2006) The Stages of Event Extraction. Proceedings of the Workshop on Annotating and Rea-soning about Time and Events, Sydney, July 2006, 1-8. [Google Scholar] [CrossRef
[7] Ji, H. and Grishman, R. (2008) Refining Event Extraction through Cross-Document Inference. Proceedings of ACL-08: HLT, Co-lumbus, June 2008, 254-262.
[8] Gupta, P. and Ji, H. (2009) Predicting Unknown Time Arguments Based on Cross-Event Propagation. Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, Suntec, August 2009, 369-372. [Google Scholar] [CrossRef
[9] Liao, S.S. and Grishman, R. (2010) Using Document Level Cross-Event Inference to Improve Event Extraction. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, 11-16 July 2010, 789-797.
[10] Hong, Y., Zhang, J.F., Ma, B., Yao, J.M., Zhou, G.D. and Zhu, Q.M. (2011) Using Cross-Entity Inference to Improve Event Extraction. In: Proceedings of the 49th An-nual Meeting of the Association for Computational Linguistics: Human Language Technologies, Association for Com-putational Linguistics, Stroudsburg, 1127-1136.
[11] Li, Q., Ji, H. and Huang, L. (2013) Joint Event Extraction via Structured Prediction with Global Features. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers, 73-82.
[12] Liu, S.L., Liu, K., He, S.Z. and Zhao, J. (2016) A Probabilistic Soft Logic Based Approach to Exploiting Latent and Global Information in Event Classification. The Thirtieth AAAI Confer-ence on Artificial Intelligence, Phoenix, 12-17 February 2016.
[13] Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. and Salakhutdinov, R. (2014) Dropout: A Simple Way to Prevent Neural Networks from Overfitting. The Journal of Machine Learning Research, 15, 1929-1958.
[14] Nguyen, H.T. and Grishman, R. (2015) Event Detection and Domain Adaptation with Convolutional Neural Networks. The 53rd Annual Meeting of the Association for Compu-tational Linguistics, Beijing, 26-31 July 2015. [Google Scholar] [CrossRef
[15] Nguyen, H.T. and Grishman, R. (2016) Modeling Skip-Grams for Event Detection with Convolutional Neural Networks. The 2016 Conference on Empirical Methods in Natural Language Pro-cessing, Austin, 1-5 November 2016. [Google Scholar] [CrossRef
[16] Liu, S.L., Chen, Y.B., Liu, K. and Zhao, J. (2017) Exploiting Argument Information to Improve Event Detection via Supervised Attention Mechanisms. The 55th Annual Meeting of the Associa-tion for Computational Linguistics, Vancouver, 30 July-4 August 2017.
[17] Mikolov, T., Chen, K., Corrado, G. and Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space.
[18] Kingma, D.P. and Ba, J. (2014) Adam: A Method for Stochastic Optimization. arXiv preprintarXiv:1412.6980.
[19] Lin, H.Y., Lu, Y.J., Han, X.P. and Sun, L. (2018) Nugget Proposal Networks for Chinese Event Detection. arXiv preprint arXiv:1805.00249.
[20] Feng, X.C., Qin, B. and Liu, T. (2018) A Language-Independent Neural Network for Event Detection. Science China Infor-mation Sciences, 61, Article ID: 092106. [Google Scholar] [CrossRef