基于实体关系双向引导的中文少样本关系三元组抽取
Bidirectional Entity-Relation Guided Chinese Few-Shot Relation Triple Extraction
DOI: 10.12677/sea.2025.144080, PDF,    国家自然科学基金支持
作者: 李雨辰, 张明西*, 殷菘泽, 曹祥龙, 王凌璇:上海理工大学出版学院,上海
关键词: 关系三元组抽取知识图谱少样本学习Relation Triple Extraction Knowledge Graph Few-Shot Learning
摘要: 关系三元组抽取是知识图谱构建中的关键任务。然而,实际应用中关系类别普遍呈现长尾分布,许多低频关系缺乏充足的标注样本,严重制约了监督学习方法的性能。为此,我们提出了一种基于实体关系双向引导的中文少样本关系三元组抽取方法,通过构建实体识别与关系分类之间的双向信息流,增强子任务间的信息交互与协同建模能力。为提升模型的表示能力和分类判别性能,我们设计了基于token置信度的加权原型构建方法,以动态评估各token的语义贡献并优化原型表示。同时,我们引入原型多样性约束损失,在提升类内一致性的基础上增强类间可分性,从而强化类别判别效果。实验结果表明,所提的方法在多个评价指标上显著优于主流基线模型,验证了其在少样本场景下的有效性与泛化能力。
Abstract: Relation triple extraction is a critical task in the construction of knowledge graphs. However, in real-world applications, the distribution of relation types often follows a long-tail pattern, where many low-frequency relations lack sufficient annotated examples, severely limiting the performance of supervised learning methods. To address this challenge, we propose a Chinese few-shot relation triple extraction method based on a bidirectional entity-relation guidance mechanism. By constructing a bidirectional information flow between entity recognition and relation classification, our method enhances the interaction and joint modeling between the two subtasks. To improve the model’s representation capability and classification performance, we further design a token-confidence-based weighted prototype construction method, which dynamically assesses the semantic contribution of each token to optimize prototype representations. In addition, we introduce a prototype diversity constraint loss to improve intra-class consistency while enhancing inter-class separability, thereby strengthening the model’s discriminative ability. Experimental results show that our method significantly outperforms mainstream baseline models across multiple evaluation metrics, demonstrating its effectiveness and generalization ability in few-shot scenarios.
文章引用:李雨辰, 张明西, 殷菘泽, 曹祥龙, 王凌璇. 基于实体关系双向引导的中文少样本关系三元组抽取[J]. 软件工程与应用, 2025, 14(4): 906-918. https://doi.org/10.12677/sea.2025.144080

参考文献

[1] Yan, Y., Sun, H. and Liu, J. (2021) A Review and Outlook for Relation Extraction. Proceedings of the 5th International Conference on Computer Science and Application Engineering, Sanya, 19-21 October 2021, 1-5. [Google Scholar] [CrossRef
[2] Devlin, J., Chang, M.-W., Lee, K. and Toutanova, K. (2019) BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, Minneapolis, June 2019, 4171-4186.
[3] Wang, J., Zhang, L., Liu, J., et al. (2022) TGIN: Translation-Based Graph Inference Network for Few-Shot Relational Triplet Extraction. IEEE Transactions on Neural Networks and Learning Systems, 35, 9147-9161.
[4] Yu, H., Zhang, N., Deng, S., Ye, H., Zhang, W. and Chen, H. (2020) Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction. Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, 8-13 December 2020, 6399-6410. [Google Scholar] [CrossRef
[5] Cong, X., Sheng, J., Cui, S., Yu, B., Liu, T. and Wang, B. (2022) Relation-Guided Few-Shot Relational Triple Extraction. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, 11-15 July 2022, 2206-2213. [Google Scholar] [CrossRef
[6] Yang, C., Jiang, S., He, B., Ma, C. and He, L. (2023) Mutually Guided Few-Shot Learning for Relational Triple Extraction. 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 4-10 June 2023, 1-5. [Google Scholar] [CrossRef
[7] Chan, Y.S. and Roth, D. (2011) Exploiting Syntactico-Semantic Structures for Relation Extraction. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human LANGUAGE Technologies, Stroudsburg, 19-24 June 2011, 551-560.
[8] Zhao, S., Hu, M., Cai, Z., Chen, H. and Liu, F. (2021) Dynamic Modeling Cross-and Self-Lattice Attention Network for Chinese Ner. Proceedings of the AAAI Conference on Artificial Intelligence, 35, 14515-14523. [Google Scholar] [CrossRef
[9] Lin, Y., Liu, Z. and Sun, M. (2017) Neural Relation Extraction with Multi-Lingual Attention. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, July 30-August 4, 2017, 34-43. [Google Scholar] [CrossRef
[10] Zeng, X., Zeng, D., He, S., Liu, K. and Zhao, J. (2018) Extracting Relational Facts by an End-to-End Neural Model with Copy Mechanism. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, 15-20 July 2018, 506-514. [Google Scholar] [CrossRef
[11] Wang, Y., Yao, Q., Kwok, J.T. and Ni, L.M. (2020) Generalizing from a Few Examples: A Survey on Few-Shot Learning. ACM Computing Surveys, 53, 1-34. [Google Scholar] [CrossRef
[12] Feng, J., Xu, G., Wang, Q., Yang, Y. and Huang, L. (2024) Note the Hierarchy: Taxonomy-Guided Prototype for Few-Shot Named Entity Recognition. Information Processing & Management, 61, Article 103557. [Google Scholar] [CrossRef
[13] Han, J., Cheng, B., Wan, Z. and Lu, W. (2023) Towards Hard Few-Shot Relation Classification. IEEE Transactions on Knowledge and Data Engineering, 35, 9476-9489. [Google Scholar] [CrossRef
[14] Fei, J., Zeng, W., Zhao, X., Li, X. and Xiao, W. (2022) Few-Shot Relational Triple Extraction with Perspective Transfer Network. Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Atlanta, 17-21 October 2022, 488-498. [Google Scholar] [CrossRef
[15] Liao, B., Lu, Z. and Guo, Y. (2024) RCTE: Relation Candidate-Guided Few-Shot Relational Triple Extraction. Proceedings of the 2024 3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy, Nanjing, 1-3 March 2024, 486-491. [Google Scholar] [CrossRef
[16] Gao, C., Zhang, X., Jin, Z., Cai, W., Wang, D., Du, K., et al. (2025) SQGE: Support-Query Prototype Guidance and Enhancement for Few-Shot Relational Triple Extraction. Neural Networks, 185, Article 107172. [Google Scholar] [CrossRef] [PubMed]
[17] Seo, M., Kembhavi, A., Farhadi, A. and Hajishirzi, H. (2017) Bidirectional Attention Flow for Machine Comprehension. 5th International Conference on Learning Representations, Toulon, 24-26 April 2017.
https://openreview.net/forum?id=HJ0UKP9ge
[18] Li, S., He, W., Shi, Y., Jiang, W., Liang, H., Jiang, Y., et al. (2019) Duie: A Large-Scale Chinese Dataset for Information Extraction. In: Tang, J., Kan, M.Y., Zhao, D., Li, S. and Zan, H., Eds., Lecture Notes in Computer Science, Springer International Publishing, 791-800. [Google Scholar] [CrossRef
[19] 张龙辉, 尹淑娟, 任飞亮, 等. BSLRel: 基于二元序列标注的级联关系三元组抽取模型[J]. 中文信息学报, 2021, 35(6): 74-84.
[20] He, Y., Li, Z. and Dou, Z. (2025) Chinese Relation Extraction for Constructing Satellite Frequency and Orbit Knowledge Graph: A Survey. Digital Communications and Networks. In Press. [Google Scholar] [CrossRef
[21] 百度. DuIE2.0中文关系抽取数据集[EB/OL].
https://aistudio.baidu.com/datasetdetail/180082, 2025-07-01.
[22] Kingma, D.P. (2014) Adam: A Method for Stochastic Optimization. arXiv:1412.6980.
[23] Cui, Y., Che, W., Liu, T., Qin, B. and Yang, Z. (2021) Pre-Training with Whole Word Masking for Chinese Bert. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 3504-3514. [Google Scholar] [CrossRef
[24] Sun, Y., Wang, S., Feng, S., et al. (2021) Ernie 3.0: Large-Scale Knowledge Enhanced Pre-Training for Language Understanding and Generation. arXiv:2107.02137.