融合动态知识图谱构建的跨语言船舶设计问答方法
A Cross-Lingual Question-Answering Method for Ship Design Integrating Dynamic Knowledge Graph Construction
摘要: 针对船舶情报系统中知识图谱构建成本高、多语言知识碎片化及动态更新困难等问题,文章提出融合动态知识图谱构建的跨语言检索增强生成方法。该方法通过迭代式精细化多标签抽取算法,从多语言非结构化文档中自动构建可演化的船舶设计知识图谱;引入路径置信度加权的混合检索机制,将结构化知识路径与语义检索协同融合;并通过跨语言自然语言推理驱动的协同验证策略,提升生成答案的事实可靠性。实验在英、中、法、德四语种文档上进行评估,多语言平均F1值达86.8%,跨语言实体对齐准确率达94.7%;端到端问答事实正确性为92.3%,较基线方法提升7.8个百分点;针对近期新发布规范的查询,动态更新机制使事实正确性较静态图谱版本提升23.7个百分点,验证了本方法在知识时效性维护方面的有效性。
Abstract: To address the challenges of high construction cost, multilingual knowledge fragmentation, and difficulty in dynamic updating of knowledge graphs in maritime intelligence systems, this paper proposes a cross-lingual retrieval-augmented generation method integrating dynamic knowledge graph construction. The method automatically builds an evolvable ship design knowledge graph from multilingual unstructured documents via an iterative multi-label extraction algorithm, introduces a hybrid retrieval mechanism with path confidence weighting, and enhances the factual reliability of generated answers through a cross-lingual NLI-driven collaborative verification strategy. Evaluated on English, Chinese, French, and German documents, the method achieves a multilingual average F1-score of 86.8% and a cross-lingual entity alignment accuracy of 94.7%. The end-to-end question-answering factual correctness reaches 92.3%, outperforming the baseline by 7.8 percentage points. For queries involving recently updated regulations, the dynamic update mechanism improves factual correctness by 23.7 percentage points over the static graph version, verifying the effectiveness of this method in maintaining the timeliness of knowledge.
文章引用:梁钟渝, 程良伦. 融合动态知识图谱构建的跨语言船舶设计问答方法[J]. 计算机科学与应用, 2026, 16(5): 365-375. https://doi.org/10.12677/csa.2026.165190

参考文献

[1] International Maritime Organization (2023) MARPOL Annex VI Amendments. IMO Publishing.
[2] 彭子成, 徐国印, 陈建华. 船舶设计知识管理现状与发展趋势[J]. 中国造船, 2021, 62(2): 1-12.
[3] Lewis, P., Perez, E., Piktus, A., et al. (2020) Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, 6-12 December 2020, 9459-9474.
[4] 张明, 李海涛. 面向船舶情报系统的跨语言知识图谱构建与推理方法[J]. 计算机科学, 2024, 51(8): 112-121.
[5] Yao, L., Mao, C. and Luo, Y. (2019) KG-BERT: BERT for Knowledge Graph Completion.
[6] He, G., Lan, Y., Jiang, J., Zhao, W.X. and Wen, J. (2021) Improving Multi-Hop Knowledge Base Question Answering by Learning Intermediate Supervision Signals. Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 8-12 March 2021, 553-561. [Google Scholar] [CrossRef
[7] Edge, J., Trinh, H., Cheng, N., et al. (2024) From Local to Global: A Graph RAG Approach to Query-Focused Summarization.
[8] 郑志明, 李晔, 陈亮. 海事情报知识图谱的自动构建方法研究[J]. 计算机工程与应用, 2022, 58(14): 78-88.
[9] Chen, J., Xiao, S., Zhang, P., Luo, K., Lian, D. and Liu, Z. (2024) M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings through Self-Knowledge Distillation. Findings of the Association for Computational Linguistics ACL 2024, Bangkok, August 2024, 2318-2335. [Google Scholar] [CrossRef
[10] Zhang, H., Wang, T., Liang, Z., Huang, Z., Chen, C. and Cheng, L. (2025) Multilingual Graph Retrieval-Augmented Generation for Product Design Using Design Knowledge. Journal of Engineering Design. [Google Scholar] [CrossRef
[11] Tong, S., Li, J. and Gao, X. (2020) Random Walk with Restart for Information Retrieval in Knowledge Graphs. Information Sciences, 520, 331-344.