面向态靶辨治的语料库标注系统设计与实现
Design and Implementation of a Corpus Annotation System for State-Target Differentiation and Treatment
摘要: 针对中医态靶辨治领域非结构化文本阻碍知识挖掘的问题,本研究提出一种面向对象的语料库标注系统解决方案。基于筛选自CNKI的32篇态靶辨治核心文献,系统化定义10类实体(如疾病、标靶、症型态)与7类实体关系(如打靶、靶药、药理),并设计三层次标注框架:实体标注、实体属性标注、实体关系标注。系统实现了可视化的标注界面与BIE结构化的输出,支持人工高效构建高质量标注语料库。本工作既可以为后续基于深度学习的态靶辨治文本标注提供训练数据集,又能为中医同类标注系统研发提供参考。
Abstract: To address the challenge of unstructured text hindering knowledge mining in the domain of State-Target Differentiation and Treatment (STDT) of Traditional Chinese Medicine, this study proposes an object-oriented corpus annotation system solution. Based on 32 core STDT literature pieces curated from CNKI, we systematically define 10 entity categories (e.g., disease, clinical indicator, syndrome state) and 7 entity relationships (e.g., precision targeting, target-specific herb, pharmacological mechanism). A three-tier annotation framework is designed: entity annotation, entity attribute annotation, and entity relationship annotation. The system implements a visualized annotation interface with BIE-structured output, enabling efficient manual construction of high-quality annotated corpora. It not only provides a training dataset for deep learning-based STDT text annotation, but also serves as a reference framework for developing analogous TCM annotation systems.
参考文献
|
[1]
|
仝小林. 态靶医学——中医未来发展之路[J]. 中国中西医结合杂志, 2021, 41(1): 16-18.
|
|
[2]
|
李致重. 中西医防治观之比较[J]. 中医药通报, 2023, 22(5): 1-4.
|
|
[3]
|
林轶群, 赵林华, 王强, 等. 代谢综合征态靶辨治体系的构建[J]. 中医杂志, 2022, 63(13): 1223-1226.
|
|
[4]
|
何莉莎, 顾成娟, 王涵, 等. 态靶结合辨治代谢性高血压病[J]. 中医杂志, 2019, 60(16): 1423-1424+1427.
|
|
[5]
|
委李楠, 张丽, 薄彤. 浅谈中医汉语语料库的建设[J]. 中国中医药现代远程教育, 2023, 21(24): 7-9.
|
|
[6]
|
刘丽红, 付璐, 姚克宇, 等. 中医药古籍文献实体标注规范探索[J]. 中华医学图书情报杂志, 2022, 31(12): 1-6.
|
|
[7]
|
张仕娜, 高远, 郑爱华, 等. 中医厥证领域本体构建研究[J]. 湖南中医药大学学报, 2024, 44(3): 427-434.
|
|
[8]
|
朱彦, 乔幸潮, 崔一迪, 等. 中医药文献语义标注系统研究与开发[J]. 中国中医药图书情报杂志, 2020, 44(3): 5-8.
|
|
[9]
|
杨洋, 关毅, 李雪, 等. 中文医学细粒度知识表示体系与标注语料库构建[J]. 中文信息学报, 2023, 37(6): 52-66.
|
|
[10]
|
Li, H.L., Pei, X.M., Yu, H., Wang, W. and Mao, D.G. (2024) Autophagic and Apoptotic Proteins in Goat Corpus Luteum and the Effect of Adiponectin/AdipoRon on Luteal Cell Autophagy and Apoptosis. Theriogenology, 214, 245-256. [Google Scholar] [CrossRef] [PubMed]
|