标题:
基于HNC的现代汉语句子基本语义类型例句库建设Building a Token Corpus of Canonical Semantic Sentence Types for Modern Chinese Based on HNC Theory
作者:
蒋严, 苗传江, 刘小蝶
关键字:
语料库, 句子语义类型, HNC(概念层次网络)理论, 现代汉语Token Corpus; Semantic Sentence Type; HNC (Hierarchical Network of Concepts) Theory; Modern Chinese
期刊名称:
《Modern Linguistics》, Vol.1 No.3, 2013-11-29
摘要:
句子语义类型例句库是开展基于语义的句子研究所需要的基础资源。我们以HNC (Hierarchical Network of Concepts,概念层次网络)理论为指导建立句子语义类型例句库,该理论建立了完整的句子语义类型体系,为基于语义的句子研究提供了良好的理论框架。我们已经建立了一个现代汉语句子基本语义类型的例句库,为每个类型配备了典型而真实的例句,并且采用XML (Extensible Markup Language,可扩展标记语言)技术标注了每个例句的语义结构,还提供了例句查询功能。我们将以这个例句库为基础,逐步扩展,为基于语义的句子研究不断积累资源。A token corpus of semantic sentence types provides elementary resource for the study of sentences from semantic perspectives. We have built such a corpus using the theory of HNC (Hierarchical Network of Concepts). As HNC contains a complete system of sentence semantic types, it provides a good theoretical framework for meaning-based sentence analysis. The corpus we built contains token sentences taken from real-life whose semantic structures are labeled using XML techniques (Extensible Markup Language). The search function for token sentence is also provided. We will use this token corpus as a basis for further developments so as to accumulate resources for meaning-based sentence analysis.