老挝语命名实体识别研究综述
A Survey of Lao Named Entity Recognition
DOI: 10.12677/ML.2018.63051, PDF,    国家自然科学基金支持
作者: 何阳宇, 易绵竹, 贾惠心, 李宏欣:中国人民解放军战略支援部队信息工程大学,河南 洛阳
关键词: 老挝语命名实体识别分词信息检索信息抽取Lao Language Named Entity Recognition Word Segmentation Information Retrieval Information Extraction
摘要: 命名实体识别是自然语言处理中一项关键的基础技术,已成为信息检索、机器翻译等诸多任务的重要组成部分。在诸多语种的命名实体识别研究工作中,老挝语命名实体识别近年来开始受到关注,然而老挝语与汉语类似的特点又使得其命名实体识别工作非常具有挑战性。本文对国内外命名实体识别的研究历程进行了简要回顾,同时介绍了老挝语分词和命名实体识别的最新成果,并对比分析了每种方法的优点和不足。最后,对老挝语命名实体识别的发展趋势作了展望。
Abstract: Named entity recognition is a key basic technology in natural language processing and has become an important part of many tasks such as information retrieval, machine translation, and so on. The recognition of Lao named entity has begun to attract attention in recent years, and the similarity of Lao and Chinese makes dealing with NER a challenge. This article briefly reviews the research history of domestic and foreign named entity recognition, introduces the latest achievements of Lao word segmentation and named entity recognition, and compares the advantages and disadvantages of each method. Finally, it forecasts the development trend of Lao named entity recognition.
文章引用:何阳宇, 易绵竹, 贾惠心, 李宏欣. 老挝语命名实体识别研究综述[J]. 现代语言学, 2018, 6(3): 449-461. https://doi.org/10.12677/ML.2018.63051

参考文献

[1] Rau, L.F. (1991) Extracting Company Names from Text. Seventh IEEE Conference on Artificial Intelligence Applications, Miami Beach, 29-32.
[2] Sundheim, B., et al. (1996) Message Understanding Conference-6: A Brief History. Conference on Computational Linguistics, Association for Computational Linguistics, Philadelphia, 466-471.
[3] Nadeau, D. and Sekine, S. (2007) A Survey of Named Entity Recognition and Classification. Lingvisticae Investigationes, 30, 3-26.
[4] 银莎格. 国内老挝语研究综述[J]. 铜仁学院学报, 2014(1): 113-116.
[5] 宗成庆. 统计自然语言处理[M]. 北京: 清华大学出版社, 2013: 152.
[6] 张晓艳, 王挺, 陈火旺. 命名实体识别研究[J]. 计算机科学, 2005, 32(4): 44-48.
[7] Srithirath, A. and Seresangtakul, P. (2013) A Hybrid Approach to Lao Word Segmentation Using Longest Syllable Level Matching with Named Entities Recognition. International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, Krabi, 1-5.
[8] Vanthanavong, S., Haruechaiyasak, C. and Lao, W.S. (2011) Lao Word Segmentation Based on Conditional Random Fields.
[9] Haruechaiyasak, C., Kongyoung, S. and Dailey, M. (2008) A Comparative Study on Thai Word Segmentation Approaches. International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, Ecti-Con, Krabi, 125-128.
[10] 孙镇, 王惠临. 命名实体识别研究进展综述[J]. 数据分析与知识发现, 2010, 26(6): 42-47.
[11] 杨梦杰. 老挝语命名实体识别方法的研究[D]: [硕士学位论文]. 昆明: 昆明理工大学, 2016.
[12] 段韶鹏. 老挝语命名实体识别研究[D]: [硕士学位论文]. 昆明: 昆明理工大学, 2017.
[13] 奚雪峰, 周国栋. 面向自然语言处理的深度学习研究[J]. 自动化学报, 2016, 42(10): 1445-1465.
[14] 麻泽武. 基于深度学习的命名实体识别技术研究[D]: [硕士学位论文]. 杭州: 浙江大学, 2016.
[15] 张海楠, 伍大勇, 刘悦, 等. 基于深度神经网络的中文命名实体识别[J]. 中文信息学报, 2017, 31(4): 28-35.
[16] 冯艳红, 于红, 孙庚, 等. 基于BLSTM的命名实体识别方法[J]. 计算机科学, 2018, 45(2): 261-268.
[17] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]