A Study on the Vocabulary Development of the Written Interlanguage of Chinese Learners with English Native Language Background
DOI: 10.12677/ml.2025.13121262
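Author: Wang Wenzhuo: Institute of Chinese Language Education / College of Chinese Language and Culture, Huaqiao University, Xiamen, Fujian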
Keywords: Chinese Interlanguage; Written Language; Lexical Richness; Language Proficiency
Abstract: Drawing on a self-built corpus and a quantitative-linguistic approach, this study examines how vocabulary use develops in the written Chinese of learners whose first language is English, across proficiency levels. Three indicators of lexical richness are used: the Guiraud Index, lexical density, and the hapax legomena rate, with native-speaker data serving as the reference. The interlanguage data come from the HSK Dynamic Composition Corpus and the native-speaker data from the People's Daily. The corpora were processed in Python, and the resulting figures were then analyzed for correlation in SPSS. The results show that as Chinese proficiency rises, the Guiraud Index and the hapax legomena rate in the writing of ECSL learners (the English-L1 learners of Chinese studied here) also rise. The lexical density of their output matches the profile of written language, and the richness and complexity of their vocabulary increase, gradually approaching target-language levels.
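The three indicators named in the abstract can be illustrated with a minimal Python sketch. It is not the author's script: the function name `lexical_richness`, the tag set `CONTENT_TAGS`, and the sample sentence are hypothetical. It assumes the text has already been segmented and part-of-speech tagged, that content words are nouns, verbs, adjectives, and adverbs, and that the hapax legomena rate is the number of once-occurring words divided by the total number of tokens. The paper may define these details differently.

```python
import math
from collections import Counter

# Assumption: POS tags counted as content words (noun, verb, adjective, adverb).
CONTENT_TAGS = {"n", "v", "a", "d"}


def lexical_richness(tagged_tokens):
    """Compute three lexical richness indicators for one text.

    tagged_tokens: list of (word, pos_tag) pairs, already segmented and
    tagged, with punctuation removed.
    """
    words = [word for word, _ in tagged_tokens]
    n_tokens = len(words)
    if n_tokens == 0:
        raise ValueError("text contains no tokens")

    counts = Counter(words)
    n_types = len(counts)
    # Hapax legomena: word types that occur exactly once in the text.
    n_hapax = sum(1 for c in counts.values() if c == 1)
    n_content = sum(1 for _, tag in tagged_tokens if tag in CONTENT_TAGS)

    return {
        # Guiraud Index: types divided by the square root of tokens.
        "guiraud": n_types / math.sqrt(n_tokens),
        # Lexical density: share of content words among all tokens.
        "density": n_content / n_tokens,
        # Assumption: hapax rate is computed relative to tokens, not types.
        "hapax_rate": n_hapax / n_tokens,
    }


# Hypothetical sample: 9 tokens, 7 types, 5 hapax legomena, 7 content words.
sample = [
    ("我", "r"), ("喜欢", "v"), ("学习", "v"), ("汉语", "n"),
    ("我", "r"), ("也", "d"), ("喜欢", "v"), ("中国", "n"), ("菜", "n"),
]

result = lexical_richness(sample)
print(result)
```

The Guiraud Index divides by the square root of the token count rather than the count itself, which makes it less sensitive to text length than a plain type-token ratio. That matters when learner compositions at different proficiency levels differ in length.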
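Citation: Wang, W.Z. (2025) A Study on the Vocabulary Development of the Written Interlanguage of Chinese Learners with English Native Language Background. Modern Linguistics, 13(12), 275-283. https://doi.org/10.12677/ml.2025.13121262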
