英语写作自动评分系统研究概述
Review of Research on English Writing Automated Scoring System
摘要: 由于大规模考试数量的增多和考试人数的增加,传统的人工阅卷不再能够满足跨地域、跨学科的大规模考试阅卷,各地人工阅卷工作不仅给评阅教师增加了极大的负担,同时也大大降低了评阅工作的效率。随着自然语言处理技术的不断成熟,自动评分系统有望成为解决这一难题的希望。作为现代教育技术的自动评分系统在云端技术的支持下,以语料库为基础,对学生的答卷进行评分和反馈,这不仅符合了“重视现代信息技术应用,丰富英语课程学习资源”的政策要求,同时也顺应了大数据时代和人工智能兴起的时代背景,对教育教学工作者、教师、学生都产生了不可低估的作用。本文将从自动评分系统发展概述、自动评分系统信效度研究、自动评分系统应用三个方面展开论述。
Abstract: Due to the increasing number of large-scale exams and the growing number of test takers, traditional manual grading is no longer able to meet the demands of large-scale exams across regions and disciplines. Manual grading not only imposes a significant burden on grading teachers but also greatly reduces the efficiency of grading work. With the continuous maturation of natural language processing technology, automated scoring systems are expected to provide hope in addressing this challenge. Supported by cloud technology, automated scoring systems in modern educational technology, based on corpora, score and provide feedback on students’ answer sheets. This not only aligns with the policy requirements of “emphasizing the application of modern information technology and enriching English course learning resources” but also meets the era background of big data and the rise of artificial intelligence, which plays an undeniable role for educational and teaching staff, teachers, and students. This research discusses three aspects of automated scoring systems: an overview of their development, research on their reliability and validity, and their specific applications.
文章引用:汪鑫济. 英语写作自动评分系统研究概述[J]. 现代语言学, 2026, 14(2): 13-20. https://doi.org/10.12677/ml.2026.142108

参考文献

[1] 王跃武. 大学英语四、六级考试作文网上阅卷实验研究[J]. 外语界, 2004(5): 74-79.
[2] 曾用强. 过程化的写作评估模式[J]. 福建外语, 2002(3): 26-31.
[3] 梁茂成. 学习者英语书面语料自动词性赋码的信度研究[J]. 外语教学与研究, 2006(4): 279-286+320.
[4] Burstein , J. and Chodorow, M. (n.d.) Automated Essay Scoring for Nonnative English Speakers.
https://www.ets.org/Media/Research/pdf/erater_acl99rev.pdf
[5] Lonsdale, D. and Strong-Krause, D. (2003) Automated Rating of ESL Essays.
https://aclanthology.org/W03-0209/
[6] Elliot, S. and Mikulas, C. (2004) The Impact of My Access! Use on Student Writing Performance: A Technology Overview and Four Studies. The Annual Meeting of American Educational Research Association, San Diego, April 2004.
[7] 梁茂成, 文秋芳. 国外作文自动评分系统评述及启示[J]. 外语电化教学, 2007(5): 18-24.
[8] 李艳, 葛诗利. 大学英语作文自动评分中分级词表的效度研究[J]. 外语与外语教学, 2008(10): 48-52.
[9] 黄红兵. 在线大学英语写作形成性评价模型构建研究[J]. 现代教育技术, 2015, 25(1): 79-86.
[10] 李艳玲, 田夏春. iWrite 2.0在线英语作文评分信度研究[J]. 现代教育技术, 2018, 28(2): 75-80.
[11] 张国强, 何芳. 英语作文自动评分系统的信度和效度研究——基于不同类型写作任务文本量化特征分析[J]. 外语测试与教学, 2022(1): 44-56.
[12] Shermis, M., Burstein, J. and Bliss, L. (2004) The Impact of Automated Essay Scoring on High Stakes Writing Assessments. The Annual Meeting of the National Council on Measurement in Education, San Diego, April 2004.
[13] Warschauer, M. and Ware, P. (2006) Automated Writing Evaluation: Defining the Classroom Research Agenda. Language Teaching Research, 10, 157-180. [Google Scholar] [CrossRef
[14] Rich, C., Harrington, H., Kim, J. and West, B. (2008) Automated Essay Scoring in State Formative and Summative Assessment. The Annual Meeting of American Educational Research Association, New York, March 2008.
[15] Jiang, Y. (2015) An Automated Essay‐Evaluation Corpus of English as a Foreign Language Writing. British Journal of Educational Technology, 46, 1109-1117. [Google Scholar] [CrossRef
[16] 韩宁. 几个英语作文自动评分系统的原理与评述[J]. 中国考试(研究版), 2009(3): 38-44.
[17] Page, E. (2003) Project Essay Grade: PEG. In: Shermis, M. and Burstein, J., Eds., Automated Essay Scoring: A Cross-disciplinary Perspective, Lawrence Erlbaum, 43-54.
[18] 陈潇潇, 葛诗利. 自动作文评分研究综述[J]. 解放军外国语学院学报, 2008(5): 78-83.
[19] 唐锦兰, 吴一安. 在线英语写作自动评价系统应用研究述评[J]. 外语教学与研究, 2011, 43(2): 273-282+321.
[20] 张双祥. 大学英语写作教学中在线写作自动评价系统应用研究[J]. 当代教育理论与实践, 2014, 6(11): 100-102.
[21] 冯庆华, 张开翼. 人工智能辅助外语教学与研究的能力探析——以ChatGPT-4o和文心大模型4.0为例[J]. 外语电化教学, 2024(3): 3-12+109.
[22] 陈茉, 吕明臣. ChatGPT环境下的大学英语写作教学[J]. 当代外语研究, 2024(1): 161-168.
[23] 毛延生, 王一航, 邢艳茹. ChatGPT辅助高中英语写作反馈的实证研究[J]. 教育测量与评价, 2024(1): 3-13.
[24] 桂诗春. 语言测试: 新技术与新理论[J]. 外语教学与研究, 1989(3): 2-10+80.