数智驱动背景下的语音识别技术在英语口语教学中的应用研究
A Study on the Application of Speech Recognition Technology in English Speaking Instruction under the Data-Intelligence-Driven Context
摘要: 全球化与智慧教育背景下,大学英语口语教学受大班制反馈不足、学生自主练习低效等问题制约。本文定位为基于系统性文献回顾的教学模式构建与应用提案,基于纠正性反馈理论,系统梳理国内外相关研究与典型案例,分析数智驱动下语音识别技术在英语口语“识别–教学–反馈”环节的应用。研究表明,语音识别技术可实现段音规模化检测、客观评估与即时反馈,缓解教师反馈压力,支撑“课前–课中–课后”学习;但在超段音检测、错误原因解释及情感支持上存在局限,需与教师协同形成混合反馈模式。本文通过综述明确语音识别技术的应用价值与局限,为大学英语口语教学智能化转型提供参考,也为纠正性反馈理论在新技术场景的应用拓展思路。
Abstract: In the context of globalization and smart education, college English speaking instruction faces challenges such as limited feedback in large-class settings and low efficiency in students’ autonomous practice. This study positions itself as a teaching model construction and application proposal based on a systematic literature review. Based on corrective feedback theory, this paper systematically reviews relevant domestic and international research and representative cases, analyzing the application of Automatic Speech Recognition (ASR) technology in the “recognition-instruction-feedback” stages of English speaking under intelligent data-driven conditions. The study finds that ASR can achieve large-scale phoneme-level detection, objective evaluation, and instant feedback, thereby alleviating teachers’ feedback burden and supporting learning before, during, and after class. However, limitations remain in suprasegmental feature detection, error cause interpretation, and emotional support. Thus, a hybrid feedback model integrating teacher participation is necessary. Through this review, the paper clarifies the value and limitations of ASR in college English speaking instruction, providing references for the intelligent transformation of English teaching and expanding the application scope of corrective feedback theory in emerging technological contexts.
文章引用:宁晓婷. 数智驱动背景下的语音识别技术在英语口语教学中的应用研究[J]. 教育进展, 2025, 15(11): 1544-1553. https://doi.org/10.12677/ae.2025.15112199

参考文献

[1] Dai, Y. and Wu, Z. (2021) Mobile-Assisted Pronunciation Learning with Feedback from Peers and/or Automatic Speech Recognition: A Mixed-Methods Study. Computer Assisted Language Learning, 36, 861-884. [Google Scholar] [CrossRef
[2] Eskenazi, M. (2009) An Overview of Spoken Language Technology for Education. Speech Communication, 51, 832-844. [Google Scholar] [CrossRef
[3] Inceoglu, S., Lim, H. and Chen, W. (2020) ASR for EFL Pronunciation Practice: Segmental Development and Learners’ Beliefs. The Journal of AsiaTEFL, 17, 824-840. [Google Scholar] [CrossRef
[4] Wang, Y. and Young, S.S. (2014) Effectiveness of Feedback for Enhancing English Pronunciation in an ASR‐Based CALL System. Journal of Computer Assisted Learning, 31, 493-504. [Google Scholar] [CrossRef
[5] Cucchiarini, C. and Strik, H. (2017) Automatic Speech Recognition for Second Language Pronunciation Training. In: Kang, O. and Thomson, R.I., Eds., The Routledge Handbook of Contemporary English Pronunciation, Routledge, 556-569. [Google Scholar] [CrossRef
[6] Swain, M. (1985) Communicative Competence: Some Roles of Comprehensible Input and Comprehensible Output in Its Development. In: Gass, S. and Madden, C., Eds., Input in Second Language Acquisition, Newbury House, 165-179.
[7] Schmidt, R.W. (1990) The Role of Consciousness in Second Language Learning. Applied Linguistics, 11, 129-158. [Google Scholar] [CrossRef
[8] Long, M.H. (1996) The Role of the Linguistic Environment in Second Language Acquisition. In: Doughty, C.J. and Long, M.H., Eds., Handbook of Second Language Acquisition, Elsevier, 413-468. [Google Scholar] [CrossRef
[9] Lyster, R. and Ranta, L. (1997) Corrective Feedback and Learner Uptake: Negotiation of Form in Communicative Classrooms. Studies in Second Language Acquisition, 19, 37-66. [Google Scholar] [CrossRef
[10] Thi-Nhu Ngo, T., Hao-Jan Chen, H. and Kuo-Wei Lai, K. (2023) The Effectiveness of Automatic Speech Recognition in ESL/EFL Pronunciation: A Meta-Analysis. ReCALL, 36, 4-21. [Google Scholar] [CrossRef
[11] Liakin, D., Cardoso, W. and Liakina, N. (2015) Learning L2 Pronunciation with a Mobile Speech Recognizer: French /y/. CALICO Journal, 32, 1-25. [Google Scholar] [CrossRef
[12] Derwing, T.M. and Munro, M.J. (2005) Second Language Accent and Pronunciation Teaching: A Research-Based Approach. TESOL Quarterly, 39, 379-397. [Google Scholar] [CrossRef
[13] McCrocklin, S. (2019) Learners’ Feedback Regarding ASR-Based Dictation Practice for Pronunciation Learning. CALICO Journal, 36, 119-137. [Google Scholar] [CrossRef
[14] Zhong, H., Xie, Y. and Yao, Z. (2024) Leveraging Large Language Models to Refine Automatic Feedback Generation at Articulatory Level in Computer Aided Pronunciation Training. Interspeech 2024, Kos Island, 1-5 September 2024, 2600-2604. [Google Scholar] [CrossRef
[15] 陈桦, 吴奎, 李景娜. 英语口语自动评测新方法——中国学生英语朗读自动评测系统[J]. 外语电化教学, 2019(1): 72-77
[16] 涂惠燕, 陈一宁. 基于语音识别和手机平台的英语口语发音学习系统[J]. 计算机应用与软件, 2011, 28(9): 64-66, 147.
[17] 于娇娜.人工智能在英语口语教学中的应用[C]//人民教育出版社人教数字教育研究院. 数字教材∙数字化教学——第四届中小学数字化教学研讨会论文案例集. 北京: 北京市通州区永乐店中学, 2019: 414-416.
[18] 霍雨佳. 智能化工具在英语教学中的辅助作用[J]. 基础教育课程, 2019(14): 75-77.
[19] 苏雯超, 招晓杏, 李德凤. 计算机辅助口译技术在翻译实践中的可用性研究[J]. 外语教学, 2022, 43(4): 69-75.