基于多智能体协同的动态教学优化系统

doi:10.12677/csa.2026.165175

期刊菜单

基于多智能体协同的动态教学优化系统
Dynamic Teaching Optimization System Based on Multi-Agent Collaboration

DOI: 10.12677/csa.2026.165175, PDF,
作者: 阮子豪, 冯婧琳, 李晶：北京工商大学计算机与人工智能学院，北京
关键词: 强化学习；多智能体系统；个性化教学；Q-Learning；智能教学系统；Reinforcement Learning； Multi-Agent System； Personalized Teaching； Q-Learning； Intelligent Tutoring System

摘要: 针对传统智能教学系统缺乏动态适应性、忽视学生心理状态以及师生交互不足等问题，文章提出了一种基于多智能体强化学习的个性化教学系统。该系统通过学生智能体和教师智能体的协同优化，实现教学策略的动态调整。学生智能体基于Q-Learning算法，综合考虑掌握度、动机、疲劳度等多维状态构建智能体状态与动作空间，自主选择最优学习动作；教师智能体融合启发式规则与强化学习，根据学生状态制定个性化的教学节奏、难度和策略；评估智能体负责对学习效果进行深度诊断与可解释反馈。该系统能有效提升学生的知识点掌握度，提高教师教学质量。同时，系统集成DeepSeek大语言模型，实现个性化学习报告的生成，为用户提供沉浸式的交互体验。研究为智能教学系统的设计提供了新的思路，具有良好的理论价值和应用前景。

Abstract: Aiming at the problems of traditional intelligent tutoring systems, such as a lack of dynamic adaptability, neglect of students’ psychological states, and insufficient teacher-student interaction, this paper proposes a personalized teaching system based on multi-agent reinforcement learning. The system realizes the dynamic adjustment of teaching strategies through the collaborative optimization of student agents and teacher agents. The student agent, based on the Q-Learning algorithm, constructs the agent state and action space by comprehensively considering multidimensional states including knowledge mastery, motivation, and fatigue level, and independently selects the optimal learning action; the teacher agent integrates heuristic rules and reinforcement learning to formulate personalized teaching rhythm, difficulty level, and strategies according to students’ states; the evaluation agent is responsible for in-depth diagnosis of learning effects and interpretable feedback. The system can effectively improve students’ knowledge mastery and enhance the quality of teachers’ teaching. Meanwhile, the system integrates the DeepSeek large language model to realize the generation of personalized learning reports and provide users with an immersive, interactive experience. The work of this paper provides a new idea for the design of intelligent tutoring systems, with good theoretical value and application prospects.

文章引用：阮子豪, 冯婧琳, 李晶. 基于多智能体协同的动态教学优化系统[J]. 计算机科学与应用, 2026, 16(5): 183-195. https://doi.org/10.12677/csa.2026.165175

参考文献

[1]	郑娅峰, 黄璟玥, 包昊罡. 教育智能体赋能科学教育: 概念特征、应用价值与实施策略[J]. 远程教育杂志, 2025, 43(3): 24-32.
[2]	国务院. 国务院关于印发新一代人工智能发展规划的通知[EB/OL]. 2017-07-20. https://www.gov.cn/zhengce/content/2017-07/20/content_5211996.htm, 2025-04-25.
[3]	教育部等六部门. 关于推进教育新型基础设施建设构建高质量教育支撑体系的指导意见[EB/OL]. 2021-07-01. https://www.gov.cn/gongbao/content/2021/content_5636150.htm, 2026-03-17.
[4]	邱燕楠, 李政涛. 挑战∙融合∙变革: “ChatGPT与未来教育”会议综述[J]. 现代远程教育研究, 2023, 35(3): 3-12+21.
[5]	于济凡, 李睿淼, 李曼丽, 等. 多智能体协同交互的高临场感在线学习环境构建[J]. 现代教育技术, 2024, 34(12): 17-26.
[6]	吴永和, 姜元昊, 陈圆圆, 等. 大语言模型支持的多智能体: 技术路径、教育应用与未来展望[J]. 开放教育研究, 2024, 30(5): 63-75.
[7]	郑娅峰, 赵亚宁, 黄璟玥, 等. 教育智能体: 研究现状和发展趋势[J]. 现代远程教育研究, 2025, 37(4): 3-13+59.
[8]	梁竹梅, 李鲍, 赵冬梅. 以AI智能体重构学习过程——教学智能体创建案例分析与思考[J]. 中国大学教学, 2025(9): 80-86.
[9]	赵德京, 马洪聪, 廖登宇, 等. 一种基于动作采样的Q学习算法[J]. 控制工程, 2024, 31(1): 70-79.
[10]	李明阳, 许可儿, 宋志强, 等. 多智能体强化学习算法研究综述[J]. 计算机科学与探索, 2024, 18(8): 1979-1997.
[11]	郭锐, 吴敏, 彭军, 等. 一种新的多智能体Q学习算法[J]. 自动化学报, 2007(4): 367-372.
[12]	林鸿生, 刘尚富, 赵磊. “AI大模型 + 教师”人机协同教学策略研究[J]. 中国教育技术装备, 2026(3): 16-18+23.
[13]	仵政源, 赵诗奎, 解瑞建, 等. 融合贪心策略的遗传-禁忌搜索算法求解分布式装配作业车间调度问题[J/OL]. 中国机械工程, 1-16. https://link.cnki.net/urlid/42.1294.TH.20260121.1041.006, 2026-03-22.
[14]	蒋忠元, 陶梅悦, 赵晓庆, 等. 基于启发式规则的流式在线日志解析方法[J]. 通信学报, 2024, 45(4): 95-113.
[15]	王旭, 朱其新, 朱永红, 等. 改进Q学习算法的移动机器人路径规划[J]. 计算机仿真, 2025, 42(4): 371-377.

为你推荐

友情链接