基于多源数据和模型融合的考研成功率预测研究

doi:10.12677/ae.2025.1591711

期刊菜单

基于多源数据和模型融合的考研成功率预测研究
Research on Predicting Postgraduate Entrance Examination Success Rate Based on Multi-Source Data and Model Fusion

DOI: 10.12677/ae.2025.1591711, PDF, 科研立项经费支持
作者: 刘彬, 黎俊伟, 邓欣, 丁晓宇, 胡珂, 王一雄, 王进：重庆邮电大学人工智能学院，重庆；岳曦：重庆邮电大学集成电路学院，重庆
关键词: 考研预测；多源数据整合；特征工程；模型融合；Postgraduate Entrance Examination Prediction； Multi-Source Data Integration； Feature Engineering； Model Fusion

摘要: 面对考研报考规模激增而录取率持续走低的现实困境，以及考生普遍存在的自我评估与志愿决策难题，本研究致力于构建精准的考研成功率预测模型。研究整合了涵盖学业、行为、心理及主观认知等多源数据，通过特征工程方法提取关键信息，并构建了基于LightGBM与CatBoost的融合预测模型。模型在真实考研学生数据上展现出优异的预测性能，F1-score和AUC指标分别达到0.78和0.88。该模型为考生提供了客观、个性化的能力评估与志愿决策支持，有助于缓解因信息不对称和集群效应导致的资源错配问题，提升研究生报考的科学性与成功率。

Abstract: Facing the dual challenges of a booming postgraduate entrance examination applicant pool alongside persistently declining admission rates, as well as the widespread difficulties candidates encounter in self-assessment and decision-making regarding institution/program choice, this paper aims to construct an accurate predictive model for postgraduate examination success probability. Our research integrates multi-source data encompassing academic performance, behavioral patterns, psychological traits, and subjective cognitive factors. Key predictive information was extracted through feature engineering methodologies, and a fusion prediction model based on LightGBM and CatBoost was developed. Evaluated on real-world data from examination candidates, the model demonstrates superior predictive performance, achieving an F1-score of 0.78 and an AUC of 0.88. This model provides candidates with objective and personalized ability assessment and decision support for institution/program selection. It contributes to mitigating resource misallocation stemming from information asymmetry and herding effects, thereby enhancing the scientific rigor and success rate of postgraduate application strategies.

文章引用：刘彬, 黎俊伟, 邓欣, 岳曦, 丁晓宇, 胡珂, 王一雄, 王进. 基于多源数据和模型融合的考研成功率预测研究[J]. 教育进展, 2025, 15(9): 584-591. https://doi.org/10.12677/ae.2025.1591711

参考文献

[1]	中国教育在线. 2024全国研究生招生调查报告[R]. 北京: 中国教育在线, 2024.
[2]	李东洁, 袁善良, 黄林萱, 等. 本科成绩对考研成绩的影响及成功考研的共性调研与分析——以哈尔滨理工大学自动化学院为例[J]. 高教学刊, 2022, 8(19): 53-55, 59.
[3]	李梦莹, 王晓东, 阮书岚, 等. 基于双路注意力机制的学生成绩预测模型[J]. 计算机研究与发展, 2020, 57(8): 12-34.
[4]	郑宝乐, 李思成, 唐庭龙. 基于机器学习的考研结果预测算法设计[J]. 信息通信, 2020(10): 17-46.
[5]	张懿, 胡春美. 基于线性回归的考研成绩分析与预测[J]. 中国宽带, 2020(5): 21-25.
[6]	Aljasmi, S., Nassif, A.B., Shahin, I., et al. (2020) Graduate Admission Prediction Using Machine Learning. International Journal of Computers and Communications, 14, 79-83. [Google Scholar] [CrossRef]
[7]	Fedynich, L.V. (2017) The Grand Question: Do Entrance Examinations Determine Graduate Student Academic Success? Journal of Academic and Business Ethics, 11, 12-29.
[8]	周建华, 单正义, 覃红霞. 新高考是否促进了学生自主选科?——基于CatBoost回归树模型的实证分析[J]. 华东师范大学学报(教育科学版), 2024, 42(3): 12-25.

为你推荐

友情链接