基于多源数据和模型融合的考研成功率预测研究
Research on Predicting Postgraduate Entrance Examination Success Rate Based on Multi-Source Data and Model Fusion
DOI: 10.12677/ae.2025.1591711, PDF,    科研立项经费支持
作者: 刘 彬, 黎俊伟, 邓 欣, 丁晓宇, 胡 珂, 王一雄, 王 进:重庆邮电大学人工智能学院,重庆;岳 曦:重庆邮电大学集成电路学院,重庆
关键词: 考研预测多源数据整合特征工程模型融合Postgraduate Entrance Examination Prediction Multi-Source Data Integration Feature Engineering Model Fusion
摘要: 面对考研报考规模激增而录取率持续走低的现实困境,以及考生普遍存在的自我评估与志愿决策难题,本研究致力于构建精准的考研成功率预测模型。研究整合了涵盖学业、行为、心理及主观认知等多源数据,通过特征工程方法提取关键信息,并构建了基于LightGBM与CatBoost的融合预测模型。模型在真实考研学生数据上展现出优异的预测性能,F1-score和AUC指标分别达到0.78和0.88。该模型为考生提供了客观、个性化的能力评估与志愿决策支持,有助于缓解因信息不对称和集群效应导致的资源错配问题,提升研究生报考的科学性与成功率。
Abstract: Facing the dual challenges of a booming postgraduate entrance examination applicant pool alongside persistently declining admission rates, as well as the widespread difficulties candidates encounter in self-assessment and decision-making regarding institution/program choice, this paper aims to construct an accurate predictive model for postgraduate examination success probability. Our research integrates multi-source data encompassing academic performance, behavioral patterns, psychological traits, and subjective cognitive factors. Key predictive information was extracted through feature engineering methodologies, and a fusion prediction model based on LightGBM and CatBoost was developed. Evaluated on real-world data from examination candidates, the model demonstrates superior predictive performance, achieving an F1-score of 0.78 and an AUC of 0.88. This model provides candidates with objective and personalized ability assessment and decision support for institution/program selection. It contributes to mitigating resource misallocation stemming from information asymmetry and herding effects, thereby enhancing the scientific rigor and success rate of postgraduate application strategies.
文章引用:刘彬, 黎俊伟, 邓欣, 岳曦, 丁晓宇, 胡珂, 王一雄, 王进. 基于多源数据和模型融合的考研成功率预测研究[J]. 教育进展, 2025, 15(9): 584-591. https://doi.org/10.12677/ae.2025.1591711

参考文献

[1] 中国教育在线. 2024全国研究生招生调查报告[R]. 北京: 中国教育在线, 2024.
[2] 李东洁, 袁善良, 黄林萱, 等. 本科成绩对考研成绩的影响及成功考研的共性调研与分析——以哈尔滨理工大学自动化学院为例[J]. 高教学刊, 2022, 8(19): 53-55, 59.
[3] 李梦莹, 王晓东, 阮书岚, 等. 基于双路注意力机制的学生成绩预测模型[J]. 计算机研究与发展, 2020, 57(8): 12-34.
[4] 郑宝乐, 李思成, 唐庭龙. 基于机器学习的考研结果预测算法设计[J]. 信息通信, 2020(10): 17-46.
[5] 张懿, 胡春美. 基于线性回归的考研成绩分析与预测[J]. 中国宽带, 2020(5): 21-25.
[6] Aljasmi, S., Nassif, A.B., Shahin, I., et al. (2020) Graduate Admission Prediction Using Machine Learning. International Journal of Computers and Communications, 14, 79-83. [Google Scholar] [CrossRef
[7] Fedynich, L.V. (2017) The Grand Question: Do Entrance Examinations Determine Graduate Student Academic Success? Journal of Academic and Business Ethics, 11, 12-29.
[8] 周建华, 单正义, 覃红霞. 新高考是否促进了学生自主选科?——基于CatBoost回归树模型的实证分析[J]. 华东师范大学学报(教育科学版), 2024, 42(3): 12-25.