中国农村留守儿童问题行为的因子预测:基于机器学习的方法研究
Factor Prediction of Problem Behaviors in Rural Left-Behind Children in China: A Machine Learning-Based Study
摘要: 准确预测农村留守儿童的问题行为,对于早期识别高风险个体并实施针对性干预,以改善其心理健康和社会适应具有重要意义。本研究旨在探索机器学习方法在留守儿童问题行为预测中的应用,并为相关干预措施的制定提供科学依据。本研究选取随机森林、决策树、线性回归和套索回归四种机器学习算法,对中国农村留守儿童的问题行为进行因子预测。研究首先对农村留守儿童问题行为的国内外研究现状进行综述,并深入分析了不同机器学习方法在预测问题行为方面的优势与局限性。结果表明,随机森林和决策树模型表现出较好的预测性能,经交叉验证后的平均均方误差(MSE)分别为1.056和0.012,决定系数(R2)分别为0.871和0.998。相比之下,线性回归和套索回归模型的拟合效果较差,MSE和R2值均表现不佳。综合随机森林和决策树模型的结果,识别出对问题行为预测具有重要影响的前五个因子,分别为:抑郁情绪、焦虑情绪、同伴关系、父亲教养投入以及留守儿童对未来职业的期望(期望从事工人职业)。本研究结果强调了心理健康、社会关系和家庭教养在留守儿童问题行为中的重要作用,并为利用机器学习技术进行早期风险识别和精准干预提供了新的思路和方法。
Abstract: Accurately predicting problem behaviors in rural left-behind children is of great significance for early identification of high-risk individuals and the implementation of targeted interventions to improve their mental health and social adaptation. This study aims to explore the application of machine learning methods in predicting problem behaviors among left-behind children and to provide a scientific basis for the formulation of relevant intervention measures. Four machine learning algorithms—Random Forest, Decision Tree, Linear Regression, and Lasso Regression—were selected to conduct factor prediction of problem behaviors in rural left-behind children in China. Initially, the study reviewed the current research status on problem behaviors in rural left-behind children both domestically and internationally, and conducted an in-depth analysis of the advantages and limitations of different machine learning methods in predicting problem behaviors. The results indicated that the Random Forest and Decision Tree models demonstrated superior predictive performance, with average Mean Squared Errors (MSE) after cross-validation of 1.056 and 0.012, respectively, and Coefficients of Determination (R2) of 0.871 and 0.998, respectively. In contrast, the Linear Regression and Lasso Regression models exhibited poorer fitting effects, with unsatisfactory MSE and R2 values. By integrating the results of the Random Forest and Decision Tree models, the top five factors identified as having a significant impact on predicting problem behaviors were: depressive mood, anxiety mood, peer relationships, paternal parenting involvement, and left-behind children’s expectations for future occupations (expecting to work as industrial workers). The findings of this study underscore the crucial roles of mental health, social relationships, and family parenting in the problem behaviors of left-behind children and offer new insights and methods for utilizing machine learning technology for early risk identification and precise interventions.
文章引用:吴丽娟 (2025). 中国农村留守儿童问题行为的因子预测:基于机器学习的方法研究. 心理学进展, 15(10), 156-169. https://doi.org/10.12677/ap.2025.1510556

参考文献

[1] 陈国平, 王志强, 孙莹, 等(2024). 安徽省中学生校园欺凌与焦虑抑郁共患的关联性. 中国学校卫生, 45(8), 1115-1119.
[2] 范兴华, 方晓义, 黄月胜, 陈锋菊, 余思(2018). 父母关爱对农村留守儿童抑郁的影响机制: 追踪研究. 心理学报, 50(9), 1029-1040.
[3] 方圆, 王雅芯, 李翀, 等(2022). 乡村儿童心理健康及其影响因素数据集. 中国科学数据(中英文网络版), 8(3), 375-383.
[4] 高敏(2015). 高中生个人规划与领悟社会支持、问题行为的关系. 硕士学位论文, 南京: 南京师范大学.
[5] 高雯, 陈会昌(2021). 父母分歧对七岁儿童问题行为的影响. 中国心理卫生杂志, 22(11), 791-795.
[6] 金桃(2021). 基于机器学习的流动儿童问题行为的预测研究. 硕士学位论文, 天津: 天津师范大学.
[7] 李志红(2019). 父母养育心理灵活性对父母焦虑抑郁和儿童问题行为的影响. 中国健康教育, 35(6), 511-516.
[8] 刘莉, 吴倩, 郑研, 谭晓庆(2020). 父母心理攻击与儿童问题行为: 另一方父母支持的调节作用. 中国临床心理学杂志, 28(6), 1235-1239, 1245.
[9] 刘瑞杰, 彭玮婧, 郑美欣, 等(2023). 湖南省中学生抑郁、焦虑现状及流行病学特征. 中国临床心理学杂志, 31(4), 816-820.
[10] 吕催芳, 周永红(2019). 感恩对流浪儿童问题行为的影响: 核心自我评价和社会支持的中介效应. 中国临床心理学杂志, 27(2), 406-413.
[11] 吕勤, 陈会昌, 王莉(2003). 儿童问题行为及其相关父母教养因素研究综述. 心理科学, 26(1), 130-132.
[12] 覃军(2023). 基于文献计量学和社会网络分析的留守儿童问题行为研究现状. 职业与健康, 39(23), 3299-3303, 3307.
[13] 王学思, 李静雅, 王美芳(2021). 父母婚姻冲突对儿童发展的影响及其机制. 心理科学进展, 29(5), 875-884.
[14] 王玉洁, 窦凯(2019). 同伴依恋与青少年抑郁的关系: 情绪调节效能感的中介作用. 中国健康心理学杂志, 27(7), 1092-1095
[15] 伍新春, 刘畅, 胡艳蕊, 等(2015). 父亲教养投入问卷的编制及其信效度检验. 中国临床心理学杂志, 23(4), 576-579.
[16] 张文新(1999). 儿童社会性发展. 北京师范大学出版社.
[17] Achenbach, T. M., Dumenci, L., & Rescorla, L. A. (2001). Ratings of Relations between DSM-IV Diagnostic Categories and Items of the CBCL/6-18, TRF, and YSR. University of Vermont, Research Center for Children, Youth, & Families.
[18] Agerup, T., Lydersen, S., Wallander, J., & Sund, A. M. (2015). Associations between Parental Attachment and Course of Depression between Adolescence and Young Adulthood. Child Psychiatry & Human Development, 46, 632-642.[CrossRef] [PubMed]
[19] Alateyat, H., Cruz, S., Cernadas, E., Tubío-Fungueiriño, M., Sampaio, A., González-Villar, A. et al. (2022). A Machine Learning Approach in Autism Spectrum Disorders: From Sensory Processing to Behavior Problems. Frontiers in Molecular Neuro-science, 15, Article 889641.[CrossRef] [PubMed]
[20] Blain-Arcaro, C., & Vaillancourt, T. (2017). Longitudinal Associations between Depression and Aggression in Children and Adolescents. Journal of Abnormal Child Psychology, 45, 959-970.[CrossRef] [PubMed]
[21] Cabitza, F., & Banfi, G. (2018). Machine Learning in Laboratory Medicine: Waiting for the Flood? Clinical Chemistry and Laboratory Medicine (CCLM), 56, 516-524.[CrossRef] [PubMed]
[22] DeWalt, D. A., Thissen, D., Stucky, B. D., Langer, M. M., Morgan DeWitt, E., Irwin, D. E. et al. (2013). PROMIS Pediatric Peer Relationships Scale: Development of a Peer Relationships Item Bank as Part of Social Health Measurement. Health Psychology, 32, 1093-1103.[CrossRef] [PubMed]
[23] Fosco, G. M., Stormshak, E. A., Dishion, T. J., & Winter, C. E. (2012). Family Relationships and Parental Monitoring during Middle School as Predictors of Early Adolescent Problem Behavior. Journal of Clinical Child & Adolescent Psychology, 41, 202-213.[CrossRef] [PubMed]
[24] Friedman, J., Hastie, T., & Tibshirani, R. (2001). The Elements of Statistical Learning. Springer.
[25] Geisser, S. (1975). The Predictive Sample Reuse Method with Applications. Journal of the American Statistical Association, 70, 320-328.[CrossRef
[26] Henri, T., & John, T. (1970). Social Identity Theory. Cambridge University Press.
[27] Hunt, T. K. A., Slack, K. S., & Berger, L. M. (2018). Adverse Childhood Experiences and Behavioral Problems in Middle Childhood. Child Abuse & Neglect, 67, 391-402.[CrossRef] [PubMed]
[28] Knudsen, A. K., Ystrom, E., Skogen, J. C., & Torgersen, L. (2015). Maternal Heavy Alcohol Use and Toddler Behavior Problems: A Fixed Effects Regression Analysis. European Child & Adolescent Psychiatry, 24, 1269-1277.[CrossRef] [PubMed]
[29] Kovacs, M. (1992). The Childrens Depression Inventory. Multi-Health Systems.
[30] Ouyang, Y., Xu, X., & Ouyang, Z. (2023). Confidence in the Future and Adolescent Problem Behavior. Sustainability, 15, Article 5191.[CrossRef
[31] Sándor, M. C., & Bakó, B. (2022). Unmasking Risky Habits: Identifying and Predicting Problem Gamblers through Machine Learning Techniques. Journal of Gambling Studies, 40, 1367-1377.[CrossRef] [PubMed]
[32] Senders, J. T., Staples, P. C., Karhade, A. V., Zaki, M. M., Gormley, W. B., Broekman, M. L. D. et al. (2018). Machine Learning and Neurosurgical Outcome Prediction: A Systematic Review. World Neurosurgery, 109, 476-486.e1.[CrossRef] [PubMed]
[33] Spitzer, R. L., Kroenke, K., Williams, J. B. W., & Löwe, B. (2006). A Brief Measure for Assessing Generalized Anxiety Disorder. Archives of Internal Medicine, 166, 1092-1097.[CrossRef] [PubMed]
[34] Zhou, Y., Zhao, C., He, F., Tian, X., Zheng, Y., & Sun, J. (2021). Emotional and Behavioral Problems of Left-Behind Children in Impoverished Rural China: A Comparative Cross-Sectional Study of First Grade Junior High School. International Journal of Mental Health Promotion, 23, 1-13.[CrossRef
[35] Zhu, C., Yin, X., Li, Z., & Zhou, L. (2023). Psychological Capital Differs among Rural Left-Behind Children and Is Associated with Emotional and Behavioral Problems. Frontiers in Psychology, 12, Article 565385.[CrossRef] [PubMed]