基于SMOTE增强与多模型择优的银行客户忠诚度预测研究
Research on the Prediction of Bank Customer Loyalty Based on SMOTE Enhancement and Multi-Model Optimization
DOI: 10.12677/csa.2025.1511306, PDF,    科研立项经费支持
作者: 刘政永, 孙 娜:河北金融学院河北省金融科技应用重点实验室,河北 保定
关键词: 银行客户忠诚度SMOTE增强逻辑回归随机森林AdaBoost支持向量机Bank Customer Loyalty SMOTE Enhancement Logistic Regression Random Forest AdaBoost Support Vector Machine
摘要: 本研究针对银行客户流失预测问题,通过系统性数据处理、可视化分析与特征工程,构建了多种机器学习模型(包括逻辑回归、随机森林、AdaBoost和支持向量机),并基于ROC曲线、F1分数等指标评估模型性能。核心发现表明,随机森林模型在应对数据不平衡和捕捉复杂特征关系方面表现最优(测试集F1分数达0.8546),显著优于其他模型;方法贡献在于提出了一套结合可视化探索与特征优化的建模框架,强调了数据质量与衍生特征对预测性能的关键作用;研究局限包括数据来源单一性及模型对特定业务场景的泛化能力有待进一步验证。本研究为银行客户忠诚度管理提供了数据驱动的决策支持。
Abstract: This study addresses the problem of bank customer churn prediction. By systematic data processing, visualization analysis, and feature engineering, a variety of machine learning (including logistic regression, random forest, AdaBoost, and support vector machine) are constructed and evaluated based on ROC curves, F1 scores, and other metrics. Core findings show that the random forest model performs the best in dealing with data imbalance and capturing complex feature relationships (achieving a F1 score of 0.8546 the test set), significantly outperforming the other models. The methodological contribution lies in proposing a modeling framework that combines visualization exploration and feature optimization, emphasizing the critical roles of quality and derived features in prediction performance. Research limitations include the singularity of data sources and the need for further validation of the model’s generalization capability in specific business scenarios. This study provides data-driven decision support for bank customer loyalty management.
文章引用:刘政永, 孙娜. 基于SMOTE增强与多模型择优的银行客户忠诚度预测研究[J]. 计算机科学与应用, 2025, 15(11): 305-319. https://doi.org/10.12677/csa.2025.1511306

参考文献

[1] 贾薇. 徽商银行S支行个人理财客户忠诚度影响因素研究[D]: [硕士学位论文]. 北京: 中国矿业大学, 2022.
[2] 余敏. Z银行高端客户忠诚度维护研究[D]: [硕士学位论文]. 昆明: 云南财经大学, 2022.
[3] 蒋月. 基于客户感知价值理论的建行H支行个人客户忠诚度提升策略研究[D]: [硕士学位论文]. 大连: 东北财经大学, 2022.
[4] 冉丽. G银行C分行信用卡客户忠诚度评价及提升策略研究[D]: [硕士学位论文]. 重庆: 重庆工商大学, 2022.
[5] Peng, K., Peng, Y. and Li, W. (2023) Research on Customer Churn Prediction and Model Interpretability Analysis. PLOS ONE, 18, e0289724. [Google Scholar] [CrossRef] [PubMed]
[6] Tam, L.T., Vi, L.G. and Tuan, N.M. (2025) Comparison of Methods for Handling Imbalanced Data in Customer Churn Prediction with Feature Selection Using SHAP and mRMR Frameworks. Cybernetics and Information Technologies, 25, 68-87. [Google Scholar] [CrossRef
[7] Maan, J. and Maan, H. (2023) Customer Churn Prediction Model Using Explainable Machine Learning. International Journal of Computer Science Trends and Technology, 11, 33-38.
[8] Li, Y. and Yan, K. (2025) Prediction of Bank Credit Customers Churn Based on Machine Learning and Interpretability Analysis. Data Science in Finance and Economics, 5, 19-34. [Google Scholar] [CrossRef