融合时间特征增强与定向GAN过采样的XGBoost信用卡欺诈检测研究
Study on XGBoost-Based Credit Card Fraud Detection Integrating Temporal Feature Enhancement and Targeted GAN Oversampling
DOI: 10.12677/csa.2025.1512321, PDF,    科研立项经费支持
作者: 宋蔓蔓, 郭禹辰:河北金融学院河北省科技金融重点实验室,河北 保定;刘 梦:河北金融学院金融博物馆,河北 保定;佟 磊:河北软件职业技术学院,河北 保定
关键词: 信用卡欺诈检测时间特征增强定向GAN过采样XGBoostCredit Card Fraud Detection Temporal Feature Enhancement Targeted GAN Oversampling XGBoost
摘要: 针对信用卡欺诈检测中样本极度不平衡导致模型偏向正常样本、传统特征缺乏时间关联性导致区分度不足、过采样生成样本无针对性导致误判率高等核心问题,本文提出一套融合时间特征增强与定向GAN过采样的XGBoost检测方案。首先,基于公开信用卡欺诈数据集构建“交易小时–临时用户标识–交易间隔–间隔统计”四层时间特征体系,补充欺诈行为的时序关联信息;其次,针对小额漏检与大额欺诈两类核心漏检模式,设计定向GAN过采样模型,通过特征约束生成目标样本,并以余弦相似度筛选高质量样本;最后,优化XGBoost关键参数构建高精度分类模型。实验结果表明,该方案在测试集上欺诈类精确率达98.81%、召回率达84.69%、F1值达91.21%;消融实验验证,时间特征增强可使F1值提升4.25个百分点,定向GAN过采样可提升0.99个百分点。
Abstract: Aiming at the core issues in credit card fraud detection, such as extreme class imbalance leading models to favor normal samples, insufficient discriminative power of traditional features due to the lack of temporal correlation, and high misjudgment rate caused by the lack of targeting in oversampled generated samples, this paper proposes a set of XGBoost detection schemes integrating temporal feature enhancement and targeted GAN oversampling. Firstly, based on the public credit card fraud dataset, a four-layer temporal feature system of “transaction hour—temporary user ID—transaction interval—interval statistics” is constructed to supplement the temporal correlation information of fraudulent behaviors. Secondly, targeting the two core missed detection patterns (small-amount missed detection and large-amount fraud), a targeted GAN oversampling model is designed. Target samples are generated through feature constraints, and high-quality samples are screened using cosine similarity. Finally, the key parameters of XGBoost are optimized to construct a high-precision classification model. Experimental results show that the scheme achieves a fraud class precision of 98.81%, recall of 84.69%, and F1-score of 91.21% on the test set. Ablation experiments verify that temporal feature enhancement can increase the F1-score by 4.25 percentage points, and targeted GAN oversampling can increase it by 0.99 percentage points.
文章引用:宋蔓蔓, 郭禹辰, 刘梦, 佟磊. 融合时间特征增强与定向GAN过采样的XGBoost信用卡欺诈检测研究[J]. 计算机科学与应用, 2025, 15(12): 58-65. https://doi.org/10.12677/csa.2025.1512321

参考文献

[1] CardInfoLink (2025) How Does Risk Control Reshape Security Boundaries in the Face of Payment Fraud?
https://www.cardinfolink.com/en-US/newsroom/article/20
[2] Milo, T., Novgorodov, S. and Tan, W. (2016) Rudolf: Interactive Rule Refinement System for Fraud Detection. Proceedings of the VLDB Endowment, 9, 1465-1468. [Google Scholar] [CrossRef
[3] Chandola, V., Banerjee, A. and Kumar, V. (2009) Anomaly Detection: A Survey. ACM Computing Surveys, 41, 1-58. [Google Scholar] [CrossRef
[4] Chawla, N.V., Bowyer, K.W., Hall, L.O. and Kegelmeyer, W.P. (2002) SMOTE: Synthetic Minority Over-Sampling Technique. Journal of Artificial Intelligence Research, 16, 321-357. [Google Scholar] [CrossRef
[5] He, H.B., Bai, Y., Garcia, E.A. and Li, S.T. (2008) ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, 1-8 June 2008, 1322-1328. [Google Scholar] [CrossRef
[6] 顾明, 李飞凤, 王晓勇, 等. 基于改进SMOTE算法和深度学习集成框架的信用卡欺诈检测[J]. 贵阳学院学报(自然科学版), 2024, 19(2): 99-104.
[7] Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 785-794. [Google Scholar] [CrossRef
[8] Lin, T., Goyal, P., Girshick, R., He, K. and Dollar, P. (2017) Focal Loss for Dense Object Detection. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2999-3007. [Google Scholar] [CrossRef
[9] Goodfellow, I., Pouget-Abadie, J., Mirza, M., et al. (2014) Generative Adversarial Nets. Advances in Neural Information Processing Systems 2014, Montreal, 8-13 December 2014, 2672-2680.
[10] 刘汝欣, 徐洪珍. 基于注意力机制优化的WGAN-BiLSTM信用卡欺诈检测方法[J]. 现代电子技术, 2024, 47(10): 73-78.
[11] 李占利, 唐成, 靳红梅. aLMGAN-信用卡欺诈检测方法[J]. 计算机工程与设计, 2024, 45(3): 830-836.
[12] 户媛姣, 牛梦园, 张留美. 融合蝙蝠优化与XGBoost的信用卡欺诈检测方法[J]. 西安邮电大学学报, 2025, 30(5): 51-60.
[13] 张海洋, 陈玉明, 曾念峰, 等. 基于XGBoost与LR融合模型的信用卡欺诈检测[J]. 重庆理工大学学报(自然科学), 2024, 38(3): 195-200.
[14] 丁卫星. 基于深度学习技术的信用卡交易欺诈侦测研究[D]: [硕士学位论文]. 上海: 上海交通大学, 2015.
[15] Kaggle: Credit Card Fraud Detection.
https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud