电商平台个性化推荐中相似度优化与可学习模型融合研究
Research on the Similarity Optimization and Learnable Model Integration for Personalized Recommendation in E-Commerce Platforms
DOI: 10.12677/ecl.2026.154464, PDF,    国家自然科学基金支持
作者: 郭 强, 代梦飞:上海理工大学管理学院,上海
关键词: 电子商务推荐系统手工特征XGBoost相似度度量E-Commerce Recommendation Systems Handcrafted Features XGBoost Similarity Measures
摘要: 随着电子商务的发展,个性化推荐系统在提升用户满意度、点击率和转化率方面起着重要作用。然而,传统的协同过滤方法往往面临数据稀疏性和用户评分偏差等问题,限制了其预测精度和排序效果。本研究旨在通过将相似度度量与残差学习框架相结合,优化电子商务场景下的个性化推荐性能。本文基于MovieLens 100k数据集,探索了在不同相似度方法下结合手工特征与可学习模型的推荐方法。首先提取了用户活跃度(Activity)、评分序列相关性(DFA)和声誉(Reputation)三个手工特征,并在Cosine、Jaccard、Pearson和Spearman四种相似度基础上,使用了XGBoost等六种可学习模型对预测残差进行建模与校正。实验结果显示:在不同的相似度方法下,XGBoost模型性能最优,其中Jaccard相似度下的XGBoost在RMSE、MAE指标上表现最佳,并且单个手工特征使用的效果有限,而特征组合能够显著提升推荐性能。此外,将Jaccard相似度结合XGBoost可学习模型的推荐模型与常用推荐算法进行对比,发现其在RMSE和MAE上显著优于传统方法,且NDCG@10指标最高,验证了手工特征与可学习模型结合在电商推荐中的有效性。本文研究结果为电商平台提供了优化个性化推荐策略的可行方案。
Abstract: With the rapid development of e-commerce, personalized recommendation systems play a critical role in enhancing user satisfaction, click-through rates, and conversion rates. However, traditional collaborative filtering methods often suffer from data sparsity and user rating bias, which limit prediction accuracy and ranking performance. To address these challenges, this study proposes a hybrid recommendation framework that integrates similarity measures with a residual learning mechanism to optimize personalized recommendation performance in e-commerce scenarios. Based on the MovieLens 100K dataset, we investigate recommendation approaches that combine handcrafted user features with learnable models under different similarity metrics. Specifically, three handcrafted features—user activity, rating sequence correlation measured by detrended fluctuation analysis (DFA), and reputation—are extracted. On the basis of four similarity measures (Cosine, Jaccard, Pearson, and Spearman), six learnable models, including XGBoost, are employed to model and correct the prediction residuals. Experimental results demonstrate that XGBoost consistently achieves superior performance across different similarity measures. In particular, the combination of Jaccard similarity and XGBoost yields the lowest RMSE and MAE values. Moreover, using a single handcrafted feature yields limited improvement, whereas the combination of all features significantly enhances recommendation performance. Furthermore, compared with commonly used recommendation algorithms, the proposed hybrid model significantly outperforms traditional methods in terms of RMSE and MAE, while achieving the highest NDCG@10. These findings validate the effectiveness of integrating handcrafted features with learnable models for e-commerce recommendation and provide empirical support for optimizing personalized recommendation strategies.
文章引用:郭强, 代梦飞. 电商平台个性化推荐中相似度优化与可学习模型融合研究[J]. 电子商务评论, 2026, 15(4): 859-869. https://doi.org/10.12677/ecl.2026.154464

参考文献

[1] 刘超慧, 李玲玲. 基于机器学习的个性化推荐算法及应用[M]. 北京: 人民邮电出版社, 2024: 161.
[2] 雷清锋. 基于用户行为序列的电子商务混合推荐模型研究[D]: [硕士学位论文]. 武汉: 武汉轻工大学, 2024.
[3] 庄维嘉. 基于LightGBM-LR融合模型的混合推荐算法研究及应用[D]: [硕士学位论文]. 上海: 上海第二工业大学, 2023.
[4] 叶昊, 袁凯骏, 沈伟, 等. 基于改进XGBoost算法的电商平台用户复购行为预测研究[J]. 通信与信息技术, 2025(2): 11-16.
[5] 景秀丽, 史明曦. 基于XGBoost算法的电商用户重复购买行为预测[J]. 辽宁大学学报(自然科学版), 2023, 50(2): 134-145.
[6] 吴鑫, 李君, 茅智慧, 等. 基于GA-XGBoost的电商用户购买行为预测[J]. 浙江万里学院学报, 2022, 35(4): 86-92.
[7] Peng, N.J., et al. (2023) Design and Implementation of an Intelligent Recommendation System for Product Information on an e-Commerce Platform Based on Machine Learning. International Conference on Internet of Things and Machine Learning (IoTML 2023), Singapore, 15-17 September 2023, 129371K.
[8] Li, Z.J. (2023) Research on Cross-Border E-Commerce Recommendation System Based on Collaborative Filtering. Proceedings SPIE 12625, International Conference on Mathematics, Modeling, and Computer Science (MMCS2022), Wuhan, 1 June 2023, 1262510. [Google Scholar] [CrossRef
[9] Park, S.H. and Kim, K. (2023) Collaborative Filtering Recommendation System Based on Improved Jaccard Similarity. Journal of Ambient Intelligence and Humanized Computing, 14, 11319-11336. [Google Scholar] [CrossRef
[10] Kermany, N.R., et al. (2023) Incorporating User Rating Credibility in Recommender Systems. Future Generation Computer Systems, 147, 30-43.
[11] 杜宇键. 基于CT-LightGBM模型的音乐推荐算法研究及榜单预测[D]: [硕士学位论文]. 上海: 上海师范大学, 2024.
[12] 王芳, 龚丽兰. 基于协同过滤技术的电商用户重复购买行为预测模型构建[J]. 佳木斯大学学报(自然科学版), 2024, 42(8): 173-176+145.
[13] 杨峰, 耿秀丽. 利用TDGCN-L优化电商推荐: 整合显式反馈以提高用户满意度[J/OL]. 重庆工商大学学报(自然科学版), 1-10.
https://link.cnki.net/urlid/50.1155.N.20251231.0933.002, 2026-03-04.
[14] Ahmed, A., Saleem, K., Khalid, O. and Rashid, U. (2021) On Deep Neural Network for Trust Aware Cross Domain Recommendations in e-Commerce. Expert Systems with Applications, 174, Article ID: 114757. [Google Scholar] [CrossRef
[15] Liang, S., Ma, J., Sun, F., Chen, T., Lu, X. and Ren, S. (2025) KerGNNT: An Interpretable Graph Neural Network Recommendation Model. Knowledge and Information Systems, 67, 5187-5213. [Google Scholar] [CrossRef
[16] Zhou, Y., Lei, T. and Zhou, T. (2011) A Robust Ranking Algorithm to Spamming. EPL (Europhysics Letters), 94, Article No. 48002. [Google Scholar] [CrossRef
[17] 王方圆. 基于XGBoost和CTI融合的推荐算法的研究与应用[D]: [硕士学位论文]. 株洲: 湖南工业大学, 2025.