基于大语言模型强化的个性化推荐算法研究

doi:10.12677/aam.2026.156285

期刊菜单

基于大语言模型强化的个性化推荐算法研究
Research on Personalized Recommendation Algorithm Enhanced by Large Language Models

DOI: 10.12677/aam.2026.156285, PDF,
作者: 贾佳奇, 周文学^*：兰州交通大学数理学院，甘肃兰州
关键词: 个性化推荐；深度强化学习；大语言模型；跨视图表示学习；知识图谱；Personalized Recommendation； Deep Reinforcement Learning； Large Language Model； Cross-View Representation Learning； Knowledge Graph

摘要: 随着互联网平台内容规模的持续扩张，用户在海量候选项中快速找到符合自身兴趣的信息变得越来越困难，个性化推荐成为提升用户体验的重要手段。现有推荐方法大多以用户–项目交互数据为核心进行偏好建模，但在用户兴趣动态变化、交互数据稀疏以及多源异构信息利用不足等问题上仍存在明显局限。针对上述问题，本文提出了一种结合大语言模型语义增强、知识图谱结构建模与深度强化学习决策的个性化推荐方法。该方法将推荐过程建模为序列决策问题，在用户侧结合行为序列表示与语义画像构建状态表示，在项目侧联合学习文本语义表示与知识图谱结构表示，并通过深度Q网络完成候选项目的价值评估与推荐决策。在Yelp和Amazon-Book两个公开数据集上进行了实验，并与多种代表性推荐方法进行了对比。结果表明，所提出的方法在HR@K以及NDCG@K等指标上均取得了较优表现，验证了所提框架的有效性。

Abstract: With the continuous expansion of the content scale on internet platforms, it has become increasingly difficult for users to quickly find information that matches their interests from the vast number of options. Personalized recommendations have become an important means to enhance user experience. Most existing recommendation methods mainly model preferences based on user-item interaction data. However, there are still significant limitations in addressing issues such as dynamic changes in user interests, sparse interaction data, and insufficient utilization of multi-source heterogeneous information. To address these problems, this paper proposes a personalized recommendation method that combines semantic enhancement of large language models, knowledge graph structure modeling, and deep reinforcement learning decision-making. This method models the recommendation process as a sequence decision-making problem. On the user side, it combines behavioral sequence representation and semantic profiling to construct state representation. On the item side, it jointly learns text semantic representation and knowledge graph structure representation, and completes the value assessment and recommendation decision through a deep Q-network. Experiments were conducted on two public datasets, Yelp and Amazon-Book, and compared with several representative recommendation methods. The results show that the proposed method achieves superior performance in metrics such as HR@K and NDCG@K, verifying the effectiveness of the proposed framework.

文章引用：贾佳奇, 周文学. 基于大语言模型强化的个性化推荐算法研究[J]. 应用数学进展, 2026, 15(6): 263-273. https://doi.org/10.12677/aam.2026.156285

参考文献

[1]	Lyu, H., Jiang, S., Zeng, H., Xia, Y., Wang, Q., Zhang, S., et al. (2024) LLM-Rec: Personalized Recommendation via Prompting Large Language Models. Findings of the Association for Computational Linguistics: NAACL 2024, Mexico City, 16-21 June 2024, 583-612. [Google Scholar] [CrossRef]
[2]	Chung, J., Gulcehre, C., Cho, K. and Bengio, Y. (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arXiv:1412.3555.
[3]	Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L. and Polosukhin, I. (2017) Attention Is All You Need. Conference and Workshop on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 5998-6008.
[4]	Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P. and Bengio, Y. (2018) Graph Attention Networks. Proceedings of the 6th International Conference on Learning Representations, Vancouver.
[5]	Wang, X., He, X., Cao, Y., Liu, M. and Chua, T. (2019) KGAT: Knowledge Graph Attention Network for Recommendation. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, 4-8 August 2019, 950-958. [Google Scholar] [CrossRef]
[6]	Chen, T., Kornblith, S., Norouzi, M. and Hinton, G. (2020) A Simple Framework for Contrastive Learning of Visual Representations. In: Daumé, P. and Singh, A., Eds., Proceedings of the 37th International Conference on Machine Learning, JMLR.org, 1597-1607.
[7]	He, K., Fan, H., Wu, Y., Xie, S. and Girshick, R. (2020) Momentum Contrast for Unsupervised Visual Representation Learning. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 9729-9738. [Google Scholar] [CrossRef]
[8]	Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., et al. (2015) Human-Level Control through Deep Reinforcement Learning. Nature, 518, 529-533. [Google Scholar] [CrossRef] [PubMed]
[9]	Wang, Z., Schaul, T., Hessel, M., Van Hasselt, H., Lanctot, M. and De Freitas, F. (2016) Dueling Network Architectures for Deep Reinforcement Learning. Proceedings of the 33rd International Conference on Machine Learning, New York, 19-24 June 2016, 1995-2003.
[10]	Van Hasselt, H., Guez, A. and Silver, D. (2016) Deep Reinforcement Learning with Double Q-Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 30, 2094-2100. [Google Scholar] [CrossRef]

为你推荐

友情链接