融合自注意力与渐进抽取机制的电商多场景多任务预测方法研究
Research on a Multi-Scenario, Multi-Task Prediction Method for E-Commerce Based on Self-Attention and Progressive Extraction Mechanisms
DOI: 10.12677/ecl.2026.156720, PDF,    国家自然科学基金支持
作者: 王英万, 于丽娅*, 徐 兆:贵州大学机械工程学院,贵州 贵阳;李少波:贵州理工学院机械工程学院,贵州 贵阳;李传江:贵州大学省部共建公共大数据国家重点实验室,贵州 贵阳
关键词: 推荐系统多场景推荐多任务预测电子商务注意力机制Recommendation Systems Multi-Scenario Recommendation Multi-Task Prediction E-Commerce Attention Mechanism
摘要: 随着电商平台由单场景、单目标逐步迈向多场景、多任务的统一建模范式,如何在充分共享跨场景共性知识的同时有效规避任务间冲突与负迁移,已成为制约模型可用性与训练稳定性的核心挑战。针对该问题,提出一种面向电商多场景多任务预测的分层专家增强框架:以Star场景融合底座为基础,在嵌入层之后引入自注意力模块,以显式刻画高阶特征依赖;同时采用渐进分层抽取替代传统单层MoE的共享机制,使共享–特化在不同语义层级实现自适应迁移与有效解耦,从结构上缓解跷跷板现象并抑制负迁移。此外,模型保留场景/任务专家,并通过可学习的平衡混合机制增强对场景不均衡与分布漂移的鲁棒性。在两个公开数据集上开展了广泛实验,结果表明该方法在整体性能与稳定性方面均显著优于对比模型。
Abstract: As e-commerce platforms gradually evolve from single-scenario and single-objective frameworks toward a unified multi-scenario and multi-task modeling paradigm, how to fully share cross-scenario common knowledge while effectively avoiding inter-task conflicts and negative transfer has emerged as a core challenge restricting model usability and training stability. To address this issue, we propose a hierarchical expert-enhanced framework for multi-scenario multi-task prediction in e-commerce. Building upon a Star-topology scenario fusion foundation, a self-attention module is introduced after the embedding layer to explicitly capture high-order feature dependencies. Simultaneously, a progressive hierarchical extraction mechanism replaces the traditional single-layer MoE sharing approach. This enables the adaptive transfer and effective decoupling of shared and task-specific representations across different semantic levels, fundamentally mitigating the seesaw phenomenon and suppressing negative transfer from a structural perspective. Furthermore, the model retains scenario and task-specific experts, utilizing a learnable balanced mixing mechanism to enhance robustness against scenario imbalance and distribution shift. Extensive experiments conducted on two public datasets demonstrate that the proposed method significantly outperforms baseline models in terms of overall performance and stability.
文章引用:王英万, 于丽娅, 李少波, 李传江, 徐兆. 融合自注意力与渐进抽取机制的电商多场景多任务预测方法研究[J]. 电子商务评论, 2026, 15(6): 1004-1014. https://doi.org/10.12677/ecl.2026.156720

参考文献

[1] 吕学强, 王夏雨, 马登豪. 面向推荐系统的用户兴趣建模综述[J]. 计算机工程与应用, 2025, 61(21): 15-29.
[2] Song, D., Yang, E., Guo, G., Shen, L., Jiang, L. and Wang, X. (2024) Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System. ACM Transactions on Knowledge Discovery from Data, 18, 1-20. [Google Scholar] [CrossRef
[3] 任衍栋, 张东, 李冠宇. 融合注意力与结构降噪的对比学习知识感知推荐[J]. 计算机工程与应用, 2025, 61(17): 232-240.
[4] Ning, W., Yan, X., Liu, W., Cheng, R., Zhang, R. and Tang, B. (2023) Multi-Domain Recommendation with Embedding Disentangling and Domain Alignment.
http://arxiv.org/abs/2308.05508
[5] Zhu, J., Wang, Y., Lin, J., Qin, J., Tang, R., Zhang, W., et al. (2024) M-Scan: A Multi-Scenario Causal-Driven Adaptive Network for Recommendation. Proceedings of the ACM Web Conference 2024, Singapore, 13-17 May 2024, 3844-3853. [Google Scholar] [CrossRef
[6] Ma, J., Zhao, Z., Yi, X., Chen, J., Hong, L. and Chi, E.H. (2018) Modeling Task Relationships in Multi-Task Learning with Multi-Gate Mixture-of-Experts. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, 19-23 August 2018, 1930-1939. [Google Scholar] [CrossRef
[7] Sheng, X.R., Zhao, L., Zhou, G., et al. (2021) One Model to Serve All: Star Topology Adaptive Recommender for Multi-Domain CTR Prediction.
http://arxiv.org/abs/2101.11427
[8] Tang, H., Liu, J., Zhao, M. and Gong, X. (2020) Progressive Layered Extraction (PLE): A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations. Fourteenth ACM Conference on Recommender Systems, Brazil, 22-26 September 2020, 269-278. [Google Scholar] [CrossRef
[9] Ouyang, W., Zhang, X., Zhao, L., et al. (2020) MiNet: Mixed Interest Network for Cross-Domain Click-Through Rate Prediction.
http://arxiv.org/abs/2008.02974
[10] Zou, X., Hu, Z., Zhao, Y., et al. (2022) Automatic Expert Selection for Multi-Scenario and Multi-Task Search.
http://arxiv.org/abs/2205.14321
[11] Zhang, Z., Liu, S., Yu, J., Cai, Q., Zhao, X., Zhang, C., et al. (2024) M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington DC, 14-18 July 2024, 893-902. [Google Scholar] [CrossRef
[12] Chang, J., Zhang, C., Hui, Y., Leng, D., Niu, Y., Song, Y., et al. (2023) PEPNet: Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, 6-10 August 2023, 3795-3804. [Google Scholar] [CrossRef
[13] Shazeer, N., Mirhoseini, A., Maziarz, K., et al. (2017) Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer.
http://arxiv.org/abs/1701.06538
[14] Schoenauer-Sebag, A., Heinrich, L., Schoenauer, M., et al. (2019) Multi-Domain Adversarial Learning.
http://arxiv.org/abs/1903.09239
[15] Sun, Z., Si, Z., Zang, X., Leng, D., Niu, Y., Song, Y., et al. (2023) KuaiSAR: A Unified Search and Recommendation Dataset. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, Birmingham, 21-25 October 2023, 5407-5411. [Google Scholar] [CrossRef