面向电子商务系统的多业务确定性调度:一种图神经网络增强的深度强化学习方法
Multi-Service Deterministic Scheduling for E-Commerce Systems: A Graph Neural Network-Enhanced Deep Reinforcement Learning Approach
DOI: 10.12677/ecl.2026.151107, PDF,   
作者: 冯佳俊:贵州大学大数据与信息工程学院,贵州 贵阳
关键词: 电子商务调度深度强化学习熵正则化E-Commerce Scheduling Deep Reinforcement Learning Entropy Regularization
摘要: 在现代电子商务系统中,业务请求通常呈现出显著的异构时间特性。不同类型的业务(如在线支付、订单确认、实时推荐与风险控制)具有差异化的服务级别协议(SLA)约束,其截止时间与时延敏感性存在显著差异。这一特性使得传统基于统一策略的请求调度方法难以同时兼顾系统效率与服务质量:一方面,统一调度容易导致高实时性业务请求错过时延约束;另一方面,过于保守的资源分配策略又会造成计算与通信资源的低效利用,增加系统运营成本。针对上述问题,本文提出了一种异构图神经网络增强的深度强化学习调度框架,面向云–边–服务节点协同的电子商务业务调度场景。该框架在构建的异构知识图谱中显式建模业务类型、业务请求与计算资源之间的关联关系,使图神经网络能够有效捕获不同业务请求之间的依赖关系、资源状态动态以及业务类型约束。在此基础上,引入一种熵正则化自适应调度策略,在满足关键业务时延与SLA约束的前提下,实现系统调度性能与资源利用率的稳定优化。
Abstract: In modern e-commerce systems, business requests typically exhibit significant heterogeneous temporal characteristics. Different types of business operations (such as online payments, order confirmation, real-time recommendations, and risk control) have differentiated service level agreement (SLA) constraints, with significant differences in deadlines and latency sensitivity. This characteristic makes it difficult for traditional request scheduling methods based on a unified strategy to simultaneously balance system efficiency and service quality: on the one hand, unified scheduling easily leads to high-real-time business requests missing latency constraints; on the other hand, overly conservative resource allocation strategies result in inefficient utilization of computing and communication resources, increasing system operating costs. To address these issues, this paper proposes a heterogeneous graph neural network-enhanced deep reinforcement learning scheduling framework for cloud-edge-service node collaborative e-commerce business scheduling scenarios. This framework explicitly models the relationships between business types, business requests, and computing resources in a constructed heterogeneous knowledge graph, enabling the graph neural network to effectively capture the dependencies between different business requests, dynamic resource states, and business type constraints. Based on this, an entropy-regularized adaptive scheduling strategy is introduced to achieve stable optimization of system scheduling performance and resource utilization while satisfying critical business latency and SLA constraints.
文章引用:冯佳俊. 面向电子商务系统的多业务确定性调度:一种图神经网络增强的深度强化学习方法[J]. 电子商务评论, 2026, 15(1): 881-889. https://doi.org/10.12677/ecl.2026.151107

参考文献

[1] Kang, K., Ding, D., Xie, H., Yin, Q. and Zeng, J. (2022) Adaptive DRL-Based Task Scheduling for Energy-Efficient Cloud Computing. IEEE Transactions on Network and Service Management, 19, 4948-4961. [Google Scholar] [CrossRef
[2] Tang, Z., Jia, W., Zhou, X., Yang, W. and You, Y. (2022) Representation and Reinforcement Learning for Task Scheduling in Edge Computing. IEEE Transactions on Big Data, 8, 795-808. [Google Scholar] [CrossRef
[3] Jiang, F., Wang, K., Dong, L., Pan, C. and Yang, K. (2020) Stacked Autoencoder-Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks. IEEE Internet of Things Journal, 7, 9278-9290. [Google Scholar] [CrossRef
[4] Orr, M. and Sinnen, O. (2015) A Duplicate-Free State-Space Model for Optimal Task Scheduling. In: Träff, J.L., et al., Eds., Euro-Par 2015: Parallel Processing, Springer, 97-108. [Google Scholar] [CrossRef
[5] Zheng, J., Okamura, H. and Dohi, T. (2023). Hierarchical Dependability Modeling with Multi-State Systems. 2023 IEEE 28th Pacific Rim International Symposium on Dependable Computing (PRDC), Singapore, 24-27 October 2023, 268-277.[CrossRef
[6] Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C. and Yu, P.S. (2021) A Comprehensive Survey on Graph Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, 32, 4-24. [Google Scholar] [CrossRef] [PubMed]
[7] Liu, C.-L. and Huang, T.-H. (2023) Dynamic Job-Shop Scheduling Problems Using Graph Neural Network and Deep Reinforcement Learning. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53, 6836-6848. [Google Scholar] [CrossRef
[8] Peng, Y., Guo, J. and Yang, C. (2024) Learning Resource Allocation Policy: Vertex-GNN or Edge-GNN? IEEE Transactions on Machine Learning in Communications and Networking, 2, 190-209. [Google Scholar] [CrossRef
[9] 覃雪莲, 刘志学. 考虑供应商竞争的电子商务供应链物流服务模式优化研究[J]. 中国管理科学, 2024, 32(1): 125-136.
[10] 杜艳鑫. 基于AI的电商优化服务模式探索[J]. 电子商务评论, 2025, 14(7): 1662-1670.