基于深度强化学习的投资组合优化研究
Research on Portfolio Optimization Based on Deep Reinforcement Learning
DOI: 10.12677/sa.2025.148230, PDF,    科研立项经费支持
作者: 肖红姗:四川外国语大学国际工商管理学院,重庆;文渝静, 王 昱:重庆大学经济与工商管理学院,重庆
关键词: 投资组合优化深度强化学习马尔科夫决策过程多源异构数据Portfolio Optimization Deep Reinforcement Learning Markov Decision Process Multi-Source Heterogeneous Data
摘要: 本文将投资组合调仓过程建模为马尔科夫决策过程,基于Actor-Critic算法框架进行策略优化。为增强状态表征能力,本文整合三类多源异构数据(股票历史交易数据、技术指标、K线图和财经新闻标题)丰富模型的状态空间。最后,将从上述三种数据中提取到的特征进行拼接融合,形成深度强化学习算法所需的环境状态。基于这一状态,算法能够学习并优化投资组合的交易策略,以实现收益最大化和风险最小化的目标。在中国A股市场的实证研究表明:本文提出的投资组合优化策略收益显著超越了自定义价格加权指数和其他传统的静态交易策略,多空交易测试验证了其于市场下行期的稳健性。
Abstract: This paper models the portfolio rebalancing process as a Markov Decision Process (MDP) and optimizes the strategy based on the Actor-Critic algorithm framework. To enhance state representation, the study integrates three types of multi-source heterogeneous data—historical stock trading data, technical indicators, and candlestick charts with financial news headlines—to enrich the model’s state space. Finally, the features extracted from these three data sources are concatenated and fused to form the environmental state required by the deep reinforcement learning algorithm. Based on this state, the algorithm can learn and optimize the trading strategy of the portfolio to achieve the dual objectives of maximizing returns and minimizing risks. Empirical research in China’s A-share market demonstrates that the proposed portfolio optimization strategy yields significantly higher returns compared to custom price-weighted indices and other traditional static trading strategies. Long-short trading tests further verify its robustness during market downturns.
文章引用:肖红姗, 文渝静, 王昱. 基于深度强化学习的投资组合优化研究[J]. 统计学与应用, 2025, 14(8): 229-240. https://doi.org/10.12677/sa.2025.148230

参考文献

[1] Markowitz, H. (1952) Portfolio Selection. The Journal of Finance, 7, 77-91. [Google Scholar] [CrossRef
[2] Fama, E.F. and French, K.R. (1993) Common Risk Factors in the Returns on Stocks and Bonds. Journal of Financial Economics, 33, 3-56. [Google Scholar] [CrossRef
[3] 张振环, 吴吉林, 吴睿珂. 厚尾数据的波动率结构变化检验及应用研究[J]. 统计研究, 2023, 40(11): 136-147.
[4] Li, Q., Chen, Y., Wang, J., Chen, Y. and Chen, H. (2018) Web Media and Stock Markets: A Survey and Future Directions from a Big Data Perspective. IEEE Transactions on Knowledge and Data Engineering, 30, 381-399. [Google Scholar] [CrossRef
[5] Li, X., Xie, H., Chen, L., Wang, J. and Deng, X. (2014) News Impact on Stock Price Return via Sentiment Analysis. Knowledge-Based Systems, 69, 14-23. [Google Scholar] [CrossRef
[6] Heaton, J.B., Polson, N.G. and Witte, J.H. (2016) Deep Learning for Finance: Deep Portfolios. Applied Stochastic Models in Business and Industry, 33, 3-12. [Google Scholar] [CrossRef
[7] Troiano, L., Villa, E.M. and Loia, V. (2018) Replicating a Trading Strategy by Means of LSTM for Financial Industry Applications. IEEE Transactions on Industrial Informatics, 14, 3226-3234. [Google Scholar] [CrossRef
[8] Karaoglu, S., Arpaci, U. and Ayvaz S. (2017) A Deep Learning Approach for Optimization of Systematic Signal Detection in Financial Trading Systems with Big Data. International Journal of Intelligent Systems and Applications in Engineering, 31-36. [Google Scholar] [CrossRef
[9] 章宁, 闫劭彬, 范丹. 基于深度学习的收益率预测与投资组合模型[J]. 统计与决策, 2022, 38(23): 48-51.
[10] 李仁宇, 叶子谦. 基于机器学习的基金收益预测[J]. 统计与决策, 2023, 39(11): 156-161.
[11] Deng, Y., Bao, F., Kong, Y., Ren, Z. and Dai, Q. (2017) Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Transactions on Neural Networks and Learning Systems, 28, 653-664. [Google Scholar] [CrossRef] [PubMed]
[12] 韩道岐, 张钧垚, 周玉航, 刘青. 基于深度强化学习的股市操盘手模型研究[J]. 计算机工程与应用, 2020, 56(21): 145-153.
[13] Cui, Y., Che, W., Liu, T., Qin, B., Wang, S. and Hu, G. (2020) Revisiting Pre-Trained Models for Chinese Natural Language Processing. Findings of the Association for Computational Linguistics: EMNLP 2020, Online, 657-668. [Google Scholar] [CrossRef