融合情感因子的多因子选股模型构建与实证分析
Construction and Empirical Analysis of Multi-Factor Stock Selection Model Based on Affective Factor
摘要: 新型冠状病毒肺炎疫情背景下,投资者对医疗行业股票关注度增加,本文通过将代表投资者的情感倾向的情感因子加入到股票因子库中,研究情感因子的融入是否会优化选股效果。首先,选用医疗行业产业链主要股票作为候选股票池,提取241个因子数据,运用mRMR特征筛选算法进行因子优化,按照是否加入情感因子的对比预测方式,利用Stacking方法将机器学习模型进行融合后,构建多因子选股模型。通过对比模型结果,本文证实加入情感因子的模型的预测准确率更高。
Abstract: Under the background of the new coronavirus pneumonia epidemic, investors’ attention to medical industry stocks has increased, and this paper studies whether the integration of affec-tive factors will optimize the stock selection effect by adding affective factors representing investors’ emotional tendencies to the stock factor pool. Firstly, the main stocks of the medical industry chain are selected as the candidate stock pool, 241 factor data are extracted, the mRMR feature screening algorithm is used for factor optimization, and a multi-factor stock selection model is constructed after fusing the machine learning model according to the comparative prediction method of whether or not to add affective factors. By comparing the model results, this paper confirms that the model with affective factors has a higher prediction accuracy.
文章引用:叶陆平. 融合情感因子的多因子选股模型构建与实证分析[J]. 运筹与模糊学, 2023, 13(2): 1372-1378. https://doi.org/10.12677/ORF.2023.132139

参考文献

[1] 费晓晖, 赵永亮. 疫情前后三因素模型对医疗股回报解释力研究[J]. 经济研究导刊, 2021(32): 120-122.
[2] Fama, E.F. and French, K.R. (1992) The Cross-Section of Expected Stock Returns. The Journal of Finance, 47, 427-465. [Google Scholar] [CrossRef
[3] Asness, C.S. (1997) The In-teraction of Value and Momentum Strategies. Financial Analysts Journal, 53, 29-36. [Google Scholar] [CrossRef
[4] Mohanram, P.S. (2005) Separating Winners from Losers among Low Book-to-Market Stocks Using Financial Statement Analysis. Review of Accounting Studies, 10, 133-170. [Google Scholar] [CrossRef
[5] 史永东, 田渊博, 马姜琼, 钟俊华. 多因子模型下投资者情绪对股票横截面收益的影响研究[J]. 投资研究, 2015, 34(5): 48-65.
[6] 田浩. 基于XGBoost的沪深300量化投资策略研究[D]: [硕士学位论文]. 上海: 上海师范大学, 2018.
[7] 赵娣. 基于机器学习方法的多因子选股策略研究[J]. 经济研究导刊, 2022(2): 106-108.
[8] Peng, H., Long, F. and Ding, C. (2005) Feature Selection Based on Mutual Information Criteria of Max Dependency, Max-Relevance, and Min Redundancy. IEEE Transactions on Pattern Analysis & Machine Intelligence, 27, 1226-1238. [Google Scholar] [CrossRef
[9] Balasch, S., Romero, R. and Ferrer, A. (2004) A Logistic Re-gression Model Applied to Evaluate the Influence of Operating Time of AgI Ground Acetonic Generators on the Size and Hardness of Hail. Natural Hazards, 32, 345-355. [Google Scholar] [CrossRef
[10] Breiman, L. (2001) Random Forests. Machine Learning, 45, 5-32. [Google Scholar] [CrossRef
[11] Friedman, J.H. (2001) Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics, 29, 1189-1232. [Google Scholar] [CrossRef