数据挖掘技术在量化交易指标选取中的应用
Research on Selecting Quantitative Trading Indicators Based on Data Mining
摘要: 股票市场的技术分析法是近年来量化投资研究的新热点,数据挖掘作为一门交叉学科,能够大幅度提高经济学中预测类研究的效果。本文系统性地运用数据挖掘技术,挖掘影响股票涨跌的关键指标,提升量化投资中的股票收益预测模块。本文设置特定观察期与交易策略,基于2021年1月至2022年1月A股市场的17项因子,采用数据挖掘中关联规则FP-Growth算法,使用Python编程语言对上市公司股票数据进行关联分析。在支持度阈值为30%、置信度阈值为80%条件下,挖掘出35条关联规则。实证结果显示,K线形态为阳线与短长下影、每股收益高、市盈率低、市净率低、市销率低、营业总收入同比增长率高这六大特征对股票未来收益率的影响较大。同时验证了投资策路的有效性和可行性,为股票分析以及股票投资中的特征指标的关注提供有意义的参考依据。
Abstract: The technical analysis method of the stock market has become a new hot topic in quantitative investment research in recent years. As an interdisciplinary field, data mining can greatly improve the effectiveness of predictive research in economics. This article systematically applies data mining techniques to explore key indicators that affect stock price fluctuations and improve the stock return prediction module in quantitative investment. This article sets specific observation periods and trading strategies, based on 17 factors in the A-share market from January 2021 to January 2022. The FP-Growth algorithm of association rules in data mining is used to conduct association analysis on the stock data of listed companies using Python programming language. Under the conditions of a support threshold of 30% and a confidence threshold of 80%, 35 association rules were mined. The empirical results show that the K-line pattern is characterized by a positive line and a short shadow, high earnings per share, low P/E ratio, low P/B ratio, low P/S ratio, and high year-on-year growth rate of total operating revenue, which have a significant impact on the future return of stocks. At the same time, the effectiveness and feasibility of investment strategies were verified, providing meaningful reference for stock analysis and the attention to characteristic indicators in stock investment.
参考文献
|
[1]
|
靳雨佳. 基于数据挖掘技术的股票市场分析与预测[D]: [硕士学位论文]. 长春: 吉林财经大学, 2018.
|
|
[2]
|
Chong, E., Han, C. and Park, F.C. (2017) Deep Learning Networks for Stock Market Analysis and Prediction: Methodology, Data Representations, and Case Studies. Expert Systems with Applications, 83, 187-205. [Google Scholar] [CrossRef]
|
|
[3]
|
兰秋军, 马超群, 甘国君, 等. 中国股市弱有效吗?——来自数据挖掘的实证研究[J]. 中国管理科学, 2005, 13(4): 17-23.
|
|
[4]
|
文娇. 关联规则挖掘算法在证券交易中的应用[D]: [硕士学位论文]. 长沙: 湖南大学, 2016.
|
|
[5]
|
郑海荣. 数据挖掘技术在股票分析与预测中的应用[D]: [硕士学位论文]. 哈尔滨: 哈尔滨工业大学, 2009.
|
|
[6]
|
Agrawal, R., Imieliński, T. and Swami, A. (1993) Mining Association Rules between Sets of Items in Large Databases. ACM SIGMOD Record, 22, 207-216. [Google Scholar] [CrossRef]
|
|
[7]
|
王赟. 基于灰色关联分析的多因子选股模型研究[D]: [硕士学位论文]. 北京: 北京交通大学, 2018.
|
|
[8]
|
黄海波. 数据挖掘技术在股票数据分析中的应用[J]. 商业文化, 2022(13): 92-94.
|
|
[9]
|
唐东升. 会计信息对股价影响的实证分析[J]. 统计与决策, 2012(6): 172-173.
|
|
[10]
|
杨艳林. 上市公司财务指标对股价影响的实证研究——以电力行业为例[J]. 中外企业家, 2016(6): 19-20.
|
|
[11]
|
闻岳春, 段弈冰. 创业板高管离职事件对股价影响的实证分析[J]. 证券市场导报, 2013(7): 31-35.
|