基于Leddam-iTransformer-FCB的日频股价预测方法
Daily Frequency Stock Price Prediction Method Based on Leddam-iTransformer-FCB
DOI: 10.12677/fin.2026.163039, PDF,   
作者: 余孟哲:华北电力大学(保定)经济管理系,河北 保定;徐铭申*:华北电力大学(保定)数理系,河北 保定;程薄绵:成都理工大学管理科学学院,四川 成都
关键词: 股价预测多变量时间序列可学习分解Stock Price Forecasting Multivariate Time Series Learnable Decomposition
摘要: 日频股价预测同时受到市场噪声、弱非平稳性以及价格变量与流动性相关变量耦合关系的共同影响,因而仍是量化金融中的一项具有挑战性的研究问题。本文围绕归档的Leddam-iTransformer-FCB实现,考察其在一步日频股价预测场景下的模型机理与实验表现。该模型首先对输入样本执行样本级归一化,随后利用可学习分解模块提取各变量中的平滑主成分,再通过倒置Transformer将变量轨迹视为token以建模跨变量依赖关系,并在潜在空间中引入Fourier Convolution Block进行全局频谱混合,最终完成预测输出。依据归档代码、checkpoint与测试预测序列,可以重建出该模型工作于一个包含5个变量、共2085个日频观测、采用60%/20%/20%划分、回溯窗口长度为12个交易日、预测步长为1日的股价预测设置。实验结果表明,完整模型在测试集上的性能为R2 = 0.9653、MSE = 1.5547、RMSE = 1.2469、MAE = 0.9294、MAPE = 1.2655%。结果表明,可学习平滑、跨变量注意力与潜在频谱混合在短期股价预测任务中具有明显的互补作用。
Abstract: Daily stock price forecasting remains a challenging research topic in quantitative finance due to the combined effects of market noise, weak non-stationarity, and coupling relationships between price variables and liquidity-related variables. This study investigates the model mechanism and experimental performance of the archived Leddam-iTransformer-FCB framework in one-step daily price prediction scenarios. The model first performs sample-level normalization on input data, then employs a learnable decomposition module to extract smooth principal components from variables. An inverted Transformer treats variable trajectories as tokens to model cross-variable dependencies, while Fourier Convolution Blocks in latent spaces facilitate global spectral mixing for prediction output. Using archived code, checkpoints, and test sequences, the model operates under the following parameters: 5 variables, 2085 daily observations, 60%/20%/20% data split, 12-day backtracking window, and 1-day prediction horizon. Experimental results demonstrate the model achieves R2 = 0.9653, MSE = 1.5547, RMSE = 1.2469, MAE = 0.9294, and MAPE = 1.2655% on the test set. The results indicate that learnable smoothing, cross-variable attention, and latent spectrum mixing exhibit significant complementary effects in short-term stock price prediction tasks.
文章引用:余孟哲, 徐铭申, 程薄绵. 基于Leddam-iTransformer-FCB的日频股价预测方法[J]. 金融, 2026, 16(3): 391-406. https://doi.org/10.12677/fin.2026.163039

参考文献

[1] Malkiel, B.G. and Fama, E.F. (1970) Efficient Capital Markets: A Review of Theory and Empirical Work. The Journal of Finance, 25, 383-417. [Google Scholar] [CrossRef
[2] Lo, A.W. and MacKinlay, A.C. (1988) Stock Market Prices Do Not Follow Random Walks: Evidence from a Simple Specification Test. Review of Financial Studies, 1, 41-66. [Google Scholar] [CrossRef
[3] Tsay, R.S. (2010). Analysis of Financial Time Series. 3rd Edition, Wiley.[CrossRef
[4] Kim, K. (2003) Financial Time Series Forecasting Using Support Vector Machines. Neurocomputing, 55, 307-319. [Google Scholar] [CrossRef
[5] Atsalakis, G.S. and Valavanis, K.P. (2009) Surveying Stock Market Forecasting Techniques—Part II: Soft Computing Methods. Expert Systems with Applications, 36, 5932-5941. [Google Scholar] [CrossRef
[6] Bao, W., Yue, J. and Rao, Y. (2017) A Deep Learning Framework for Financial Time Series Using Stacked Autoencoders and Long-Short Term Memory. PLOS ONE, 12, e0180944. [Google Scholar] [CrossRef] [PubMed]
[7] Fischer, T. and Krauss, C. (2018) Deep Learning with Long Short-Term Memory Networks for Financial Market Predictions. European Journal of Operational Research, 270, 654-669. [Google Scholar] [CrossRef
[8] Sezer, O.B., Gudelek, M.U. and Ozbayoglu, A.M. (2020) Financial Time Series Forecasting with Deep Learning: A Systematic Literature Review: 2005-2019. Applied Soft Computing, 90, Article ID: 106181. [Google Scholar] [CrossRef
[9] Qin, Y., Song, D., Chen, H., Cheng, W., Jiang, G. and Cottrell, G.W. (2017) A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, Melbourne, 19-25 August 2017, 2627-2633. [Google Scholar] [CrossRef
[10] Lim, B., Arık, S.Ö., Loeff, N. and Pfister, T. (2021) Temporal Fusion Transformers for Interpretable Multi-Horizon Time Series Forecasting. International Journal of Forecasting, 37, 1748-1764. [Google Scholar] [CrossRef
[11] Bai, S., Kolter, J.Z. and Koltun, V. (2018) An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. arXiv: 1803.01271
[12] Oreshkin, B.N., Carpov, D., Chapados, N. and Bengio, Y. (2020) N-BEATS: Neural Basis Expansion Analysis for Interpretable Time Series Forecasting. arXiv: 1905.10437
[13] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L. and Polosukhin, I. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems, 30, 5998-6008.
[14] Zhou, H., Zhang, S., Peng, J., Zhang, S., Li, J., Xiong, H., et al. (2021) Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 35, 11106-11115. [Google Scholar] [CrossRef
[15] Wu, H., Xu, J., Wang, J. and Long, M. (2021) Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting. Advances in Neural Information Processing Systems, 34, 22419-22430.
[16] Zhou, T., Ma, Z., Wen, Q., Wang, X., Sun, L. and Jin, R. (2022) FEDformer: Frequency Enhanced Decomposed Transformer for Long-Term Series Forecasting. Proceedings of the 39th International Conference on Machine Learning, Maryland, 17-23 July 2022, 27268-27286.
[17] Zeng, A., Chen, M., Zhang, L. and Xu, Q. (2023) Are Transformers Effective for Time Series Forecasting? Proceedings of the AAAI Conference on Artificial Intelligence, 37, 11121-11128. [Google Scholar] [CrossRef
[18] Nie, Y., Nguyen, N.H., Sinthong, P. and Kalagnanam, J. (2023) A Time Series Is Worth 64 Words: Long-Term Forecasting with Transformers. arXiv: 2211.14730
[19] Kim, T., Kim, J., Tae, Y., Park, C., Choi, J.H. and Choo, J. (2022) Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift. International Conference on Learning Representations 2022, 25-29 April 2022, 16621-16643.
[20] Liu, Y., Long, M., Wang, J. and Wu, H. (2022) Non-Stationary Transformers: Exploring the Stationarity in Time Series Forecasting. Advances in Neural Information Processing Systems 35, New Orleans, 28 November-9 December 2022, 9881-9893. [Google Scholar] [CrossRef
[21] Liu, Y., Hu, T., Zhang, H., Wu, H., Wang, S., Ma, L. and Long, M. (2024) iTransformer: Inverted Transformers Are Effective for Time Series Forecasting. arXiv: 2310.06625
[22] Yu, G., Zou, J., Hu, X., Aviles-Rivero, A.I., Qin, J. and Wang, S. (2024) Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling. Proceedings of the 41st International Conference on Machine Learning, Vienna, 21-27 July 2024, 57818-57841.
[23] Wang, Y., Liu, Y., Duan, X. and Wang, K. (2025) FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 39, 21375-21383. [Google Scholar] [CrossRef
[24] Sun, H., Li, Y., Li, Z., Yang, R., Xu, Z., Dou, J., et al. (2025) Fourier Convolution Block with Global Receptive Field for MRI Reconstruction. Medical Image Analysis, 99, Article ID: 103349. [Google Scholar] [CrossRef] [PubMed]