基于选择性状态空间的时间序列预测模型
Time Series Prediction Model Based on Selective State Space
DOI: 10.12677/mos.2025.145394, PDF,   
作者: 林琦淇:上海理工大学光电信息与计算机工程学院,上海
关键词: 多变量时序预测小波变换状态空间模型Multivariate Time Series Forecasting Wavelet Transform State Space Model
摘要: 针对时间序列数据特征复杂、模式多样且长期依赖导致模型计算复杂度高等问题,文章提出了一种基于选择性状态空间与时间卷积网络的多尺度时间序列预测模型。该模型通过专家混合网络为不同类型的时间序列数据动态分配专家模块,并结合一维离散小波变换与时间卷积网络捕捉多尺度特征。具体而言,专家模块利用小波变换将时序数据分解为高频和低频分量,分别提取短期波动和长期趋势信息,并通过时间卷积网络进行特征提取。处理后的数据输入编码器,采用基于选择性状态空间的Mamba模型同时建模正向和逆向时序信息,增强时序建模能力。实验结果表明,该模型在交通、经济、天气和电力等领域的多变量时间序列预测任务中均显著优于现有基线模型,验证了其有效性与鲁棒性。
Abstract: To address the challenges of complex time series data characteristics, diverse patterns, and high computational complexity caused by long-term dependencies, this paper proposes a multi-scale time series prediction model based on selective state space and temporal convolutional networks. The model dynamically allocates expert modules to different types of time series data through a mixture of expert networks and captures multi-scale features by integrating one-dimensional discrete wavelet transform with temporal convolutional networks. Specifically, the expert modules decompose time series data into high-frequency and low-frequency components using wavelet transform, extracting short-term fluctuations and long-term trends, respectively, and performing feature extraction through temporal convolutional networks. The processed data is then fed into an encoder, which employs a selective state space-based Mamba model to simultaneously model forward and backward temporal information, enhancing temporal modeling capabilities. Experimental results demonstrate that the model significantly outperforms existing baseline models in multivariate time series prediction tasks across various domains, including transportation, economics, weather, and electricity, validating its effectiveness and robustness.
文章引用:林琦淇. 基于选择性状态空间的时间序列预测模型[J]. 建模与仿真, 2025, 14(5): 293-303. https://doi.org/10.12677/mos.2025.145394

参考文献

[1] Fjellstrom, C. (2022) Long Short-Term Memory Neural Network for Financial Time Series. 2022 IEEE International Conference on Big Data (Big Data), Osaka, 17-20 December 2022, 3496-3504. [Google Scholar] [CrossRef
[2] Angryk, R.A., Martens, P.C., Aydin, B., Kempton, D., Mahajan, S.S., Basodi, S., et al. (2020) Multivariate Time Series Dataset for Space Weather Data Analytics. Scientific Data, 7, Article No. 227. [Google Scholar] [CrossRef] [PubMed]
[3] Chen, C., Petty, K., Skabardonis, A., Varaiya, P. and Jia, Z. (2001) Freeway Performance Measurement System: Mining Loop Detector Data. Transportation Research Record: Journal of the Transportation Research Board, 1748, 96-102. [Google Scholar] [CrossRef
[4] Lai, G., Chang, W., Yang, Y. and Liu, H. (2018) Modeling Long-and Short-Term Temporal Patterns with Deep Neural Networks. The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, 8-12 July 2018, 95-104. [Google Scholar] [CrossRef
[5] Zaremba, W., Sutskever, I. and Vinyals, O. (2014) Recurrent Neural Network Regularization. arXiv: 1409.2329. [Google Scholar] [CrossRef
[6] Zhao, Z., Chen, W., Wu, X., Chen, P.C.Y. and Liu, J. (2017) LSTM Network: A Deep Learning Approach for Short‐term Traffic Forecast. IET Intelligent Transport Systems, 11, 68-75. [Google Scholar] [CrossRef
[7] Elfeky, M.G., Aref, W.G. and Elmagarmid, A.K. (2005) Periodicity Detection in Time Series Databases. IEEE Transactions on Knowledge and Data Engineering, 17, 875-887. [Google Scholar] [CrossRef
[8] Pfander, G.E. and Benedetto, J.J. (2002) Periodic Wavelet Transforms and Periodicity Detection. SIAM Journal on Applied Mathematics, 62, 1329-1368. [Google Scholar] [CrossRef
[9] Wu, H., Xu, J., Wang, J. and Long, M. (2021) Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting. Advances in Neural Information Processing Systems, 34, 22419-22430.
[10] Zhou, T., Ma, Z., Wen, Q., Wang, X., Sun, L. and Jin, R. (2022) FEDformer: Frequency Enhanced Decomposed Transformer for Long-Term Series Forecasting. Proceedings of the 39th International Conference on Machine Learning, Baltimore, 17-23 July 2022, 27268-27286.
[11] Zhou, H., Zhang, S., Peng, J., Zhang, S., Li, J., Xiong, H., et al. (2021) Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 35, 11106-11115. [Google Scholar] [CrossRef
[12] Gu, A. and Dao, T. (2023) Mamba: Linear-Time Sequence Modeling with Selective State Spaces. arXiv: 2312.00752. [Google Scholar] [CrossRef
[13] Mallat, S.G. (1989) A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11, 674-693. [Google Scholar] [CrossRef
[14] Hewage, P., Behera, A., Trovati, M., Pereira, E., Ghahremani, M., Palmieri, F., et al. (2020) Temporal Convolutional Neural (TCN) Network for an Effective Weather Forecasting Using Time-Series Data from the Local Weather Station. Soft Computing, 24, 16453-16482. [Google Scholar] [CrossRef
[15] Ioffe, S. and Szegedy, C. (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Co-Variate Shift. Proceedings of the 32nd International Conference on International Conference on Machine Learning, Lille, 6-11 July 2015, 448-456.
[16] Ba, J.L., Kiros, J.R. and Hinton, G.E. (2016) Layer Normalization. arXiv: 1607.06450. [Google Scholar] [CrossRef
[17] Li, Z., Qi, S., Li, Y. and Xu, Z. (2023) Revisiting Long-Term Time Series Forecasting: An Investigation on Linear Mapping. arXiv:2305.10721. [Google Scholar] [CrossRef
[18] Bebis, G. and Georgiopoulos, M. (1994) Feed-Forward Neural Networks. IEEE Potentials, 13, 27-31. [Google Scholar] [CrossRef
[19] Li, S., Jin, X., Xuan, Y., Zhou, X., Chen, W., Wang, Y.X. and Yan, X. (2019) Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting. Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, 8-14 December 2019, 5243-5253.
[20] Gu, A., Johnson, I., Goel, K., Saab, K., Dao, T., Rudra, A. and Ré, C. (2021) Combining Recurrent, Convolutional, and Continuous-Time Models with Linear State Space Layers. Proceedings of the 35th International Conference on Neural Information Processing Systems, Online, 6-14 December 2021, 572-585.
[21] Das, A., Kong, W., Leach, A., Mathur, S., Sen, R. and Yu, R. (2023) Long-Term Forecasting with Tide: Time-Series Dense Encoder. arXiv: 2304.08424. [Google Scholar] [CrossRef