一种融合稀疏注意力机制和BlockDiagonal矩阵的xLSTM-Informer深度学习模型
A Deep Learning Model of xLSTM-Informer Fusing Sparse Attention Mechanism and BlockDiagonal Matrices
DOI: 10.12677/airr.2026.153065, PDF,    科研立项经费支持
作者: 王鹏超, 黎锁平*, 周永强:兰州理工大学理学院,甘肃 兰州
关键词: xLSTM-Informer稀疏注意力机制BlockDiagonal电力现货价格xLSTM-Informer Sparse Attention BlockDiagonal Electricity Spot Price
摘要: 在新能源高渗透率背景下,电力现货价格频繁波动,传统模型在处理这类非线性、非平稳时序数据时存在性能瓶颈。本文提出xLSTM-Informer模型,融合xLSTM、稀疏注意力机制(ProbSparse Attention)及BlockDiagonal结构压缩方法,构建出适应强波动市场特征的预测架构。其中xLSTM顺序堆叠了一个xLSTM乘性门控单元(mLSTM)和一个结构化LSTM (sLSTM)模块,并且将BlockDiagonal结构压缩方法运用到mLSTM中,更好地提取时序局部特征的注意力,旨在处理长序列建模时信息传递受限、参数冗余以及时序特征表达能力不足等问题。模型在输入数据归一化过程中联合采用层归一化与组归一化策略,增强对尺度异构数据的鲁棒性。实验采用甘肃省2024年上半年电力现货市场数据,在多种天气场景(晴天、大风、阴天、无风)下进行测试。结果表明,xLSTM-Informer在五项主要评价指标(RMSE, MAE, MAPE, TIC, R2)上均优于CNN、CNN-LSTM、Informer和iTransformer等基准模型。其中在典型晴天场景下,RMSE仅为8.863,MAPE为4.556%,R2高达0.987,展现出对极端市场变化的强适应能力和稳定性。
Abstract: Against the backdrop of high new energy penetration, electricity spot prices exhibit frequent fluctuations, and traditional models suffer from performance bottlenecks when modeling such nonlinear and non-stationary time series. This paper proposes the xLSTM-Informer model, which integrates xLSTM, ProbSparse Attention, and a BlockDiagonal matrix compression method to construct a prediction framework adapted to highly volatile market characteristics. Specifically, xLSTM sequentially stacks a multiplicative LSTM (mLSTM) unit and a structured LSTM (sLSTM) module, with the BlockDiagonal compression method incorporated into the mLSTM to better capture attention over local time-series features. This design addresses the limitations of long-sequence modeling, including constrained information propagation, parameter redundancy, and insufficient representation of temporal patterns. To enhance robustness against scale-heterogeneous data, the model combines layer normalization and group normalization during input data standardization. Experiments are conducted using electricity spot market data from Gansu Province in the first half of 2024, evaluated across multiple weather scenarios (sunny, windy, cloudy, calm). The results demonstrate that xLSTM-Informer outperforms baseline models such as CNN, CNN-LSTM, Informer, and iTransformer across five key evaluation metrics (RMSE, MAE, MAPE, TIC, R2). In the typical sunny scenario, the model achieves an RMSE of 8.863, MAPE of 4.556%, and R2 of 0.987, verifying its strong adaptability and stability under extreme market variations.
文章引用:王鹏超, 黎锁平, 周永强. 一种融合稀疏注意力机制和BlockDiagonal矩阵的xLSTM-Informer深度学习模型[J]. 人工智能与机器人研究, 2026, 15(3): 685-700. https://doi.org/10.12677/airr.2026.153065

参考文献

[1] Pikus, M. and Wąs, J. (2024) Predictive Modeling of Renewable Energy Purchase Prices Using Deep Learning Based on Polish Power Grid Data for Small Hybrid PV Microinstallations. Energies, 17, Article 628. [Google Scholar] [CrossRef
[2] Baskan, D.E., Meyer, D., Mieck, S., Faubel, L., Klöpper, B., Strem, N., et al. (2023) A Scenario-Based Model Comparison for Short-Term Day-Ahead Electricity Prices in Times of Economic and Political Tension. Algorithms, 16, Article 177. [Google Scholar] [CrossRef
[3] Weron, R. (2014) Electricity Price Forecasting: A Review of the State-of-the-Art with a Look into the Future. International Journal of Forecasting, 30, 1030-1081. [Google Scholar] [CrossRef
[4] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[5] Kong, W., Dong, Z.Y., Jia, Y., Hill, D.J., Xu, Y. and Zhang, Y. (2019) Short-Term Residential Load Forecasting Based on LSTM Recurrent Neural Network. IEEE Transactions on Smart Grid, 10, 841-851. [Google Scholar] [CrossRef
[6] Rafi, S.H., Nahid-Al-Masood, Deeba, S.R. and Hossain, E. (2021) A Short-Term Load Forecasting Method Using Integrated CNN and LSTM Network. IEEE Access, 9, 32436-32448. [Google Scholar] [CrossRef
[7] Zhong, B. (2023) Deep Learning Integration Optimization of Electric Energy Load Forecasting and Market Price Based on the ANN-LSTM-Transformer Method. Frontiers in Energy Research, 11, Article 1292204. [Google Scholar] [CrossRef
[8] Yang, G., Du, S., Duan, Q. and Su, J. (2022) Short-term Price Forecasting Method in Electricity Spot Markets Based on Attention-LSTM-MTCN. Journal of Electrical Engineering & Technology, 17, 1009-1018. [Google Scholar] [CrossRef
[9] 武永江. 深度学习在电力现货价格预测中的应用研究[D]: [硕士学位论文]. 太原: 太原科技大学, 2021.
[10] Laitsos, V., Vontzos, G., Bargiotas, D., Daskalopulu, A. and Tsoukalas, L.H. (2024) Data-Driven Techniques for Short-Term Electricity Price Forecasting through Novel Deep Learning Approaches with Attention Mechanisms. Energies, 17, Article 1625. [Google Scholar] [CrossRef
[11] Kılıç, D.K., Nielsen, P. and Thibbotuwawa, A. (2024) Intraday Electricity Price Forecasting via LSTM and Trading Strategy for the Power Market: A Case Study of the West Denmark DK1 Grid Region. Energies, 17, Article 2909. [Google Scholar] [CrossRef
[12] Bâra, A., Oprea, S. and Băroiu, A. (2023) Forecasting the Spot Market Electricity Price with a Long Short-Term Memory Model Architecture in a Disruptive Economic and Geopolitical Context. International Journal of Computational Intelligence Systems, 16, Article No. 130. [Google Scholar] [CrossRef
[13] Shejul, K., Harikrishnan, R. and Kukker, A. (2024) Short‐Term Electricity Price Forecasting Using the Empirical Mode Decomposed Hilbert‐LSTM and Wavelet‐LSTM Models. Journal of Electrical and Computer Engineering, 2024, Article ID: 4575735. [Google Scholar] [CrossRef
[14] Liu, B., Li, Z., Li, Z. and Chen, C. (2024) CL-Informer: Long Time Series Prediction Model Based on Continuous Wavelet Transform. PLOS ONE, 19, e0303990. [Google Scholar] [CrossRef] [PubMed]
[15] Tran, N.T and Xin, J. (2023) Fourier-Mixed Window Attention: Accelerating Informer for Long Sequence Time-Series Forecasting. arXiv: 2307.00493.
[16] Xu, H., Peng, Q., Wang, Y. and Zhan, Z. (2023) Power-Load Forecasting Model Based on Informer and Its Application. Energies, 16, Article 3086. [Google Scholar] [CrossRef
[17] 程思远, 杨强, 石文娟, 等. 基于LSTM + Self-Attention的电力价格预测模型研究[C]//中国电机工程学会电力信息化专业委员会, 国家电网公司信息通信分公司. 2023电力行业信息化年会论文集. 北京: 北京中电普华信息技术有限公司, 2023: 127-131.