基于XGBoost特征提取的热电联产发电功率预测
Power Forecast of Cogeneration Based on Feature Extraction Using XGBoost
摘要: 精准的短期电力负载预测对电力系统的调度与控制、安全与防御等方面具有重要意义。随着电厂设备的日益复杂化,依靠专家经验或人工提取特征也变得愈发困难。结合极限梯度提升(Extreme Gradient Boosting, XGBoost)在提取特征和处理高维、非线性数据等方面的优势,提出一种基于XGBoost特征提取的融入注意力机制的深度卷积长短期记忆网络(CNN-LSTM-A)模型预测方法。首先,针对短期负荷预测,其电力负荷以及影响因素历史数据的随机性、波动性和不确定性,使用XGBoost对历史序列数据进行特征选择。将处理后的数据通过卷积层进行更深入的特征提取。然后,通过长短期记忆层进行拟合,并采取注意力机制对权值进行优化,减少历史信息的丢失并加强重要信息的影响,最终完成对功率的精准预测。利用浙江某电厂的实测数据进行实验,并与传统常用的经典模型进行对比,实验结果表明,所提方法预测精度更高,验证了该模型的精准性与可行性。
Abstract: Accurate short-term power load prediction is of great significance to power system scheduling and control, security and defense. With the increasing complexity of power plant equipment, it is increasingly difficult to rely on expert experience or manual extraction of features. Combining with Extreme Gradient Boosting (XGBoost) advantages in feature extraction and processing of high-dimensional and nonlinear data, a deep convolutional long and short-term memory network with attention mechanism (CNN-LSTM-AM) model prediction method based on XGBoost feature extraction was proposed. First of all, for short-term load forecasting, the randomness, volatility and uncertainty of historical data of power load and influencing factors, XGBoost is used to select the characteristics of historical sequence data. Further feature extraction is carried out on the processed data through the convolutional layer. Then, the long and short-term memory layer is fitted, and the attention mechanism is adopted to optimize the weights, reduce the loss of historical information and strengthen the influence of important information, and finally complete the accurate prediction of power. The measured data from a power plant in Zhejiang province are used for experiments and compared with the traditional classical model. Experimental results show that the proposed method has higher prediction accuracy, which verifies the accuracy and feasibility of the model.
文章引用:常庆, 罗龙峰. 基于XGBoost特征提取的热电联产发电功率预测[J]. 软件工程与应用, 2022, 11(5): 1105-1122. https://doi.org/10.12677/SEA.2022.115113

参考文献

[1] 关哲. 热电厂汽轮机运行节能降耗研究[J]. 电气技术与经济, 2021(2): 15-17.
[2] IEA (International Energy Agency) (2019) World Energy Outlook 2019.
https://www.iea.org/reports/world-energy-outlook-2019
[3] 郑伟, 姚远, 代邦武, 常玉清, 孙晓辉. 基于数据挖掘的燃煤机组厂级负荷经济调度[J]. 热力发电, 2021, 50(7): 78-83.
[4] 栗然, 马涛, 张潇, 回旭, 刘英培, 尹晓钢. 基于卷积长短期记忆神经网络的短期风功率预测[J]. 太阳能学报, 2021, 42(6): 304-311.
[5] 张航, 李洪双. 结构优化设计的改进交叉熵支持向量机方法[J]. 西北工业大学学报, 2018, 36(3): 448-455.
[6] 常峰铭. 基于深度学习的智能电网短期电力负荷预测研究[D]: [硕士学位论文]. 湘潭: 湘潭大学, 2019.
[7] 栗然, 孙帆, 丁星, 韩怡, 刘英培, 严敬汝. 考虑多能时空耦合的用户级综合能源系统超短期负荷预测方法[J]. 电网技术, 2020, 44(11): 4121-4134.
[8] 刘亚珲, 赵倩. 基于聚类经验模态分解的CNN-LSTM超短期电力负荷预测[J]. 电网技术, 2021, 45(11): 4444-4451.
[9] 杨振舰, 尚佳美, 张众维, 张艳, 刘树东. 基于残差注意力机制的图像去雾算法[J]. 西北工业大学学报, 2021, 39(4): 901-908.
[10] Friedman, J., Hastie, T. and Tibshirani, R. (2000) Additive Logistic Regression: A Statistical View of Boosting. Annals of Statistics, 28, 337-407. [Google Scholar] [CrossRef
[11] Friedman, J. (2002) Stochastic Gradient Boosting. Computational Statistics & Data Analysis, 38, 367-378. [Google Scholar] [CrossRef
[12] Breiman, L. (2001) Random Forests. Maching Learning, 45, 5-32. [Google Scholar] [CrossRef
[13] Friedman, J.H. and Popescu, B.E. (2003) Importance Sampled Learning Ensembles. Journal of Machine Learning Research, 94305, 1-32.
[14] 张科, 苏雨, 王靖宇, 王霰宇, 张彦华. 基于融合特征以及卷积神经网络的环境声音分类系统研究[J]. 西北工业大学学报, 2020, 38(1): 162-169.
[15] 杨格兰, 邓晓军, 刘琮. 基于深度时空域卷积神经网络的表情识别模型[J]. 中南大学学报(自然科学版), 2016, 47(7): 2311-2319.
[16] 董秋成, 吴爱国, 董娜, 冯伟. 用于卷积神经网络图像预处理的目标中心化算法[J]. 中南大学学报(自然科学版), 2019, 50(3): 579-586.
[17] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) Imagenet Classification with Deep Convolutional Neural Networks. 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, 3-6 December 2012, 1097-1105.
[18] 谢清林, 陶功权, 温泽峰. 基于一维卷积神经网络的地铁钢轨波磨识别方法[J]. 中南大学学报(自然科学版), 2021, 52(4): 1371-1379.
[19] 刘畅, 陈雯柏. 一种基于MSDCNN-LSTM的设备RUL预测方法[J]. 西北工业大学学报, 2021, 39(2): 407-413.
[20] Yang, Y.R., Xiong, Q.Y., Wu, C., Zou, Q.H., Yu, Y., Yi, H.L. and Gao, M. (2021) A Study on Water Quality Prediction by a Hybrid CNN-LSTM Model with Attention Mechanism. Environmental Science and Pollution Research International, 28, 55129-55193. [Google Scholar] [CrossRef] [PubMed]
[21] 张珂, 杨歆豪, 张嘉慧, 倪俊芳. 基于高次指数平滑动态边界限制的深度学习优化算法[J]. 信息与控制, 2021, 50(6): 685-693.