面向新能源大数据的异常模式检测技术研究

doi:10.12677/CSA.2020.1011214

期刊菜单

面向新能源大数据的异常模式检测技术研究
Research on Abnormal Pattern Detection Technology for New Energy Big Data

DOI: 10.12677/CSA.2020.1011214, PDF, 被引量国家科技经费支持
作者: 吕清泉, 丁坤, 周强, 张健美, 高鹏飞, 张睿骁：国网甘肃省电力公司电力科学研究院，甘肃兰州；侯佳敏：中国人民大学，北京
关键词: 数据挖掘；频繁子图；异常检测；新能源大数据；Data Mining； Frequent Subgraphs； Anomaly Detection； New Energy Big Data

摘要: 随着电力系统规模的日益增大，新能源的不断加入，系统中的知识总量呈爆炸式增长，电力系统运行需基于更高的数据质量实现，以便为系统提供全方位，全周期的数据共享。国内电力信息系统所使用的数据库一般为结构化数据库。而传统关系型数据库在处理大数据复杂关系问题过程中，一系列技术瓶颈日益凸显，传统数据库已经无法满足海量数据的处理建模与分析。本文提出了一种全自动化新能源大数据异常检测的技术方法，它利用知识图谱天然反应数据间现有关系的优势，基于图结构和图顶点的属性信息，对异常图模式进行形式化定义以直接挖掘电网拓扑结构中的异常数据。本文挖掘的异常数据在现实中具有语义信息，在异常数据检测问题上具有可行性和实用价值。通过挖掘富有语义信息的异常图模式，检测新能源大数据中的异常数据，以保证数据的可靠性和准确性，避免错误或无效数据影响电力系统精细化管理和电网安全运行。算例实验效果良好，表明所提出的辨识方法具有理论价值和实际应用价值。

Abstract: With the increasing scale of the power system and the continuous addition of new energy, the total amount of knowledge in the system has exploded. The operation of the power system needs to be realized based on higher data quality in order to provide the system with all-round and full-cycle data shared. The database used by the domestic electric power information system is generally a structured database. In the process of traditional relational databases dealing with the complex relational problems of big data, a series of technical bottlenecks have become increasingly prominent, and traditional databases can no longer satisfy the processing modeling and analysis of massive data. This paper proposes a fully automated new energy big data anomaly detection technology method, which uses the advantages of the existing relationship between the natural response data of the knowledge graph, and based on the graph structure and the attribute information of the graph vertices, the abnormal graph mode is formalized to directly mine abnormal data in the grid topology. The abnormal data mined in this paper has semantic information in reality, and has feasibility and practical value in the detection of abnormal data. By mining abnormal graph patterns rich in semantic information, abnormal data in new energy big data is detected to ensure the reliability and accuracy of the data, and prevent errors or invalid data from affecting the refined management of the power system and the safe operation of the power grid. The experimental results of the calculation examples are good, indicating that the proposed identification method has theoretical value and practical application value.

文章引用：吕清泉, 丁坤, 周强, 张健美, 高鹏飞, 张睿骁, 侯佳敏. 面向新能源大数据的异常模式检测技术研究[J]. 计算机科学与应用, 2020, 10(11): 2024-2033. https://doi.org/10.12677/CSA.2020.1011214

参考文献

[1]	汤亚宸, 方定江, 韩海韵, 等. 基于图数据库和知识图谱的电力设备质量综合管理系统研究[J]. 供用电, 2019, 36(11): 35-40．
[2]	高海翔, 苗璐, 刘嘉宁, 林湘宁, 董锴, 何祥针. 知识图谱及其在电力系统中的应用研究综述[J]. 广东电力, 2020, 33(9): 66-76.
[3]	王琼, 魏军, 闫润珍, 等. 知识图谱在智能电网的应用[J]. 电子元器件与信息技术, 2020, 4(1): 135-137, 147.
[4]	刘峤, 李杨, 段宏, 刘瑶, 秦志光. 知识图谱构建技术综述[J]. 计算机研究与发展, 2016, 53(3): 13-16.
[5]	王渊, 彭晨辉, 王志强, 等. 知识图谱在电网全业务统一数据中心的应用[J]. 计算机工程与应用, 2019, 55(15): 104-109.
[6]	汤亚宸, 方定江, 韩海韵, 等. 基于图数据库和知识图谱的电力设备质量综合管理系统研究[J]. 供用电, 2019(11): 35-40.
[7]	高泽璞, 赵云, 余伊兰, 等. 基于知识图谱的低压配电网拓扑结构辨识方法[J]. 电力系统保护与控制, 2020, 48(2): 34-43.
[8]	严玉良, 董一鸿, 何贤芒, 等. FSMBUS: 一种基于Spark的大规模频繁子图挖掘算法[J]. 计算机研究与发展, 2015, 52(8): 1768-1783.
[9]	Yan, X. and Han, J. (2002) gSpan: Graph-Based Substructure Pattern Mining. Proceedings of the IEEE International Confer-ence on Data Mining, Maebashi City, 9-12 December 2002, 721-724.
[10]	Elseidy, M., Abdelhamid, E., Skiadopoulos, S., et al. (2014) GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph. Proceedings of the Vldb En-dowment, 7, 517-528. [Google Scholar] [CrossRef]
[11]	Suchanek, F.M., Kasneci, G. and Weikum, G. (2007) Yago: A Core of Semantic Knowledge. WWW’07: Proceedings of the 16th International Conference on World Wide Web, May 2007, 697-706. [Google Scholar] [CrossRef]

为你推荐

友情链接