提取时空特征的无监督时间序列异常检测
Time Series Anomaly Detection Model Based on Deep Mining of Spatio-Temporal Features
DOI: 10.12677/CSA.2022.123062, PDF,   
作者: 姜 昊:北京邮电大学计算机学院(国家示范性软件学院),北京;可信分布式计算与服务教育部重点实验室,北京;郭文明:北京邮电大学计算机学院(国家示范性软件学院),北京;可信分布式计算与服务教育部重点实验室,北京;新疆工程学院信息工程学院,新疆 乌鲁木齐
关键词: AIOps异常检测时间序列时空特征AIOps Abnormal Detection Time Series Spatio-Temporal Features
摘要: 为解决web应用程序及服务中的异常自动发现问题,针对互联网运维中常用监控指标的异常检测提出了一种基于深度挖掘时空特征的时间序列异常检测模型。考虑到web服务场景中异常发现的时效性要求,模型加强了编码器对空间信息的建模能力。模型使用基于VGG+Bi-LSTM的编码器用于挖掘时序数据中时空特征,使用全连接神经网络与Bi-LSTM构成的解码器重构输入数据。异常判定模块基于重构结果与原始输入的偏离程度计算异常得分与发现异常。这是一种无监督、不需要对异常数据进行分布假设,是纯数据驱动的方法。基于重构输入数据的方式使其拥有发现不可预见错误的能力。充分挖掘时空特征使模型能够及时准确地发现异常。实验结果表明,模型相较于目前常用的时间序列异常检测模型具有更高的异常识别准确率。在公开数据集上的实验结果表明,模型召回率提高6%,F1-score提高0.04。
Abstract: To solve the problem of automatic anomaly detection in web applications and services, a time series anomaly detection model based on deep mining of spatio-temporal features is proposed for the anomaly detection of commonly used monitoring indicators in Internet operation and maintenance. Taking into account the timeliness requirements of abnormal discovery in web service scenarios, the model strengthens the encoder’s ability to model spatial information. The model uses an encoder based on VGG+Bi-LSTM to mine spatio-temporal features in time series data, and uses a decoder composed of a fully connected neural network and Bi-LSTM to reconstruct the input data. The abnormality determination module calculates the abnormality score and finds the abnormality based on the degree of deviation between the reconstruction result and the original input. This is an unsupervised method that makes no distribution assumption on the abnormal data, and is a purely data-driven method. Based on the way of reconstructing the input data, it has the ability to find unforeseen errors. Fully excavate spatio-temporal features so that the model can find anomalies in a timely and accurate manner. The experimental results show that the model in this paper has higher recognition accuracy than the current time series anomaly detection models. The experimental results on the public data set show that the model recall is increased by 6%, and the F1-score is increased by 0.04.
文章引用:姜昊, 郭文明. 提取时空特征的无监督时间序列异常检测[J]. 计算机科学与应用, 2022, 12(3): 610-621. https://doi.org/10.12677/CSA.2022.123062

参考文献

[1] Beyer, B., Jones, C., Petoff, J. and Jones, C. (2016) Site Reliability Engineering: How Google Runs Production Systems. O’Reilly Media, Inc., Sebastopol.
[2] Dang, Y., Lin, Q. and Huang, P. (2019) Aiops: Real-World Challenges and Re-search Innovations. 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion), Montreal, 25-31 May 2019, 4-5. [Google Scholar] [CrossRef
[3] Sadik, M.S. and Gruenwald, L. (2010) DBOD-DS: Distance Based Outlier Detection for Data Streams. International Conference on Database and Expert Systems Applica-tions, Bilbao, 30 August-3 September, 122-136. [Google Scholar] [CrossRef
[4] Sadik, S. and Gruenwald, L. (2014) Research Issues in Outlier Detection for Data Streams. ACM SIGKDD Explorations Newsletter, 15, 33-40. [Google Scholar] [CrossRef
[5] Laptev, N., Amizadeh, S. and Flint, I. (2015) Generic and Scalable Framework for Automated Time-Series Anomaly Detection. Proceedings of the 21th ACM SIGKDD International Con-ference on Knowledge Discovery and Data Mining, Sydney, 10-13 August 2015, 1939-1947. [Google Scholar] [CrossRef
[6] Liu, D., Zhao, Y., Xu, H., Sun, Y., Pei, D., Luo, J., et al. (2015) Opprentice: Towards Practical and Automatic Anomaly Detection through Machine Learning. Proceedings of the 2015 Internet Measurement Conference, Tokyo, 28-30 October 2015, 211-224. [Google Scholar] [CrossRef
[7] Wang, J., Jing, Y., Qi, Q., Feng, T. and Liao, J. (2019) ALSR: An Adaptive Label Screening and Relearning Approach for Interval-Oriented Anomaly Detection. Expert Systems with Ap-plications, 136, 94-104. [Google Scholar] [CrossRef
[8] Agarwal, D. (2005) An Empirical Bayes Approach to Detect Anomalies in Dynamic Multidimensional Arrays. 5th IEEE International Conference on Data Mining (ICDM’05), Hou-ston, 27-30 November 2005, 8 p. [Google Scholar] [CrossRef
[9] Eskin, E. (2000) Anomaly Detection over Noisy Data Using Learned Probability Distributions. Proceedings of the 17th International Conference on Machine Learning, Stanford, 29 June-2 July, 2000, 255-262.
[10] Chandola, V., Banerjee, A. and Kumar, V. (2009) Anomaly Detection: A Survey. ACM Computing Surveys, 41, Article No. 15. [Google Scholar] [CrossRef
[11] Hawkins, S., He, H. and Williams, G. (2002) Outlier Detection Using Replicator Neural Networks. International Conference on Data Ware-Housing and Knowledge Discovery, Aix-en-Provence, 4-6 September 2002, 170-180. [Google Scholar] [CrossRef
[12] Sakurada, M. and Yairi, T. (2014) Anomaly Detection Using Au-toencoders with Nonlinear Dimensionality Reduction. Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, Gold Coast Australia, 2 December 2014, 4-11. [Google Scholar] [CrossRef
[13] Chauhan, S. and Vig, L. (2015) Anomaly Detection in ECG Time Signals via Deep Long Short-Term Memory Networks. 2015 IEEE International Conference on Data Science and Ad-vanced Analytics (DSAA), Paris, 19-21 October 2015, 1-7. [Google Scholar] [CrossRef
[14] Malhotra, P., Ramakrishnan, A. and Anand, G. (2016) LSTM-Based Encoder-Decoder for Multi-Sensor Anomaly Detection. 2016 International Conference on Machine Learning, New York, 19-24 June 2016. arXiv pre-print arXiv: 1607.00148.
[15] Xu, H., Chen, W., Zhao, N., Li, Z., Bu, J., Li, Z., et al. (2018) Unsupervised Anomaly Detection via Variational Auto-Encoder for Seasonal Kpis in Web Applications. Proceedings of the 2018 World Wide Web Conference, Lyon, April 2018, 187-196. [Google Scholar] [CrossRef
[16] Bashar, M.A. and Nayak, R. (2020) TAnoGAN: Time Series Anomaly Detection with Generative Adversarial Networks. 2020 IEEE Symposium Series on Computational Intelligence (SSCI), Canberra, 1-4 December 2020, 1778-1785. [Google Scholar] [CrossRef
[17] Hundman, K., Constantinou, V., Laporte, C., Colwell, I. and Soderstrom, T. (2018) Detecting Spacecraft Anomalies Using Lstms and nonparametric Dynamic Thresholding. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, 19-23 August 2018, 387-395. [Google Scholar] [CrossRef
[18] Su, Y., Zhao, Y., Niu, C., Liu, R., Sun, W. and Pei, D. (2019) Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, 4-8 August 2019, 2828-2837. [Google Scholar] [CrossRef
[19] 戚琦, 申润业, 王敬宇. GAD: 基于拓扑感知的时间序列异常检测[J]. 通信学报, 2020, 41(6): 152-160.