对抗紧密包裹与超球面约束的异常检测模型
Adversarial Compact Wrapping with Hyperspherical Constraint for Anomaly Detection
DOI: 10.12677/jisp.2026.151005, PDF,   
作者: 付纯博, 徐俊华, 汤文杰:南京审计大学计算机学院,江苏 南京;杨国为*:南通理工学院信息工程学院,江苏 南通
关键词: 异常检测单类分类对抗学习超球面学习紧密包裹Anomaly Detection One-Class Classification Adversarial Learning Hypersphere Learning Tight Wrapping
摘要: 无监督异常检测模型常面临检测边界模糊和泛化能力弱的难题。现有的深度异常检测模型,例如DeepSVDD,虽然通过超球面约束使正常模式特征分布更加紧密,但难以适配不规则的特征分布。此外,传统检测方法在对“紧贴正常边界且具有高度迷惑性的异常分布”进行精准建模方面还存在不足。为解决这些问题,本文提出了一种基于对抗紧密包裹与超球面约束的异常检测模型。该模型融合了紧密包裹学习、对抗学习以及深度超球面约束的思想,其博弈损失函数与现有方法的博弈损失函数完全不同。具体而言,该损失函数综合了正常样本的紧密包裹损失、异常样本的排斥损失以及对抗样本的生成损失,目标是缩小已知类别特征分布区域与检测模型所确定的类别特征分布区域之间的差异。其中,紧密包裹学习有助于使模型的检测边界更加清晰和准确,而对抗学习则使模型能够学习到更加鲁棒的特征表示,深度超球面约束使异常样本与正常样本分隔更大,从而优化模式特征分布建模并增强模型的泛化能力。实验结果表明,在多个数据集上,与多种现有的异常检测模型相比,本文提出的模型表现更为出色。
Abstract: Unsupervised anomaly detection models often encounter the challenges of ambiguous detection boundaries and weak generalization ability. Existing deep anomaly detection models, such as Deep SVDD, although they make the feature distribution of normal patterns more compact through hypersphere constraints, are difficult to adapt to irregular feature distributions. Moreover, traditional detection methods still have deficiencies in precisely modeling “anomalous distributions that closely adhere to the normal boundary and are highly deceptive”. To address these issues, this paper proposes an anomaly detection model based on adversarial tight wrapping and hypersphere constraints. This model integrates the ideas of tight wrapping learning, adversarial learning, and deep hypersphere constraints, and its game loss function is completely different from that of existing methods. Specifically, this loss function combines the tight wrapping loss of normal samples, the repulsion loss of anomalous samples, and the generation loss of adversarial samples, aiming to minimize the difference between the feature distribution region of known categories and the category feature distribution region determined by the detection model. Among them, tight wrapping learning helps to make the detection boundary of the model clearer and more accurate, while adversarial learning enables the model to learn more robust feature representations. Deep hypersphere constraints increase the separation between anomalous and normal samples, thereby optimizing the modeling of pattern feature distributions and enhancing the generalization ability of the model. Experimental results show that on multiple datasets, the proposed model outperforms many existing anomaly detection models.
文章引用:付纯博, 杨国为, 徐俊华, 汤文杰. 对抗紧密包裹与超球面约束的异常检测模型[J]. 图像与信号处理, 2026, 15(1): 49-63. https://doi.org/10.12677/jisp.2026.151005

参考文献

[1] Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J. and Williamson, R.C. (2001) Estimating the Support of a High-Dimensional Distribution. Neural Computation, 13, 1443-1471. [Google Scholar] [CrossRef] [PubMed]
[2] Liu, F.T., Ting, K.M. and Zhou, Z. (2008). Isolation Forest. 2008 8th IEEE International Conference on Data Mining, Pisa, 15-19 December 2008, 413-422.[CrossRef
[3] Parzen, E. (1962) On Estimation of a Probability Density Function and Mode. The Annals of Mathematical Statistics, 33, 1065-1076. [Google Scholar] [CrossRef
[4] Dempster, A.P., Laird, N.M. and Rubin, D.B. (1977) Maximum Likelihood from Incomplete Data via the em Algorithm. Journal of the Royal Statistical Society Series B: Statistical Methodology, 39, 1-22. [Google Scholar] [CrossRef
[5] Duda, R.O., Hart, P.E. and Stork, D.G. (2001) Pattern Classification. 2nd Edition, Wiley-Interscience.
[6] Vincent, P., Larochelle, H., Bengio, Y. and Manzagol, P. (2008) Extracting and Composing Robust Features with Denoising Autoencoders. Proceedings of the 25th International Conference on Machine Learning, Helsinki, 5-9 July 2008, 1096-1103. [Google Scholar] [CrossRef
[7] Schlegl, T., Seeböck, P., Waldstein, S.M., Schmidt-Erfurth, U. and Langs, G. (2017) Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery. In: Niethammer, M., et al., Eds., Information Processing in Medical Imaging, Springer International Publishing, 146-157. [Google Scholar] [CrossRef
[8] Ruff, L., Vandermeulen, R.A., Görnitz, N., Binder, A., Müller, E., Müller, K. and Kloft, M. (2018) Deep One-Class Classification. Proc. Proceedings of the 35th International Conference on Machine Learning, Stockholm, July 2018, Vol. 80, 4393-4402.
[9] Kim, Y., Lee, S. and Hwang, S.J. (2023) Hypersphere Embedding with Angular Margin for One-Class Classification. in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, June 2023, 14567-14576.
[10] Zhang, Z., Chen, X., Li, J. and Liu, Y. (2024) Adversarial Compact Wrapping Classifier Learning for Open Set Recognition. The IEEE Transactions on Neural Networks and Learning Systems, 35, 345-358.
[11] Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A. and Bengio, Y. (2014) Generative Adversarial Nets. Advances in Neural Information Processing Systems, Montreal, Vol. 27, 2672-2680.
[12] Hendrycks, D., Mazeika, M. and Dietterich, T.G. (2019) Deep Anomaly Detection with Outlier Exposure.
https://arxiv.org/abs/1812.04606
[13] Sabokrou, M., Khalooei, M., Fathy, M. and Adeli, E. (2018) Adversarially Learned One-Class Classifier for Novelty Detection. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Las Vegas, 18-22 June 2018, 3379-3388. [Google Scholar] [CrossRef
[14] Tax, D.M.J. and Duin, R.P.W. (2004) Support Vector Data Description. Machine Learning, 54, 45-66. [Google Scholar] [CrossRef
[15] Xu, B., Wu, X., Wang, P. and Chen, L. (2015) Convolutional Autoencoder for Video Anomaly Detection. IEEE Signal Processing Letters, 22, 1791-1795.
[16] Schlegl, T., Seeböck, P., Waldstein, S.M., Langs, G. and Schmidt-Erfurth, U. (2019) f-AnoGAN: Fast Unsupervised Anomaly Detection with Generative Adversarial Networks. Medical Image Analysis, 54, 30-44. [Google Scholar] [CrossRef] [PubMed]
[17] Cevikalp, H., Trutschel, M. and Trancón y Widemann, B. (2020) Deep Support Vector Data Description for Anomaly Detection. 15th International Conference on Computer Vision Theory and Applications (VISAPP), Valletta, 27-29 February 2020, 532-539.
[18] Russell, S.J., Moreira, L.C., Pimentel, M.A., Bentley, P.J. and Constantinides, G.A. (2016) Anomaly Detection in Wearable Medical Sensors Using Extreme Value Theory. IEEE Engineering Medicine and Biology Society (EMBC), Orlando, 16-20 August 2016, 2122-2125.
[19] Cevikalp, H. and Trancón y Widemann, B. (2017) Deep Compact Hypersphere Classification. 16th International Conference on Machine Learning and Applications (ICMLA), Cancun, 18-21 December 2017, 387-392.
[20] Snell, J., Swersky, K. and Zemel, R. (2017) Prototypical Networks for Few-Shot Learning. 31st International Conference on Neural Information Processing Systems (NeurIPS), Long Beach, 4-9 December 2017, 4077-4087.
[21] 杨国为, 万鸣华, 赖志辉, 等. 具有合适拒识机制的高正确识别率分类器设计[J]. 电子学报, 2021, 49(8): 1569.
[22] Arjovsky, M., Chintala, S. and Bottou, L. (2017) Wasserstein Generative Adversarial Networks. Proceedings of the 34th International Conference on Machine Learning, Vol. 70, 214-223.
[23] Mao, X., Li, Q., Xie, H., Lau, R.Y.K., Wang, Z. and Smolley, S.P. (2017) Least Squares Generative Adversarial Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, October 2017, 2794-2802. [Google Scholar] [CrossRef
[24] Madry, A., Makelov, A., Schmidt, L., Tsipras, D. and Vladu, A. (2018) Towards Deep Learning Models Resistant to Adversarial Attacks. International Conference on Learning Representations (ICLR), Vancouver, 30 April-3 May 2018.
https://arxiv.org/abs/1706.06083
[25] Zenati, M., Foo, C.-S., Lecouat, B., Manek, G. and Lee, V.R.P. (2018) Adversarially Regularized Autoencoders for Generating Realistic Images. International Joint Conference on Artificial Intelligence, Stockholm, 13-19 July 2018, 5338-5344.
[26] Deecke, L., Vandermeulen, R.A., Ruff, L., Mandt, S. and Kloft, M. (2018) Anomaly Detection with Robust Deep Variational Autoencoders. The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Londo, 19-23 August 2018, 1038-1047.
[27] Chen, L., Wang, Z., Zhu, F. and Yuen, P.C. (2024) Adversarial Hypersphere Clustering for Unsupervised Anomaly Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46, 1892-1905.
[28] Ruff, L., Vandermeulen, R.A., Görnitz, N., Binder, A., Müller, E., Müller, K.-R. and Kloft, M. (2019) Deep Semi-Supervised Anomaly Detection. in Proc. 7th International Conference on Learning Representations, New Orleans, 6-9 May 2019.
https://arxiv.org/abs/1906.02694
[29] Cevikalp, H., Uzun, B., Salk, Y., Saribas, H. and Köpüklü, O. (2023) From Anomaly Detection to Open Set Recognition: Bridging the Gap. Pattern Recognition, 138, Article ID: 109385. [Google Scholar] [CrossRef
[30] Bergmann, P., Fauser, M., Sattlegger, D. and Steger, C. (2019) MVTec AD—A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 9584-9592. [Google Scholar] [CrossRef
[31] Li, G., Liu, W., Zhou, L. and Li, H. (2019) DIOR: A New Dataset for Object Detection in Optical Remote Sensing Images. ISPRS Journal of Photogrammetry and Remote Sensing, 151, 119-132.
[32] Fawcett, T. (2006) An Introduction to ROC Analysis. Pattern Recognition Letters, 27, 861-874. [Google Scholar] [CrossRef
[33] Tack, J., Mo, S., Jeong, J. and Shin, J. (2020) CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances. Advances in Neural Information Processing Systems (NeurIPS), Vol. 33, 11839-11852.
[34] Reiss, T., Cohen, N., Bergman, L. and Hoshen, Y. (2021) PANDA: Adapting Pretrained Features for Anomaly Detection and Segmentation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 19-25 June 2021, 2806-2814. [Google Scholar] [CrossRef
[35] Reiss, T. and Hoshen, Y. (2023) Mean-Shifted Contrastive Loss for Anomaly Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 37, 2155-2162. [Google Scholar] [CrossRef
[36] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[37] Tan, M. and Le, Q.V. (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the 36th International Conference on Machine Learning, Long Beach, 9-15 June 2019, 6105-6114.
[38] Huang, G., Liu, Z., Van Der Maaten, L. and Weinberger, K.Q. (2017) Densely Connected Convolutional Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 4700-4708. [Google Scholar] [CrossRef