基于混合防御架构的深度伪造检测与防御系统
Deepfake Detection and Defense System Based on Hybrid Defense Architecture
摘要: 深度伪造(Deepfake)技术的快速演进使高质量伪造内容泛滥,对个人隐私、舆论安全与社会治理构成严峻威胁。现有防御方法普遍存在检测手段单一、主动与被动防御割裂、跨数据集泛化能力不足等问题。为此,文章提出一种“主动防御 + 被动检测”的混合防御架构。在主动防御方面,提出梯度引导的区域自适应对抗扰动生成方法,结合人脸显著性图与梯度信息实现感知不可见的精准扰动分配,并设计基于DWT-DCT-SVD级联变换与K-means盲判决的鲁棒盲水印方案,支持“在线溯源 + 本地验真”双模式可信链。在被动检测方面,构建多源证据融合检测框架:以改进的双分支注意力增强ResNet-50为核心检测器,融合频域–纹理–几何传统特征分析层与视觉大模型语义推理层,并引入基于Dempster-Shafer证据理论的自适应融合决策机制替代固定权重方案,实现多源证据的不确定性建模与冲突消解。在FaceForensics++数据集上的实验表明,该系统性能良好,面对不同类型伪造Face2Face、FaceSwap、NeuralTextures检测准确率分别达到97.12%、95.58%和89.44%,优于现有主流方法。
Abstract: The rapid evolution of Deepfake technology has led to the proliferation of high-quality forged content, posing a severe threat to personal privacy, public opinion security, and social governance. Existing defense methods generally have problems, such as single detection means, the separation of active and passive defenses, and insufficient cross-dataset generalization ability. Therefore, this paper proposes a hybrid defense architecture of “active defense + passive detection”. In terms of active defense, a method for generating gradient-guided region-adaptive adversarial perturbations is proposed. By combining face saliency maps and gradient information, precise perturbation allocation that is imperceptible to perception is achieved. A robust blind watermarking scheme based on the cascade transformation of DWT-DCT-SVD and K-means blind decision is designed to support the dual-mode trusted chain of “online traceability + local verification”. In terms of passive detection, a multi-source evidence fusion detection framework is constructed: an improved dual-branch attention-enhanced ResNet-50 is used as the core detector, which integrates the traditional feature analysis layers of frequency domain-texture-geometry and the semantic reasoning layer of large vision models. An adaptive fusion decision mechanism based on Dempster-Shafer evidence theory is introduced to replace the fixed weight scheme, realizing the uncertainty modeling and conflict resolution of multi-source evidence. Experiments on the FaceForensics++ dataset show that the system has good performance. The detection accuracies for different types of forgeries, Face2Face, FaceSwap, and NeuralTextures, reach 97.12%, 95.58%, and 89.44%, respectively, which are better than existing mainstream methods.
文章引用:姬懿轩, 袁杨坤, 甄博文, 王喆宇, 于越, 康晓凤. 基于混合防御架构的深度伪造检测与防御系统[J]. 数据挖掘, 2026, 16(2): 60-71. https://doi.org/10.12677/hjdm.2026.162006

参考文献

[1] Goodfellow, I., Pouget-Abadie, J., Mirza, M., et al. (2014) Generative Adversarial Nets. NIPS’14: Proceedings of the 28th International Conference on Neural Information Processing Systems, Volume 2, 2672-2680.
[2] 梁俊杰, 韦舰晶, 蒋正锋. 生成对抗网络GAN综述[J]. 北京: 计算机科学与探索, 2020, 14(1): 1-17.
[3] 左哲铭. 基于自编码框架的人脸交换方法研究[D]: [硕士学位论文]. 南京: 南京理工大学, 2023.
[4] Mirsky, Y. and Lee, W. (2021) The Creation and Detection of Deepfakes: A Survey. ACM Computing Surveys, 54, 1-41. [Google Scholar] [CrossRef
[5] 王振波, 吴湘玲. 数字时代深度伪造技术研究——机理特征、功能异化及其优化理路[J]. 北京航空航天大学学报社会科学版, 2025, 38(2): 47-55.
[6] 黄晓宝. 基于图像特征提取与融合的深度人脸伪造检测[D]: [硕士学位论文]. 南昌: 南昌大学, 2025.
[7] 姚文达, 李盼池, 赵娅, 等. 人脸深度伪造检测方法研究综述[J]. 中国图象图形学报, 2025, 30(7): 2343-2363.
[8] 瞿左珉, 殷琪林, 盛紫琦, 等. 人脸深度伪造主动防御技术综述[J]. 中国图象图形学报, 2024, 29(2): 318-342.
[9] 刘晓龙, 刘欢, 赵耀, 等. AIGC伪造内容被动检测与主动防御技术综述[J/OL]. 中国科学: 信息科学, 2025, 55(9): 2250-2288.
https://link.cnki.net/urlid/11.5846.TP.20251020.0913.002, 2026-02-13.
[10] 丁峰, 匡仁盛, 周越, 等. 深度伪造及其取证技术综述[J]. 中国图象图形学报, 2024, 29(2): 295-317.
[11] 杨睿, 胡心如, 黄卓超, 等. 深度网络生成式伪造人脸检测方法研究综述[J]. 计算机辅助设计与图形学学报, 2024, 36(10): 1491-1510.
[12] Durall, R., Keuper, M., Pfreundt, F.J., et al. (2019) Unmasking Deepfakes with Simple Features.
https://arxiv.org/abs/1911.00686
[13] Liu, Z., Qi, X. and Torr, P.H.S. (2020) Global Texture Enhancement for Fake Face Detection in the Wild. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 14-19 June 2020, 8057-8066. [Google Scholar] [CrossRef
[14] 朱新同, 唐云祁, 耿鹏志. 基于特征融合的篡改与深度伪造图像检测算法[J]. 信息网络安全, 2021, 21(8): 70-81.
[15] Abdullah, M.T., Hussein, N. and Ali, M. (2023) Deepfake Detection Improvement for Images Based on a Proposed Method for Local Binary Pattern of the Multiple-Channel Color Space. International Journal of Intelligent Engineering and Systems, 16, 92-104.
[16] Rossler, A., Cozzolino, D., Verdoliva, L., Riess, C., Thies, J. and Niessner, M. (2019) Faceforensics++: Learning to Detect Manipulated Facial Images. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27-28 October 2019, 1-11. [Google Scholar] [CrossRef
[17] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[18] 汤博宇, 焦良葆, 徐逸, 等. 基于改进ResNet-50的图像特征提取网络[J]. 计算机测量与控制, 2023, 31(6): 162-167.
[19] 张文祥, 王夏黎, 王欣仪, 等. 一种强化伪造区域关注的深度伪造人脸检测方法[J]. 图学学报, 2025, 46(1): 47-58.
[20] 陈鑫, 高迪, 蒲志明, 等. 基于多模态融合与注意力增强的深度伪造检测方法[J/OL]. 物联网学报, 1-12.
https://link.cnki.net/urlid/10.1491.TP.20260210.1057.002, 2026-02-13.
[21] 许楷文, 周翊超, 谷文权, 等. 基于多尺度特征融合重建学习的深度伪造人脸检测算法[J]. 信息网络安全, 2024, 24(8): 1173-1183.
[22] 彭春蕾, 李俊烨, 刘德成, 等. 大模型时代的深度伪造检测[J]. 中国科学: 信息科学, 2026, 56(1): 1-22.
[23] 陈泊睿, 张梅, 李昕蕊, 等. 数字水印技术原理与发展[J]. 中国防伪报道, 2025(10): 98-102.
[24] 高媛. 基于离散小波变换和奇异值分解的数字水印改进算法研究[D]: [硕士学位论文]. 合肥: 安徽建筑大学, 2021.
[25] 陈国凯, 冯辉. 深度学习中对抗样本攻击与防御方法研究[J]. 唐山师范学院学报, 2024, 46(3): 59-66+77.
[26] 刘瑞祺, 李虎, 王东霞, 等. 图像对抗样本防御技术研究综述[J]. 计算机科学与探索, 2023, 17(12): 2827-2839.
[27] 裘昊轩. 针对图像深度伪造模型的对抗样本生成技术研究[D]: [硕士学位论文]. 北京: 中国人民公安大学, 2023.
[28] 吴涛. 生成式AI的深度伪造攻击与多模态防御体系研究——基于对抗样本与区块链可追溯验证[J]. 中国信息界, 2025(12): 154-156.
[29] Shafer, G. (1976) A Mathematical Theory of Evidence. Princeton University Press.
[30] 孙业超. 基于RESTful API的前后端分离项目接口测试方法研究[J]. 软件, 2025, 46(9): 116-118.
[31] Afchar, D., Nozick, V., Yamagishi, J. and Echizen, I. (2018) Mesonet: A Compact Facial Video Forgery Detection Network. 2018 IEEE International Workshop on Information Forensics and Security (WIFS), Hong Kong, 11-13 December 2018, 1-7. [Google Scholar] [CrossRef
[32] Qian, Y., Yin, G., Sheng, L., Chen, Z. and Shao, J. (2020) Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues. In: Vedaldi, A., et al., Eds., Computer VisionECCV 2020, Springer International Publishing, 86-103. [Google Scholar] [CrossRef
[33] Jin, X., Wu, N., Jiang, Q., Kou, Y., Duan, H., Wang, P., et al. (2024) A Dual Descriptor Combined with Frequency Domain Reconstruction Learning for Face Forgery Detection in Deepfake Videos. Forensic Science International: Digital Investigation, 49, Article ID: 301747. [Google Scholar] [CrossRef
[34] Lin, K., Han, W., Li, S., Gu, Z., Zhao, H., Ren, J., et al. (2022) IR-Capsule: Two-Stream Network for Face Forgery Detection. Cognitive Computation, 15, 13-22. [Google Scholar] [CrossRef