解释性陷阱:目标检测模型中性能与解释可靠性研究
Explainability Traps: A Study on the Relationship between Object Detection Model Performance and Explanation Reliability
DOI: 10.12677/airr.2025.146134, PDF,   
作者: 叶 阳:西华大学汽车测控与安全四川省重点实验室,四川 成都
关键词: Faster R-CNN可解释性解释性陷阱Grad-CAMSmoothGrad-IGFaster R-CNN Interpretability Interpretative Traps Grad-CAM SmoothGrad-IG
摘要: 目标检测模型在自动驾驶等安全关键领域广泛应用,其决策过程的可解释性对系统可靠性至关重要。当前研究重点主要集中在如何提高模型性能上,对于模型的可解释性以及解释质量与决策可靠性之间的内在关联关注较少。鉴于这一问题,本研究系统探究了目标检测模型性能与解释可靠性之间的关系。基于KITTI数据集,采用三种Faster R-CNN变体在简单、中等和困难三类场景下进行实验,通过Grad-CAM和SmoothGrad-IG两种解释方法,结合新提出的Energy-based Pointing Game和Performance-Explanation Correlation指标进行量化评估。结果表明:性能最佳的ResNet50_Epoch20在简单场景中PEC值为−0.189,揭示了“解释性陷阱”现象——高置信度预测反而伴随低质量解释;Grad-CAM生成的热力图分散且模型间差异显著,而SmoothGrad-IG产生的热力图高度收敛且模型间一致性高;简单场景中的解释可靠性问题最为严重,与直觉相反。
Abstract: Target detection model is widely used in key safety fields such as automatic driving, and the interpretability of its decision-making process is very important for system reliability. The current research focuses on how to improve the performance of the model, and pays less attention to the interpretability of the model and the internal relationship between the interpretation quality and decision reliability. In view of this problem, this study systematically explores the relationship between the performance of target detection model and interpretation reliability. Based on the Kitti data set, three fast R-CNN variants were used to carry out experiments in simple, medium and difficult scenarios. The two interpretation methods of grad cam and smooth grad Ig were combined with the newly proposed energy based pointing game and performance explanation correlation indicators for quantitative evaluation. The results show that the PEC value of resnet50_poch20 with the best performance is −0.189 in simple scenarios, which reveals the phenomenon of “interpretative trap”—high confidence prediction is accompanied by low quality interpretation; The thermal maps generated by grad cam are scattered and have significant differences among models, while the thermal maps generated by smooth grad IG are highly convergent and have high consistency among models; The problem of interpretation reliability is the most serious in simple scenarios, which is contrary to intuition.
文章引用:叶阳. 解释性陷阱:目标检测模型中性能与解释可靠性研究[J]. 人工智能与机器人研究, 2025, 14(6): 1433-1443. https://doi.org/10.12677/airr.2025.146134

参考文献

[1] Peng, Y. (2023) Deep Learning for 3D Object Detection and Tracking in Autonomous Driving: A Brief Survey.
[2] Mirzaie, M. and Rosenhahn, B. (2025) Interpretable Decision-Making for End-to-End Autonomous Driving.
[3] 董文轩, 梁宏涛, 刘国柱, 等. 深度卷积应用于目标检测算法综述[J]. 计算机科学与探索, 2022, 16(5): 1025-1042.
[4] Girshick, R., Donahue, J., Darrell, T. and Malik, J. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. [Google Scholar] [CrossRef
[5] Ren, S., He, K., Girshick, R., et al. (2016) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149.
[6] He, K., Gkioxari, G., Dollar, P. and Girshick, R. (2017) Mask R-CNN. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef
[7] Cai, Z. and Vasconcelos, N. (2018) Cascade R-CNN: Delving into High Quality Object Detection. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 6154-6162. [Google Scholar] [CrossRef
[8] Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W. and Lin, D. (2019) Libra R-CNN: Towards Balanced Learning for Object Detection. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 821-830. [Google Scholar] [CrossRef
[9] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A. (2016) You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[10] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., et al. (2016) SSD: Single Shot Multibox Detector. In: Lecture Notes in Computer Science, Springer, 21-37. [Google Scholar] [CrossRef
[11] Lin, T., Goyal, P., Girshick, R., He, K. and Dollar, P. (2017) Focal Loss for Dense Object Detection. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2999-3007. [Google Scholar] [CrossRef
[12] Tian, Z., Shen, C., Chen, H. and He, T. (2019) FCOS: Fully Convolutional One-Stage Object Detection. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 9626-9635. [Google Scholar] [CrossRef
[13] Atakishiyev, S., Salameh, M. and Goebel, R. (2025) Safety Implications of Explainable Artificial Intelligence in End-to-End Autonomous Driving. IEEE Transactions on Intelligent Transportation Systems, 1-20. [Google Scholar] [CrossRef
[14] Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D. and Batra, D. (2017) Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 618-626. [Google Scholar] [CrossRef
[15] Wang, H., Wang, Z., Du, M., Yang, F., Zhang, Z., Ding, S., et al. (2020) Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, 14-19 June 2020, 111-119. [Google Scholar] [CrossRef
[16] Ribeiro, M.T., Singh, S. and Guestrin, C. (2016). “Why Should I Trust You?”. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 1135-1144.[CrossRef
[17] Lundberg, S.M. and Lee, S.I. (2017) A Unified Approach to Interpreting Model Predictions. Advances in Neural Information Processing Systems, 30, 4768-4777.
[18] Sundararajan, M., Taly, A. and Yan, Q. (2016) Gradients of Counterfactuals.
https://arxiv.org/abs/1611.02639
[19] Smilkov, D., Thorat, N., Kim, B., et al. (2017) Smooth Grad: Removing Noise by Adding Noise.
https://arxiv.org/abs/1706.03825
[20] Badithela, A., Wongpiromsarn, T. and Murray, R.M. (2022) Evaluation Metrics for Object Detection for Autonomous Systems.
https://arxiv.org/abs/2210.10298
[21] Jiang, P.T., Zhang, C.B., Hou, Q., et al. (2021) Layercam: Exploring Hierarchical Class Activation Maps for Localization. IEEE Transactions on Image Processing, 30, 5875-5888.
[22] Li, X., Chen, Z., Zhang, J.M., Sarro, F., Zhang, Y. and Liu, X. (2025) Bias behind the Wheel: Fairness Testing of Autonomous Driving Systems. ACM Transactions on Software Engineering and Methodology, 34, 1-24. [Google Scholar] [CrossRef
[23] Zhang, J., Bargal, S.A., Lin, Z., Brandt, J., Shen, X. and Sclaroff, S. (2017) Top-Down Neural Attention by Excitation Backprop. International Journal of Computer Vision, 126, 1084-1102. [Google Scholar] [CrossRef