雨雾环境下的行人检测基准
A Benchmark for Pedestrian Detection in Rain and Fog
摘要: 随着针对雨雾天气行人目标检测研究的深入,现有数据集的局限性日益凸显:一方面,现有真实数据集存在严重的标注偏差,仅标注清晰可见的行人目标而忽略模糊实例,导致模型评估结果虚高;另一方面,现有合成数据集虽样本数量多,但其模拟雾效与真实场景存在显著缺陷。为此,本研究重新界定检测目标与评估维度,并且构建了新型真实雨雾行人检测数据集,其包含了2400张图像,通过数据清洗与重新标注。重点优化了低分辨率、小目标及复杂场景下的标注质量,解决了现有公开数据集在雨雾场景下样本不足、标注不精确等问题。实验结果表明,该数据集能够有效评估行人检测模型在复杂气象条件下的检测性能,为雨雾天气行人目标检测任务提供了更贴近实际的实验基准。
Abstract: As research into pedestrian target detection in rainy and foggy weather has deepened, the limitations of existing datasets have become increasingly apparent: on the one hand, existing real-world datasets suffer from severe annotation bias, as they only label clearly visible pedestrian targets while ignoring blurry instances, leading to inflated model evaluation results; on the other hand, while existing synthetic datasets have a large number of samples, their simulated fog effects exhibit significant deficiencies compared to real-world scenarios. To address these issues, this study redefines the detection targets and evaluation criteria and constructs a new real-world rain and fog pedestrian detection dataset comprising 2400 images, achieved through data cleaning and re-labeling. The dataset prioritizes improving annotation quality in low-resolution, small-object, and complex-scenario conditions, addressing the limitations of existing public datasets, such as insufficient samples and imprecise annotations in rain and fog scenarios. Experimental results demonstrate that this dataset can effectively evaluate the detection performance of pedestrian detection models under complex weather conditions, providing a more realistic experimental benchmark for pedestrian target detection tasks in rain and fog weather conditions.
文章引用:张伟航, 郑凯东. 雨雾环境下的行人检测基准[J]. 计算机科学与应用, 2025, 15(8): 168-181. https://doi.org/10.12677/csa.2025.158207

参考文献

[1] Li, C. (2017) Pedestrian Detection Based on Deep Learning. eScholarship.
[2] Dalal, N. and Triggs, B. (2005) Histograms of Oriented Gradients for Human Detection. IEEE Computer Society Conference on Computer Vision Pattern Recognition, San Diego, 21-23 September 2005.
[3] Felzenszwalb, P.F., Girshick, R.B., McAllester, D. and Ramanan, D. (2010) Object Detection with Discriminatively Trained Part-Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32, 1627-1645. [Google Scholar] [CrossRef] [PubMed]
[4] Ren, S.Q., He, K.M., Girshick, R. and Sun, J.A. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[5] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A. (2016) You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[6] Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A. and Zagoruyko, S. (2020) End-to-End Object Detection with Transformers. In: Lecture Notes in Computer Science, Springer, 213-229. [Google Scholar] [CrossRef
[7] Dixit, I.A. and Bhoite, S. (2024) Analysis of Performance of YOLOv8 Algorithm for Pedestrian Detection. 2024 9th International Conference on Communication and Electronics Systems (ICCES), Coimbatore, 16-18 December 2024, 1918-1924.
[8] Shi, X.L. and Song, A.J. (2024) Defog YOLO for Road Object Detection in Foggy Weather. The Computer Journal, 67, 3115-3127. [Google Scholar] [CrossRef
[9] Zhang, X., Song, H., Wan, F. and Yang, X. (2022) A Pedestrian Detection Method Based on Improved YOLOv5s. 2022 International Conference on Cloud Computing, Big Data Applications and Software Engineering (CBASE), Suzhou, 23-25 September 2022, 197-201. [Google Scholar] [CrossRef
[10] Sakaridis, C., Dai, D.X. and Van Gool, L. (2017) Semantic Foggy Scene Understanding with Synthetic Data. International Journal of Computer Vision, 126, 973-992. [Google Scholar] [CrossRef
[11] Pour, B.S., Jozani, H.M. and Shokouhi, S.B. (2024) Al-YOLO: Accurate and Lightweight Vehicle and Pedestrian Detector in Foggy Weather. 2024 14th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, 19-20 November 2024, 131-136.
[12] Sakaridis, C., Wang, H.R., Li, K., Zurbrügg, R., et al. (2021) ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene Perception. CVF Open Access.
[13] Li, B.Y., Ren, W.Q., Fu, D.P., Tao, D.C., et al. (2017) Benchmarking Single Image Dehazing and beyond. IEEE Transactions on Image Processing, 28, 492-505.
[14] Li, B.Y., Ren, W.Q., Fu, D.P., Tao, D.C. and Wang, Z.Y. (2017) Reside: A Benchmark for Single Image Dehazing.
[15] Ancuti, C.O., Ancuti, C., Sbert, M. and Timofte, R. (2019) Dense-Haze: A Benchmark for Image Dehazing with Dense-Haze and Haze-Free Images. 2019 IEEE International Conference on Image Processing (ICIP), Taipei, 22-25 September 2019, 1014-1018. [Google Scholar] [CrossRef
[16] Ancuti, C.O., Ancuti, C. and Timofte, R. (2020) NH-HAZE: An Image Dehazing Benchmark with Non-Homogeneous Hazy and Haze-Free Images. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, 14-19 June 2020, 1798-1805. [Google Scholar] [CrossRef
[17] Wang, C.Y., Yeh, I.H., Yuan H. and Liao, M. (2025) YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information. In: Lecture Notes in Computer Science, Vol. 15089, Springer, 1-21.
[18] Alif, M.A.R. and Hussain, M. (2025) YOLOv12: A Breakdown of the Key Architectural Features.
[19] Gurbindo, U., Brando, A., Abella, J. and Knig, C. (2025) Object Detection in Adverse Weather Conditions for Autonomous Vehicles Using Instruct Pix2Pix.
[20] Ding, Y.X., Zhang, M., Pan, J., Hu, J.X. and Luo, X.W. (2024) Robust Object Detection in Extreme Construction Conditions. Automation in Construction, 165, Article 105487. [Google Scholar] [CrossRef