基于暗光环境的实时目标检测算法研究与应用
Research and Application of Real-Time Object Detection Algorithm Based on Low-Light Environments
DOI: 10.12677/mos.2025.148567, PDF,   
作者: 葛岩松, 付东翔:上海理工大学光电信息与计算机工程学院,上海
关键词: 暗光图像增强机器视觉边缘计算平台Low-Light Image Enhancement Machine Vision Edge Computing Platform
摘要: 为了提升视觉任务在复杂暗光环境中的鲁棒性,并满足边缘计算平台对实时目标检测的需求,本文提出了一种改进的轻量化暗光图像处理网络(RPC-YOLO)。该网络前端采用基于Retinexformer改进的RetinexIGTNet对暗光图像进行快速处理,并结合YOLOv5n实现高效的端到端目标检测。此外,模型引入部分卷积(C3-fasterblock)减少计算冗余,跳过无用区域,显著降低参数量并提升推理速度。同时,设计级联注意力机制(CGA)增强特征提取精度和计算效率。实验表明,改进后的模型在公开数据集上的mAP值较原YOLOv5n提升了23%,参数量减少12万,真实场景检测速度(FPS)达到44.65,满足边缘计算平台的实时性需求。该模型为暗光环境下的边缘计算任务(如视觉引导的机械臂抓取)提供了更高效的解决方案。
Abstract: In order to enhance the robustness of visual tasks in complex low-light environments and enable to enhance the robustness of visual tasks in complex low-light environments and meet the real-time object detection requirements of edge computing platforms, this paper proposes an improved lightweight low-light image processing network (RPC-YOLO). The front end of the network uses the RetinexIGTNet, which is based on improvements to the Retinexformer, to quickly process low-light images and combines with YOLOv5n to achieve efficient end-to-end object detection. In addition, the model introduces partial convolution (C3-fasterblock) to reduce computational redundancy by skipping irrelevant regions, significantly reducing the number of parameters and improving inference speed. Meanwhile, a cascaded group attention mechanism (CGA) is designed to enhance the precision and computational efficiency of feature extraction. Experiments show that the improved model has increased the mAP value on public datasets by 23% compared with the original YOLOv5n, reduced the number of parameters by 120,000, and achieved a detection speed (FPS) of 44.65 in real scenes, meeting the real-time requirements of edge computing platforms. This model provides a more efficient solution for edge computing tasks in low-light environments (such as visually guided robotic arm grasping).
文章引用:葛岩松, 付东翔. 基于暗光环境的实时目标检测算法研究与应用[J]. 建模与仿真, 2025, 14(8): 286-296. https://doi.org/10.12677/mos.2025.148567

参考文献

[1] Lore, K.G., Akintayo, A. and Sarkar, S. (2017) LLNet: A Deep Autoencoder Approach to Natural Low-Light Image Enhancement. Pattern Recognition, 61, 650-662. [Google Scholar] [CrossRef
[2] 冯欣, 王思平, 张智先, 焦晓宁, 薛明龙. 一种面向机器视觉感知的暗光图像增强网络[J]. 计算机应用研究, 2024, 41(6): 1910-1915.
[3] Wei, C., Wang, W.J., Yang, W.H. and Liu, J.Y. (2018) Deep Retinex Decomposition for Low-Light Enhancement. British Machine Vision Conference 2018, BMVC 2018, Newcastle, 3-6 September 2018, 1-12.
[4] Zhang, Y., Guo, X., Ma, J., Liu, W. and Zhang, J. (2021) Beyond Brightening Low-Light Images. International Journal of Computer Vision, 129, 1013-1037. [Google Scholar] [CrossRef
[5] Zhang, Y.H., Zhang, J.W. and Guo, X.J. (2019) Kindling the Darkness: A Practical Low-Light Image Enhancer. Proceedings of the 27th ACM International Conference on Multimedia, Nice, 21-25 October 2019, 1632-1640. [Google Scholar] [CrossRef
[6] Cai, Y., Bian, H., Lin, J., Wang, H., Timofte, R. and Zhang, Y. (2023) Retinexformer: One-Stage Retinex-Based Transformer for Low-Light Image Enhancement. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 2-6 October 2023, 12504-12513. [Google Scholar] [CrossRef
[7] Yin, X., Yu, Z., Fei, Z., Lv, W. and Gao, X. (2023) PE-YOLO: Pyramid Enhancement Network for Dark Object Detection. In: Iliadis, L., et al., Eds., International Conference on Artificial Neural Networks, Springer, 163-174. [Google Scholar] [CrossRef
[8] Ma, L., Ma, T., Liu, R., Fan, X. and Luo, Z. (2022) Toward Fast, Flexible, and Robust Low-Light Image Enhancement. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 5637-5646. [Google Scholar] [CrossRef
[9] 李子姝, 谢人超, 孙礼, 等. 移动边缘计算综述[J]. 电信科学, 2018, 34(1): 87-101.
[10] Girshick, R., Donahue, J., Darrell, T. and Malik, J. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. [Google Scholar] [CrossRef
[11] Girshick, R. (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, Santiago, 7-13 December 2015, 1440-1448. [Google Scholar] [CrossRef
[12] Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[13] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., et al. (2016) SSD: Single Shot Multibox Detector. Computer Vision-ECCV 2016: 14th European Conference, Amsterdam, 11-14 October 2016, 21-37. [Google Scholar] [CrossRef
[14] Jocher, G., Chaurasia, A., Stoken, A., et al. (2022) Ultralytics/yolov5: v6.1—TensorRT, TensorFlow Edge TPU and OpenVINO Export and Inference. Zenodo.
[15] Cui, Z., Li, K., Gu, L., et al. (2022) You Only Need 90k Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction.
[16] Loh, Y.P. and Chan, C.S. (2019) Getting to Know Low-Light Images with the Exclusively Dark Dataset. Computer Vision and Image Understanding, 178, 30-42. [Google Scholar] [CrossRef
[17] Xu, X., Wang, R., Fu, C. and Jia, J. (2022) SNR-Aware Low-Light Image Enhancement. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 17714-17724. [Google Scholar] [CrossRef
[18] Wei, C., Wang, W., Yang, W. and Liu, J. (2018) Deep Retinex Decomposition for Low-Light Enhancement.