轨道交通低能见度场景图像增强系统研究
Research on Image Enhancement System of Low Visibility Scene in Rail Transit
摘要: 目标检测、语义分割等视觉任务应用于轨道交通的众多场景,大多数视觉系统被设计为在清晰的环境中执行,然而真实的轨道交通场景必然会包含退化的图像场景,列车一年四季在外遇到的恶劣的天气,以及面对隧道、夜间等环境中较差的照明,这些退化的图像会降低高级视觉任务的性能。本文关注于在恶劣天气(雾天,降雨)和弱光条件导致的能见度低的场景下进行图像增强,改善退化图像质量,为此,本文提出了一种轨道交通低能见度场景图像增强系统,可以自适应地对天气和照度进行分类并对分类后得到的低照度、雾天和雨天这三种场景图像进行增强。将该系统运用于轨道交通场景下的多目标检测以及语义分割应用中,实验结果表明,对于低照度图像以及雾天图像,本文系统可以提升多目标检测及语义分割约1%的准确率,对于雨天图像,本文系统可以提升多目标检测及语义分割4%以上的准确率。
Abstract: Visual tasks such as object detection and semantic segmentation are applied to many scenes in rail transit. Most visual systems are designed to execute in clear environments. However, real rail transit scenes inevitably contain degraded image scenes, such as the harsh weather encountered by trains outside all year round, as well as poor lighting in environments such as tunnels and nighttime. These degraded images can reduce the performance of advanced visual tasks. This article focuses on image enhancement in scenes with low visibility caused by severe weather (foggy, rainy) and weak light conditions to improve degraded image quality. For this reason, this paper proposes a low visibility scene image enhancement system for rail transit, which can adaptively classify weather and illumination and enhance the classified images of low illumination, foggy, and rainy scenes. Applying this system to multi-object detection and semantic segmentation applications in rail transit scenarios, experimental results show that the system can improve the accuracy of multi-object detection and semantic segmentation by about 1% for low illumination images and foggy images. For rainy images, the system can improve the accuracy of multi-object detection and semantic segmentation by more than 4%.
文章引用:袁小军, 李晨, 刘昕武, 田野, 姚巍巍. 轨道交通低能见度场景图像增强系统研究[J]. 图像与信号处理, 2023, 12(3): 302-316. https://doi.org/10.12677/JISP.2023.123030

参考文献

[1] 姚巍巍, 田野, 李晨. 基于改进SOLO的列车主动避障视觉算法研究[J]. 电子测量技术, 2022, 45(9): 133-139.
[2] 叶崧, 杨万扣. 改进的基于大气光幕单幅图像去雾算法[J]. 电子测量技术, 2016, 39(3): 50-55.
[3] Yang, W., Yuan, Y., Ren, W., et al. (2020) Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study. IEEE Transactions on Image Processing, 29, 5737-5752.
[Google Scholar] [CrossRef
[4] Kwon, D., Kim, G. and Kwon, J. (2020) DALE: Dark Region-Aware Low-Light Image Enhancement. The 31st British Machine Vision Conference, Manchester, 7-11 September 2020, page.
[5] Gharbi, M., Chen, J.W., Barron, J.T., Hasinoff, S.W. and Durand, F. (2017) Deep Bilateral Learning for Real-Time Image Enhancement. ACM Transactions on Graphics, 36, 1-12.
[Google Scholar] [CrossRef
[6] Dong, Y., Liu, Y.H., Zhang, H., Chen, S.F. and Qiao, Y. (2020) FD-GAN: Generative Adversarial Networks with Fusion-Discriminator for Single Image Dehazing. Proceedings of the AAAI Conference on Artificial Intelligence, 34, 10729-10736.
[Google Scholar] [CrossRef
[7] Zhang, H. and Patel, V.M. (2018) Densely Connected Pyramid Dehazing Network. 2018 IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 3194-3203.
[Google Scholar] [CrossRef
[8] Wang, H., Yue, Z., Xie, Q., et al. (2021) From Rain Generation to Rain Removal. 2021 IEEE Conference on Computer Vision and Pattern Recognition, Nashville, 20-25 June 2021, 14786-14796.
[Google Scholar] [CrossRef
[9] Fu, X.Y., Liang, B.R., Huang, Y., Ding, X.H. and Paisley, J. (2020) Lightweight Pyramid Networks for Image Deraining. IEEE Transactions on Neural Networks and Learning Systems, 31, 1794-1807.
[Google Scholar] [CrossRef
[10] Li, T. and Asari, V.K. (2005) Adaptive and Integrated Neighborhood-Dependent Approach for Nonlinear Enhancement of Color Images. Journal of Electronic Imaging, 14, Article ID: 043006.
[Google Scholar] [CrossRef
[11] Rahman, Z., Jobson, D.J. and Woodell, G.A. (1996) Multiscaleretinex for Color Image Enhancement. Proceedings of 3rd IEEE International Conference on Image Processing, Lausanne, 9-19 September 1996, 1003-1006.
[12] He, K., Sun, J. and Tang, X. (2011) Single Image Haze Removal Using Dark Channel Prior. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 2341-2353.
[Google Scholar] [CrossRef
[13] Qin, X., Wang, Z.L., Bai, Y.C., Xie, X.D. and Jia, H.Z. (2019) FFA-Net: Feature Fusion Attention Network for Single Image Dehazing. Proceedings of the AAAI Conference on Artificial Intelligence, 34, 11908-11915.
[Google Scholar] [CrossRef
[14] Wei, W., Yi, L., Xie, Q., et al. (2017) Should We Encode Rain Streaks in Video as Deterministic or Stochastic? Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2535-2544.
[Google Scholar] [CrossRef
[15] Ren, D.W., Zuo, W.M., Hu, Q.H., Zhu, P.F. and Meng, D.Y. (2019) Progressive Image Deraining Networks: A Better and Simpler Baseline. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 3932-3941.
[Google Scholar] [CrossRef
[16] Tian, Y. and Yokota, Y. (2019) Estimating the Major Cluster by Mean-Shift with Updating Kernel. Mathematics, 7, Article 771.
[Google Scholar] [CrossRef
[17] Xie, S., Girshick, R., Dollár, P., Tu, Z.W. and He, K.M. (2017) Aggregated Residual Transformations for Deep Neural Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 5987-5995.
[Google Scholar] [CrossRef
[18] He, K.M., Zhang, X.Y., Ren, S.Q. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778.
[Google Scholar] [CrossRef
[19] Szegedy, C., Liu, W., Jia, Y., et al. (2014) Going Deeper with Convolutions. 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 1-9.
[Google Scholar] [CrossRef
[20] Simonyan, K. and Zisserman, A. (2014) Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv: 1409.1556.
[21] Boch Kovskiy, A., Wang, C.Y., Liao, H. and Yuan, H. (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv: 2004.10934.
[22] Chen, L.C., Zhu, Y.K., Papandreou, G., Schroff, F. and Adam, H. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer Vision—ECCV 2018, Springer, Cham, 833-851.
[Google Scholar] [CrossRef