基于Mask R-CNN的近海漂浮垃圾智能识别与清理路径规划系统
Intelligent Recognition and Cleanup Path Planning System for Offshore Floating Debris Based on Mask R-CNN
摘要: 针对近海漂浮垃圾人工清理效率低、成本高和风险大的问题,本文设计了一套基于Mask R-CNN的智能检测与清理路径规划系统。系统采用Roboflow海洋垃圾数据集,包含11类目标、10,000张图像和56,272个标注实例;基于ResNet-50-FPN的Mask R-CNN实现目标检测与实例区域分割,并将清理路径建模为非闭合旅行商问题,采用最近邻贪心算法与2-opt局部搜索进行优化。实验结果表明,模型在验证集上的边界框AP为53.50%,掩码AP为 52.05%;路径规划算法在200个目标规模下耗时约181.8 ms,路径长度缩短约10.5%。同时,本文开发了Web与桌面双模态可视化系统,实现了图像输入、目标检测、路径规划和结果展示的原型化闭环流程,为近海漂浮垃圾智能识别与自动化清理提供了实验基础。
Abstract: To address the low efficiency, high cost, and operational risk of manual cleanup of offshore floating debris, this paper designs an intelligent detection and cleanup path planning system based on Mask R-CNN. The system uses the Roboflow marine debris dataset, containing 11 categories, 10,000 images, and 56,272 annotated instances. A Mask R-CNN model with a ResNet-50-FPN backbone is adopted for object detection and instance-level region segmentation, while the cleanup route is modeled as an open-loop Traveling Salesman Problem and optimized by a nearest-neighbor greedy algorithm combined with 2-opt local search. Experimental results show that the model achieves a bounding box AP of 53.50% and a mask AP of 52.05% on the validation set. For 200 targets, the path planning algorithm takes about 181.8 ms and reduces the route length by about 10.5%. A dual-mode visualization system with Web and desktop clients is also developed, forming a prototype closed-loop workflow of image input, target detection, path planning, and result display, which provides an experimental basis for intelligent recognition and automated cleanup of offshore floating debris.
文章引用:杜佳旺, 曹笑云, 李昀静, 吐鲁娜依·吐尔洪江. 基于Mask R-CNN的近海漂浮垃圾智能识别与清理路径规划系统[J]. 计算机科学与应用, 2026, 16(6): 356-371. https://doi.org/10.12677/csa.2026.166234

参考文献

[1] Jambeck, J.R., Geyer, R., Wilcox, C., Siegler, T.R., Perryman, M., Andrady, A., et al. (2015) Plastic Waste Inputs from Land into the Ocean. Science, 347, 768-771. [Google Scholar] [CrossRef] [PubMed]
[2] Jia, T., Kapelan, Z., de Vries, R., Vriend, P., Peereboom, E.C., Okkerman, I., et al. (2023) Deep Learning for Detecting Macroplastic Litter in Water Bodies: A Review. Water Research, 231, Article 119632. [Google Scholar] [CrossRef] [PubMed]
[3] Fallati, L., Polidori, A., Salvatore, C., Saponari, L., Savini, A. and Galli, P. (2019) Anthropogenic Marine Debris Assessment with Unmanned Aerial Vehicle Imagery and Deep Learning: A Case Study along the Beaches of the Republic of Maldives. Science of the Total Environment, 693, Article 133581. [Google Scholar] [CrossRef] [PubMed]
[4] Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[5] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A. (2016) You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 779-788. [Google Scholar] [CrossRef
[6] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C., et al. (2016) SSD: Single Shot Multibox Detector. In: Lecture Notes in Computer Science, Springer, 21-37. [Google Scholar] [CrossRef
[7] Córdova, M., Pinto, A., Hellevik, C.C., Alaliyat, S.A., Hameed, I.A., Pedrini, H., et al. (2022) Litter Detection with Deep Learning: A Comparative Study. Sensors, 22, Article 548. [Google Scholar] [CrossRef] [PubMed]
[8] Yang, X., Zhao, J., Zhao, L., Zhang, H., Li, L., Ji, Z., et al. (2022) Detection of River Floating Garbage Based on Improved Yolov5. Mathematics, 10, Article 4366. [Google Scholar] [CrossRef
[9] Croes, G.A. (1958) A Method for Solving Traveling-Salesman Problems. Operations Research, 6, 791-812. [Google Scholar] [CrossRef
[10] AIproject (2025) AIProject Dataset. Open Source Dataset. Roboflow Universe. Roboflow.
https://universe.roboflow.com/aiproject-rqjzf/aiproject-ryolj
[11] He, K., Gkioxari, G., Dollar, P. and Girshick, R. (2017) Mask R-CNN. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2961-2969. [Google Scholar] [CrossRef
[12] Wu, Y., Kirillov, A., Massa, F., et al. (2019) Detectron2.
https://github.com/facebookresearch/detectron2
[13] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[14] Lin, T.Y., Dollar, P., Girshick, R., He, K., Hariharan, B. and Belongie, S. (2017) Feature Pyramid Networks for Object Detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 2117-2125. [Google Scholar] [CrossRef
[15] Lin, T.Y., Goyal, P., Girshick, R., He, K. and Dollar, P. (2017) Focal Loss for Dense Object Detection. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef
[16] Wojke, N., Bewley, A. and Paulus, D. (2017) Simple Online and Realtime Tracking with a Deep Association Metric. 2017 IEEE International Conference on Image Processing (ICIP), Beijing, 17-20 September 2017, 3645-3649. [Google Scholar] [CrossRef