基于YOLO系列模型的瓦楞纸箱损伤检测方法研究
Research on YOLO-Series Methods for Corrugated Box Damage Detection
DOI: 10.12677/met.2026.151002, PDF,   
作者: 李 源, 程文娟*, 旷龙祥:北京印刷学院机电工程学院,北京
关键词: 瓦楞纸箱包装损伤检测YOLO端侧部署Corrugated Box Defect Detection YOLO Edge Deployment
摘要: 为应对物流现场中瓦楞纸箱因划痕、破洞与湿渍等造成的包装失效问题,本文构建包含3类典型损伤、共3800张图像的多源数据集(实验室模拟、网络采集、驿站实拍),并在统一训练/评测流程下,对YOLOv3-tiny、YOLOv5n、YOLOv8n、YOLO10n、YOLO11n与YOLO12n六个版本进行系统对比。采用P、R、F1、mAP50与GFLOPs等指标综合评估,并通过混淆矩阵分析不同模型在细粒度损伤上的识别差异。结果显示:YOLOv8n在精度与召回之间取得最优平衡(P = 0.840, R = 0.790, mAP50 = 0.868, F1 = 0.814),适合作为精度优先的工业检测基线;YOLO12n在保持较高精度的同时具备最低计算复杂度(GFLOPs = 6.3, mAP50 = 0.858),更适合端侧实时部署。对比分析还表明,弱纹理/反光干扰场景下“wet”与背景的混淆最为突出,而YOLO10n在“hole”类的召回存在明显下降。本文的贡献在于:(i) 提出覆盖多场景的纸箱损伤数据集与统一评测协议;(ii) 给出跨代YOLO的可复现对比基准;(iii) 提供面向工程落地的模型选型建议。未来工作将聚焦于损伤分级标准化、小样本增强与轻量注意力机制,以进一步提升复杂环境下的鲁棒性与可部署性。
Abstract: To address packaging failures of corrugated boxes caused by scratches, holes, and wet stains in logistics, we curate a 3-class, 3,800-image dataset from laboratory simulation, web crawling, and field acquisition at parcel stations. Under a unified training and evaluation protocol, we benchmark six YOLO variants—YOLOv3-tiny, YOLOv5n, YOLOv8n, YOLO10n, YOLO11n, and YOLO12n. Comprehensive evaluation employing metrics including P, R, F1, mAP50 and GFLOPs, alongside confusion matrix analysis to assess the identification discrepancies among different models regarding fine-grained damage. Results show that YOLOv8n achieves the best accuracy-recall balance (P = 0.840, R = 0.790, mAP50 = 0.868, F1 = 0.814), making it a strong baseline for accuracy-oriented inspection. YOLO12n attains the lowest computational cost (GFLOPs = 6.3) while maintaining competitive accuracy (mAP50 = 0.858), thus favoring real-time edge deployment. We also find noticeable confusion between “wet” and background under weak-texture/reflective conditions, and degraded recall for “hole” with YOLO10n. Our contributions include: (i) a multi-scenario corrugated-box defect dataset with a unified evaluation protocol; (ii) a reproducible cross-generation YOLO benchmark; and (iii) actionable guidance for engineering deployment. Future work will explore standardized severity grading, small-sample augmentation, and lightweight attention to further improve robustness and deployability in the wild.
文章引用:李源, 程文娟, 旷龙祥. 基于YOLO系列模型的瓦楞纸箱损伤检测方法研究[J]. 机械工程与技术, 2026, 15(1): 8-20. https://doi.org/10.12677/met.2026.151002

参考文献

[1] Garbowski, T., Knitter-Piątkowska, A. and Grabski, J.K. (2023) Estimation of the Edge Crush Resistance of Corrugated Board Using Artificial Intelligence. Materials, 16, Article 1631. [Google Scholar] [CrossRef] [PubMed]
[2] Smith, D.S. (2024) UK Shoppers Slam ‘Excessive’ Packaging, with Over Half Receiving Damaged Goods in the Last Year. Business Wire.
[3] Kim, S. and Lee, S. (2024) YOLO-Based Damage Detection with Stylegan3 Data Augmentation for Parcel Information-Recognition System. Computers, Materials & Continua, 80, 195-215. [Google Scholar] [CrossRef
[4] Ma, Y., Yin, J., Huang, F. and Li, Q. (2024) Surface Defect Inspection of Industrial Products with Object Detection Deep Networks: A Systematic Review. Artificial Intelligence Review, 57, Article No. 333. [Google Scholar] [CrossRef
[5] Lowe, D.G. (2004) Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60, 91-110. [Google Scholar] [CrossRef
[6] Jain, A.K. and Farrokhnia, F. (1991) Unsupervised Texture Segmentation Using Gabor Filters. Pattern Recognition, 24, 1167-1186. [Google Scholar] [CrossRef
[7] Ren, S., He, K., Girshick, R. and Sun, J. (2015) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv:1506.01497.
[8] Bochkovskiy, A., Wang, C.Y. and Liao, H.M. (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv:2004.10934.
[9] Wang, C., Bochkovskiy, A. and Liao, H.M. (2023) YOLOV7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 7464-7475. [Google Scholar] [CrossRef
[10] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Lecture Notes in Computer Science, Springer International Publishing, 3-19. [Google Scholar] [CrossRef
[11] Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake, 18-23 June 2018, 7132-7141. [Google Scholar] [CrossRef
[12] Li Y., et al. (2024) TPMN: Texture Prior-Aware Multi-Level Feature Fusion Network for Corrugated Cardboard Parcels Defect Detection. International Journal of Advanced Computer Science and Applications, 15, 834-843.
[13] Holland, S., Tavasoli, M. and Lee, E. (2024). Corrugated Box Damage Classification Using Artificial Neural Network Image Training. Packaging Technology and Science, 37, 685-696.[CrossRef
[14] Ultralytics (2023) Explore YOLOv8 Models. 2023-2024. Ultralytics Docs.
[15] Xing, Y., Han, X., Pan, X., An, D., Liu, W. and Bai, Y. (2024) EMG-YOLO: Road Crack Detection Algorithm for Edge Computing Devices. Frontiers in Neurorobotics, 18, Article ID: 1423738. [Google Scholar] [CrossRef] [PubMed]