改进YOLOv8的晶体熔接阶段图像视觉识别检测方法
Improving the Visual Recognition and Detection Method for Crystal Fusion Stage Images in YOLOv8
摘要: 直拉硅晶体生长熔接流程中生长界面温度检测是保障后续引晶成功的重要任务。现有的目标检测模型对熔接凸点目标检测存在定位不准确,误检率高等问题,本文提出了一种改进了基于特征增强的YOLOv8算法。首先,针对熔接光圈图像中小目标容易出现误检和漏检的常见问题,引入BiFPN的思想对YOLOv8m中的颈部部分进行改进。为了进一步提升检测精度,在特征融合网络中采用了更轻量的动态上采样算子DySample,以提高融合特征的质量和丰富度。在工业提供的数据集上评估了YOLOv8-A模型,实验结果表明,与原来算法相比,YOLOv8-A的参数量和计算量分别减少至2.19 × 10^7,同时实现了98.2%的mAP,对小目标的检测提升了5.8个百分点。通过与其它主流目标检测算法比较,验证了该方法的有效性和优越性。
Abstract: The detection of the growth interface temperature in the Czochralski process for silicon crystal growth welding process is considered an important task for ensuring the successful subsequent crystal pulling. Existing target detection models exhibit issues such as inaccurate localization and high false detection rates in the detection of welding bump targets. An improved YOLOv8 algorithm based on feature enhancement is proposed in this paper. Firstly, to address the common problems of false and missed detections of small targets in welding aperture images, the concept of BiFPN is introduced to enhance the neck part of YOLOv8m. To further improve detection accuracy, a more lightweight dynamic upsampling operator, DySample, is utilized in the feature fusion network to enhance the quality and richness of the fused features. The YOLOv8-A model is evaluated on an industrially provided dataset, and experimental results indicate that, compared to the original algorithm, the parameter count and computational load of YOLOv8-A are reduced to 2.19 × 10^7, while achieving a 98.2% mAP and improving small target detection by 5.8 percentage points. The effectiveness and superiority of this method are validated through comparisons with other mainstream target detection algorithms.
文章引用:李振成, 李桐, 胡涛, 张自主, 王海欣. 改进YOLOv8的晶体熔接阶段图像视觉识别检测方法[J]. 计算机科学与应用, 2026, 16(1): 72-86. https://doi.org/10.12677/csa.2026.161007

参考文献

[1] 王正省, 任永生, 马文会, 等. 直拉法单晶硅生长原理、工艺及展望[J]. 材料导报, 2024, 38(9): 5-17.
[2] 马亚苹. 直拉法单晶硅生长原理及工艺分析[J]. 中国粉体工业, 2023(1): 29-32.
[3] 刘赟, 薛忠营, 魏星, 李炜. 直拉单晶硅晶体缺陷研究进展[J]. 微纳电子与智能制造, 2022, 4(1): 64-74.
[4] Soori, M., Arezoo, B. and Dastres, R. (2023) Artificial Intelligence, Machine Learning and Deep Learning in Advanced Robotics, a Review. Cognitive Robotics, 3, 54-70. [Google Scholar] [CrossRef
[5] Yang, L. et al. (2023) An Improving Faster-RCNN with Multi-Attention ResNet for Small Target Detection in Intelligent Autonomous Transport with 6G. IEEE Transactions on Intelligent Transportation Systems, 24, 7717-7725. [Google Scholar] [CrossRef
[6] 邵延华, 张铎, 楚红雨, 等. 基于深度学习的YOLO目标检测综述[J]. 电子与信息学报, 2022, 44(10): 3697-3708.
[7] Li, H., Li, J., Wei, H., Liu, Z., Zhan, Z. and Ren, Q. (2022) Slim-Neck by GSConv: A Better Design Paradigm of Detector Architectures for Autonomous Vehicles. arXiv: 2206. 02424.
[8] Zhang, Q., Zhang, H. and Lu, X. (2022) Adaptive Feature Fusion for Small Object Detection. Applied Sciences, 12, Article 11854. [Google Scholar] [CrossRef
[9] 张姝瑾, 许兴时, 邓洪兴, 等. 基于YOLOv8n-seg-FCA-BiFPN的奶牛身体分割方法[J]. 农业机械学报, 2024, 55(3): 282-289+391.
[10] Zhang, M., Wang, Z., Song, W., Zhao, D. and Zhao, H. (2024) Efficient Small-Object Detection in Underwater Images Using the Enhanced YOLOv8 Network. Applied Sciences, 14, Article 1095. [Google Scholar] [CrossRef
[11] 赵其杰, 程德富, 卢建霞, 等. 基于视觉的单晶生长直径检测与监控[J]. 制造业自动化, 2011, 33(7): 22-25.
[12] 赵跃, 王欣. 基于光圈模式识别的直拉晶体生长温度测量技术研究[J]. 传感技术学报, 2018, 31(4): 573-578.
[13] 杨舟, 程莹, 张诗婧, 等. 基于改进YOLOv5算法的直拉法单晶硅位错检测模型研究[J]. 应用光学, 2023, 44(5): 1022-1029.
[14] Wang, H., Qian, H., Feng, S. and Wang, W. (2024) L-SSD: Lightweight SSD Target Detection Based on Depth-Separable Convolution. Journal of Real-Time Image Processing, 21, Article No. 33. [Google Scholar] [CrossRef
[15] Luo, F., Dai, Y., Fuentes, J., Ding, W. and Zhang, X. (2024) M-DETR: Multi-Scale DETR for Optical Music Recognition. Expert Systems with Applications, 249, Article 123664. [Google Scholar] [CrossRef
[16] Liu, B., Wang, H., Wang, Y., Zhou, C. and Cai, L. (2023) Lane Line Type Recognition Based on Improved YOLOv5. Applied Sciences, 13, Article 10537. [Google Scholar] [CrossRef
[17] Yang, Z., Xie, R., Liu, L. and Li, N. (2024) Dense-YOLOv7: Improved Real-Time Insulator Detection Framework Based on YOLOv7. International Journal of Low-Carbon Technologies, 19, 157-170. [Google Scholar] [CrossRef
[18] Wang, X. and Liu, J. (2024) Vegetable Disease Detection Using an Improved YOLOv8 Algorithm in the Greenhouse Plant Environment. Scientific Reports, 14, Article No. 4261. [Google Scholar] [CrossRef] [PubMed]
[19] Yu, X., Yu, Q., Mu, Q., Hu, Z. and Xie, J. (2023) MCAW-YOLO: An Efficient Detection Model for Ceramic Tile Surface Defects. Applied Sciences, 13, Article 12057. [Google Scholar] [CrossRef
[20] Zhao, X. and Song, Y. (2023) Improved Ship Detection with YOLOv8 Enhanced with MobileViT and GSConv. Electronics, 12, Article 4666. [Google Scholar] [CrossRef
[21] Rybczak, M. and Kozakiewicz, K. (2024) Deep Machine Learning of MobileNet, Efficient, and Inception Models. Algorithms, 17, Article 96. [Google Scholar] [CrossRef
[22] Ma, N., Zhang, X., Zheng, H. and Sun, J. (2018) Shufflenet V2: Practical Guidelines for Efficient CNN Architecture Design. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Lecture Notes in Computer Science, Springer International Publishing, 122-138. [Google Scholar] [CrossRef
[23] Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C. and Xu, C. (2020) GhostNet: More Features from Cheap Operations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, 13-19 June 2020, 1580-1589.
[24] Han, W., He, N., Wang, X., Sun, F. and Liu, S. (2023) IDPD: Improved Deformable-DETR for Crowd Pedestrian Detection. Signal, Image and Video Processing, 18, 2243-2253. [Google Scholar] [CrossRef
[25] Li, N., Ye, T., Zhou, Z., Gao, C. and Zhang, P. (2024) Enhanced YOLOv8 with BiFPN-SimAM for Precise Defect Detection in Miniature Capacitors. Applied Sciences, 14, Article 429. [Google Scholar] [CrossRef
[26] Chiley, V., Thangarasa, V., Gupta, A., Samar, A., Hestness, J. and DeCoste, D. (2023) RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network. arXiv:2206.14098.
[27] Chen, W.-Y., Zhao, H.-C., Liu, P., Fang, J. and Sun, H. (2024) Vehicle Detection Algorithm Based on Improved YOLOv3. Control and Decision, 39, 1151-1159.
[28] Wang, J., Li, Q., Fang, Z., Zhou, X., Tang, Z., Han, Y., et al. (2023) YOLOv6-ESG: A Lightweight Seafood Detection Method. Journal of Marine Science and Engineering, 11, Article 1623. [Google Scholar] [CrossRef