基于ASF-WIoU-YOLOv8的无人机航拍图像多目标检测算法
Multi-Object Detection Algorithm for UAV Aerial Images Based on ASF-WIoU-YOLOv8
DOI: 10.12677/csa.2024.145122, PDF,    科研立项经费支持
作者: 殷 波:贵州交通职业技术学院机械电子工程系,贵州 贵阳
关键词: 目标检测YOLOv8注意尺度序列融合Wise-IoUTarget Detection YOLOv8 Attention-Scale Sequence Fusion Wise-IoU
摘要: 针对无人机航拍图像的多目标检测问题,本文提出了一种基于改进YOLOv8的目标检测算法ASF-WIoU-YOLOv8。首先,在YOLOv8的基础架构上,加入一种注意尺度序列融合机制(Attentional Scale Sequence Fusion-ASF),该机制能够对不同尺度的特征图进行融合,从而获得更好的图像特征,提取出更丰富、更准确的特征信息。然后,对损失函数进行改进,引入Wise-IoU机制,该机制通过自适应地调整权重系数提高目标检测的灵活性和鲁棒性,从而进一步提高算法的检测精度。实验结果表明,在VisDrone数据集上,本文所提算法比YOLOv8算法的平均精度mAP50提升了2.0%,该算法在无人机航拍图像上具有更高的检测精度。
Abstract: For the problem of multi-object detection in UAV aerial images, this paper presents an improved YOLOv8 object detection algorithm named ASF-WIoU-YOLOv8. Firstly, on the infrastructure of YOLOv8, an attention-scale sequence fusion mechanism (Attentional Scale Sequence Fusion-ASF) is added, which can integrate feature maps at different scales, so as to obtain better image features and extract richer and more accurate feature information. Then, the loss function is improved by using the Wise-IoU mechanism, which improves the flexibility and robustness of target detection by adaptively adjusting the weight coefficients. Wise-IoU can further improve the detection accuracy of the algorithm. The experimental results show that the average accuracy of the proposed algorithm is 2.0% higher than the YOLOv8 algorithm, which has higher detection accuracy on aerial images of UAV.
文章引用:殷波. 基于ASF-WIoU-YOLOv8的无人机航拍图像多目标检测算法[J]. 计算机科学与应用, 2024, 14(5): 137-146. https://doi.org/10.12677/csa.2024.145122

参考文献

[1] Chai, X.H., Hu, Y., Lei, Y.L., et al. (2019) Review of UAV Intelligent Measurement and Control Technology. Radio Engineering, 49, 855-860. [Google Scholar] [CrossRef
[2] Surmann, H., Worst, R., Buschmann, T., et al. (2019) Integration of UAVs in Urban Search and Rescue Missions. 2019 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Würzburg, 02-04 September 2019. [Google Scholar] [CrossRef
[3] Girshick, R., Donahue, J., Darrell, T., et al. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014. [Google Scholar] [CrossRef
[4] Girshick, R. (2015) Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 07-13 December 2015. [Google Scholar] [CrossRef
[5] Ren, S., He, K., Girshick, R., et al. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef
[6] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y. and Berg, A.C. (2016) SSD: Single Shot MultiBox Detector. Computer VisionECCV 2016, 21-37. [Google Scholar] [CrossRef
[7] Redmon, J., Divvala, S., Girshick, R., et al. (2016) You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016. [Google Scholar] [CrossRef
[8] Wang, H., et al. (2022) A UAV-Based Energy-Efficient and Real-Time Object Detection System with Multi-Source Image Fusion. Journal of Circuits, Systems and Computers, 31, Article 2250166. [Google Scholar] [CrossRef
[9] Fan, Y., Li, O. and Liu, G. (2022) An Object Detection Algorithm for Rotary-Wing UAV Based on AWin Transformer. IEEE Access, 10, 13139-13150. [Google Scholar] [CrossRef
[10] 郭业才, 孙京东, Amitave, S. 基于YOLOv5改进的航拍图像目标检测算法[J/OL]. 系统仿真学报: 1-14. 2024-04-09.[CrossRef
[11] Zhao, L.L. and Zhu, M.L. (2023) Ms-Yolov7: Yolov7 Based on Multi-Scale for Object Detection on UAV Aerial Photography. Drones, 7, Article No. 188. [Google Scholar] [CrossRef
[12] Tang, F., Yang, F. and Tian, X. (2023) Long-Distance Person Detection Based on Yolov7. Electronics, 12, Article No. 1502. [Google Scholar] [CrossRef
[13] Kang, M., Ting, C.M., Ting, F., et al. (2023) ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation. Image and Vision Computing, 147, Article 105057. [Google Scholar] [CrossRef
[14] Tong, Z., Chen, Y., Xu, Z., et al. (2023) Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism. [Google Scholar] [CrossRef