基于注意力机制的遥感目标检测方法研究与实现
Research and Implementation of Remote Sensing Object Detection Method Based on Attention Mechanism
DOI: 10.12677/csa.2025.1511300, PDF,   
作者: 李聪慧:西京学院计算机学院,陕西 西安
关键词: 目标检测注意力机制YOLOv8遥感图像Target Detection Attention Mechanism YOLOv8 Remote Sensing Imagery
摘要: 遥感目标检测在环境监测、城市规划等领域具有重要应用价值,但由于遥感图像中目标尺度变化大、背景复杂以及特征提取不够鲁棒等问题,传统的检测方法难以满足实际需求。本文旨在研究一种基于注意力机制的遥感目标检测方法,以提升检测精度和鲁棒性,为遥感图像分析提供更高效的技术支持。本文提出了一种基于YOLOv8改进的遥感目标检测模型YOLO-Shuffle-MSDA,通过引入ShuffleAttention模块和多尺度特征融合模块(MSDA)来增强特征表达能力。ShuffleAttention模块通过注意力机制优化特征提取,有效区分目标与复杂背景;MSDA模块则通过多尺度特征融合,提升模型对多尺度目标的检测能力。通过多组对比实验和可视化检测结果可以看出,本文方法有效提升了检测精度,减少了误检和漏检。此外,本文设计并实现了完整的遥感目标检测系统,包括模型训练、验证和测试模块,并开发了用户友好的演示验证系统,支持单张图像、批量图像、视频以及实时摄像头检测。
Abstract: Remote sensing target detection has important application values in monitoring, urban planning and other fields, but due to the large change in target scale, complex background and insufficiently robust feature extraction in remote sensing images, detection is difficult to actually require. This paper aims to study a remote sensing object detection based on attention mechanism to improve detection accuracy and robustness and provide more efficient technical support for remote sensing images. This paper proposes a remote sensing object detection model YOLO-Shuffle-MSDA based on YOLOv8, which enhances feature expression capabilities by introducing the ShuffleAttention module and the multi-scale feature fusion module (MSDA). The ShuffleAttention module optimizes feature extraction through attention mechanism to effectively distinguish targets from complex backgrounds; the MSDA module improves the model’s detection ability of multi-scale targets through multi-scale feature fusion. Through multiple sets of comparison experiments and visual inspection results, it can be seen that this paper effectively improves detection accuracy, false detection and missed detection. In addition, this paper designs and implements a complete remote sensing object detection system, including model training, verification and testing modules, and a user-friendly demonstration verification system, supporting single images, batch images, videos and real-time camera detection.
文章引用:李聪慧. 基于注意力机制的遥感目标检测方法研究与实现[J]. 计算机科学与应用, 2025, 15(11): 234-246. https://doi.org/10.12677/csa.2025.1511300

参考文献

[1] Girshick, R., Donahue, J., Darrell, T. and Malik, J. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Seg-mentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 24-27 June 2014, 580-587. [Google Scholar] [CrossRef
[2] Cao, Y., Wang, J., Jin, Y., Wu, T., Chen, K., Liu, Z. and Lin, D. (2021) Few-Shot Object Detection via Association and Discrimination. NeurIPS 2021, 6-14 December 2021, 16570-16581.
[3] Girshick, R. (2015) Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 7-13 December 2015, 1440-1448. [Google Scholar] [CrossRef
[4] Cai, Z. and Vasconcelos, N. (2018) Cascade R-CNN: Delving into High Quality Ob-ject Detection. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 6154-6162. [Google Scholar] [CrossRef
[5] 胡惠娟, 秦一锋, 徐鹤, 等. 面向无人机航拍图像的YOLOv8目标检测改进算法[J]. 计算机科学, 2025, 52(4): 202-211. http://kns.cnki.net/kcms/detail/50.1075.TP.20240925.1400.013.html, 2024-09-28.
[6] Liu, W., Anguelov, D., Erhan, D., et al. (2015) SSD: Single Shot MultiBox Detector.
[7] Gong, H., Mu, T., Li, Q., Dai, H., Li, C., He, Z., et al. (2022) Swin-Transformer-Enabled Yolov5 with Attention Mechanism for Small Object Detection on Satellite Images. Remote Sensing, 14, Article No. 2861. [Google Scholar] [CrossRef
[8] Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., et al. (2021) Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 9992-10002. [Google Scholar] [CrossRef
[9] Wu, W., Liu, H., Li, L., Long, Y., Wang, X., Wang, Z., et al. (2021) Application of Local Fully Convolutional Neural Network Combined with YOLOv5 Algorithm in Small Target Detection of Remote Sensing Image. PLOS ONE, 16, e0259283. [Google Scholar] [CrossRef
[10] 李坤亚, 欧鸥, 刘广滨, 等. 改进YOLOv5的遥感图像目标检测算法[J]. 计算机工程与应用, 2023, 59(9): 207-214.
[11] 王建军, 魏江, 梅少辉, 王健. 面向遥感图像小目标检测的改进YOLOv3算法[J]. 计算机工程与应用, 2021, 57(20): 133-141.
[12] Wang, H. and Han, J. (2023) Improved Military Equipment Identification Algorithm Based on YOLOv5 Framework. 2023 IEEE 6th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chongqing, 24-26 February 2023, 1195-1199. [Google Scholar] [CrossRef
[13] Shi, P., Jiang, Q., Shi, C., Xi, J., Tao, G., Zhang, S., et al. (2021) Oil Well Detection via Large-Scale and High-Resolution Remote Sensing Images Based on Improved YOLOv4. Remote Sensing, 13, Article No. 3243. [Google Scholar] [CrossRef
[14] Zhang, X., Wang, T., Qi, J., Lu, H. and Wang, G. (2018) Progressive Attention Guided Recurrent Network for Salient Object Detection. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 714-722. [Google Scholar] [CrossRef
[15] Chen, H. and Li, Y. (2019) Three-Stream Atten-tion-Aware Network for RGB-D Salient Object Detection. IEEE Transactions on Image Processing, 28, 2825-2835. [Google Scholar] [CrossRef