光学遥感图像显著目标检测的边缘增强轻量型模型探究
Exploration of a Lightweight Model for Edge Enhancement for Salient Object Detection in Optical Remote Sensing Images
摘要: 光学遥感图像(ORSI)场景复杂且目标尺度多变,现有显著目标检测(SOD)模型存在参数量大、计算成本高或边缘信息利用不足导致目标边缘模糊等问题。针对上述缺陷,提出一种边缘引导的轻量型注意力网络模型(EGLANet),实现光学遥感图像显著目标的高精度、轻量化检测。该模型以MobileNet V3为骨干网络生成五级初级特征图,先通过由跨层相关性模块(CLC)和多尺度并行卷积构成的边缘多尺度注意力模块(EMSAM)提取四级边缘特征图与精细边缘图,再利用CLC、融合1 × k/k × 1分组卷积的多尺度注意力模块(MSAM)结合边缘引导模块(EGM),将边缘信息融入目标检测过程以精准定位显著目标;损失函数采用二值交叉熵(BCE)与交并比(IoU)损失的组合形式,同时约束边缘提取与目标检测任务。在公开光学遥感图像显著目标检测数据集EORSSD上开展对比实验,选取10种先进模型作为对比对象,从定性与定量角度验证模型性能。实验结果表明,EGLANet实现了参数量仅为1.75 M,显著目标检测效果却优于其它模型,在保证检测精度的同时提升计算效率,为光学遥感图像显著目标检测提供了一种高效的轻量化解决方案。
Abstract: Optical Remote Sensing Images (ORSI) feature complex scenes and variable target scales. Existing Salient Object Detection (SOD) models suffer from issues such as large parameter count, high computational cost, or insufficient utilization of edge information, leading to blurred target edges. To address these shortcomings, we propose an Edge-Guided Lightweight Attention Network (EGLANet) to achieve high-precision and lightweight detection of salient objects in optical remote sensing images. This model utilizes MobileNet V3 as the backbone network to generate five-level primary feature maps. It first extracts four-level edge feature maps and fine edge maps through an Edge Multi-Scale Attention Module (EMSAM) composed of a Cross-Layer Correlation (CLC) module and multi-scale parallel convolution. Then, it integrates edge information into the target detection process using a combination of CLC, a multi-scale attention module (MSAM) that fuses 1 × k/k × 1 grouped convolution, and an Edge Guidance Module (EGM) to accurately locate salient objects. The loss function combines Binary Cross Entropy (BCE) and Intersection over Union (IoU) losses, simultaneously constraining edge extraction and target detection tasks. Comparative experiments were conducted on the public optical remote sensing image salient object detection dataset EORSSD, selecting 10 advanced models as benchmarks to verify the model performance from both qualitative and quantitative perspectives. The experimental results show that EGLANet achieves superior salient object detection performance with only 1.75 M parameters, improving computational efficiency while maintaining detection accuracy, providing an efficient and lightweight solution for salient object detection in optical remote sensing images.
文章引用:全彬彬. 光学遥感图像显著目标检测的边缘增强轻量型模型探究[J]. 应用数学进展, 2026, 15(4): 100-109. https://doi.org/10.12677/aam.2026.154140

参考文献

[1] Wang, W., Lai, Q., Fu, H., Shen, J., Ling, H. and Yang, R. (2022) Salient Object Detection in the Deep Learning Era: An In-Depth Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 3239-3259. [Google Scholar] [CrossRef] [PubMed]
[2] Cong, R., Lei, J., Fu, H., et al. (2018) Review of Visual Saliency Detection with Comprehensive Information.
[3] Borji, A., Cheng, M., Hou, Q., Jiang, H. and Li, J. (2019) Salient Object Detection: A Survey. Computational Visual Media, 5, 117-150. [Google Scholar] [CrossRef
[4] Simonyan, K. and Zisserman, A. (2015) Very Deep Convolutional Networks for Large-Scale Image Recognition. 3rd International Conference on Learning Representations, ICLR 2015, San Diego, 7-9 May 2015, 1-14.
[5] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[6] Ahsan, U., Madhok, R. and Essa, I. (2019) Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition. 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, 7-11 January 2019, 179-189. [Google Scholar] [CrossRef
[7] Wang, L., Lu, H. and Yang, M. (2018) Constrained Superpixel Tracking. IEEE Transactions on Cybernetics, 48, 1030-1041. [Google Scholar] [CrossRef] [PubMed]
[8] Wang, D., Lu, H. and Bo, C. (2015) Visual Tracking via Weighted Local Cosine Similarity. IEEE Transactions on Cybernetics, 45, 1838-1850. [Google Scholar] [CrossRef] [PubMed]
[9] Li, X., Han, Z., Wang, L. and Lu, H. (2016) Visual Tracking via Random Walks on Graph Model. IEEE Transactions on Cybernetics, 46, 2144-2155. [Google Scholar] [CrossRef] [PubMed]
[10] Liu, Z., Zhao, D., Shi, Z. and Jiang, Z. (2019) Unsupervised Saliency Model with Color Markov Chain for Oil Tank Detection. Remote Sensing, 11, Article No. 1089. [Google Scholar] [CrossRef
[11] Li, C., Cong, R., Hou, J., Zhang, S., Qian, Y. and Kwong, S. (2019) Nested Network with Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing, 57, 9156-9166. [Google Scholar] [CrossRef
[12] Zhang, Q., Cong, R., Li, C., Cheng, M., Fang, Y., Cao, X., et al. (2021) Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images. IEEE Transactions on Image Processing, 30, 1305-1317. [Google Scholar] [CrossRef] [PubMed]
[13] Zhou, X., Shen, K., Weng, L., Cong, R., Zheng, B., Zhang, J., et al. (2023) Edge-Guided Recurrent Positioning Network for Salient Object Detection in Optical Remote Sensing Images. IEEE Transactions on Cybernetics, 53, 539-552. [Google Scholar] [CrossRef] [PubMed]
[14] Li, G., Liu, Z., Bai, Z., Lin, W. and Ling, H. (2022) Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-12. [Google Scholar] [CrossRef
[15] Luo, H., Wang, J. and Liang, B. (2024) Spatial Attention Feedback Iteration for Lightweight Salient Object Detection in Optical Remote Sensing Images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 17, 13809-13823. [Google Scholar] [CrossRef
[16] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L. (2018) MobileNetV2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 4510-4520. [Google Scholar] [CrossRef
[17] Howard, A., Sandler, M., Chen, B., Wang, W., Chen, L., Tan, M., et al. (2019) Searching for MobileNetV3. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27-28 October 2019, 1314-1324. [Google Scholar] [CrossRef
[18] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. Computer VisionECCV 2018 15th European Conference, Munich, 8-14 September 2018, 3-19. [Google Scholar] [CrossRef
[19] Li, G., Liu, Z., Lin, W. and Ling, H. (2022) Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-13. [Google Scholar] [CrossRef
[20] Li, G., Liu, Z., Chen, M., Bai, Z., Lin, W. and Ling, H. (2021) Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection. IEEE Transactions on Image Processing, 30, 3528-3542. [Google Scholar] [CrossRef] [PubMed]
[21] Qin, X., Zhang, Z., Huang, C., Gao, C., Dehghan, M. and Jagersand, M. (2019) Basnet: Boundary-Aware Salient Object Detection. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 16-20 June 2019, 7479-7489. [Google Scholar] [CrossRef
[22] Zhang, Q., Zhang, L., Shi, W. and Liu, Y. (2018) Airport Extraction via Complementary Saliency Analysis and Saliency-Oriented Active Contour Model. IEEE Geoscience and Remote Sensing Letters, 15, 1085-1089. [Google Scholar] [CrossRef
[23] Zhang, L., Liu, Y. and Zhang, J. (2019) Saliency Detection Based on Self-Adaptive Multiple Feature Fusion for Remote Sensing Images. International Journal of Remote Sensing, 40, 8270-8297. [Google Scholar] [CrossRef
[24] Gao, S., Tan, Y., Cheng, M., Lu, C., Chen, Y. and Yan, S. (2020) Highly Efficient Salient Object Detection with 100K Parameters. Computer VisionECCV 2020 16th European Conference, Glasgow, 23-28 August 2020, 702-721. [Google Scholar] [CrossRef
[25] Liu, Y., Zhang, X., Bian, J., Zhang, L. and Cheng, M. (2021) Samnet: Stereoscopically Attentive Multi-Scale Network for Lightweight Salient Object Detection. IEEE Transactions on Image Processing, 30, 3804-3814. [Google Scholar] [CrossRef] [PubMed]