基于深度残差网络的水面漂浮物检测
Detection of Floating Objects on Water Surface Based on Depth Re-Sidual Network
DOI: 10.12677/jisp.2025.143028, PDF,    科研立项经费支持
作者: 张德民, 陈玉兰, 聂方彦*:贵州商学院计算机与信息工程学院,贵州 贵阳
关键词: 漂浮物检测卷积神经网络深度学习残差网络Floating Object Detection Convolutional Neural Network Deep Learning Residual Network
摘要: 水面漂浮物作为水体污染的重要来源,严重威胁着水域生态安全和人类健康。针对现有检测方法在复杂水面场景下存在特征提取不足和小样本适应性弱的问题,本研究提出了一种改进的深度残差网络检测框架。首先构建自主采集的多源水面漂浮物数据集,通过自适应图像裁剪和归一化处理消除光照波动与尺度差异的干扰;继而基于ResNet-18架构设计分层优化策略,冻结浅层网络参数保留通用特征表征能力,通过微调深层网络增强域适应特征提取,并构建复合分类头提升小样本学习性能。针对数据集规模限制,创新性融合多维度协调数据增强策略与类别平衡损失函数,通过在线增强扩充数据多样性并缓解类别不均衡问题。实验表明,该方法在自建数据集上取得98.71%的检测准确率,较基线模型提升9.68个百分点,且在不同光照条件和漂浮物密度的测试场景中均保持稳定性能。研究系统验证了从数据预处理、网络架构优化到训练策略设计的全流程方案有效性,提出的迁移学习框架在保证模型轻量化的同时显著提升检测鲁棒性,为构建智能化水面污染监测系统提供了可靠的技术方案。
Abstract: As a significant source of water pollution, floating objects on water surfaces pose severe threats to aquatic ecological security and human health. To address the limitations of existing detection methods, such as insufficient feature extraction in complex aquatic scenarios and weak adaptability to small-sample conditions, this study proposes an improved deep residual network-based detection framework. Firstly, a self-collected multi-source dataset of floating objects was constructed, where adaptive image cropping and normalization processing were applied to mitigate interference from illumination fluctuations and scale variations. Subsequently, based on the ResNet-18 architecture, a hierarchical optimization strategy was designed: shallow network parameters were frozen to retain general feature representation capabilities, while deep networks were fine-tuned to enhance domain-adaptive feature extraction, coupled with a composite classification head to improve small-sample learning performance. To overcome dataset size constraints, a novel integration of multi-dimensional coordinated data augmentation strategies and class-balanced loss functions was implemented, expanding data diversity through online augmentation and alleviating class imbalance issues. Experimental results demonstrate that the proposed method achieves a detection accuracy of 98.71% on the self-built dataset, representing an improvement of 9.68 percentage points over the baseline model, while maintaining stable performance across varying illumination conditions and floating object densities. The study systematically validates the effectiveness of the full-process solution encompassing data preprocessing, network architecture optimization, and training strategy design. The proposed transfer learning framework significantly enhances detection robustness while ensuring model lightweight design, providing a reliable technical solution for intelligent water pollution monitoring systems.
文章引用:张德民, 陈玉兰, 聂方彦. 基于深度残差网络的水面漂浮物检测[J]. 图像与信号处理, 2025, 14(3): 311-324. https://doi.org/10.12677/jisp.2025.143028

参考文献

[1] 闫则明, 陈旭超, 王永才, 等. 人工智能在腐蚀图像检测与分析中的应用[J]. 涂料工业, 2025, 55(3): 7-12.
[2] 李志明, 蒋舒颖, 向凡, 等. 融合动态数据增强与MACA的皮肤病分类方法[J]. 计算机仿真, 2024, 41(12): 264-269+437.
[3] 田永波, 易军凯. 图像隐写的大数据分析方法[J]. 北京信息科技大学学报(自然科学版), 2024, 39(5): 81-87.
[4] 李玉洁, 马子航, 王艺甫, 等. 视觉Transformer(ViT)发展综述[J]. 计算机科学, 2025, 52(1): 194-209.
[5] 张卫东, 王中, 郝志斌, 等. 基于Box-Cox变换的海洋物种图像增强方法[J/OL]. 海军航空大学学报, 1-8.
http://kns.cnki.net/kcms/detail/37.1537.v.20250225.1310.004.html, 2025-02-26.
[6] 康金龙, 许涛, 张华岭, 等. 改进的自监督生成对抗网络算法在视频生成中的应用[J]. 信息记录材料, 2024, 25(8): 144-146.
[7] 刘耿焕, 曾祥津, 豆嘉真, 等. 基于深度学习的小目标检测技术研究进展(特邀) [J]. 红外与激光工程, 2024, 53(9): 196-228.
[8] 李小波. 基于深度学习的复杂场景下口罩佩戴检测方法研究[D]: [硕士学位论文]. 西宁: 青海大学, 2023.
[9] 吴承旭, 胡靖. 基于对称视觉注意力网络的高效图像超分辨率重建[J]. 软件导刊, 2025, 24(1): 109-114.
[10] 田有文, 覃上声, 闫玉博, 等. 基于改进YOLOv8的田间复杂环境下蓝莓成熟度检测[J]. 农业工程学报, 2024, 40(16): 153-162.