基于ResNet网络的垃圾分类模型研究
Research on Garbage Classification Model Based on ResNet Network
DOI: 10.12677/csa.2024.144106, PDF,   
作者: 王世屹:中央民族大学信息工程学院,北京;许静怡:北京林业大学信息学院,北京
关键词: 垃圾分类图像识别深度学习ResNet网络Garbage Classification Image Recognition Deep Learning ResNet Network
摘要: 垃圾分类问题对于环境保护和可持续发展至关重要。传统的垃圾分类模型在处理复杂场景时存在性能瓶颈,而ResNet网络以其深度残差学习结构在图像识别任务中取得了显著的成功。为进一步提升垃圾分类模型的性能,本文在ResNet网络基础上进行了改进,引入了注意力机制和增强的特征提取策略。通过对垃圾分类数据集的训练,验证了改进ResNet网络在垃圾分类准确性和泛化性能上的优势,所提出的模型在不同垃圾类别上均取得了较高的分类准确度,实验结果表明相较于ResNet50,本文提出的垃圾分类模型提高了1.9%的准确率。本研究为垃圾分类问题的解决提供了一种更有效的解决方案,也为其他复杂图像分类问题提供了有益的借鉴。
Abstract: The problem of garbage classification is crucial for environmental protection and sustainable development. Traditional garbage classification models face performance bottlenecks when dealing with complex scenes, while the ResNet network has achieved significant success in image recognition tasks due to its deep residual learning structure. In order to further enhance the performance of garbage classification models, this paper improved upon the ResNet network by introducing attention mechanisms and enhanced feature extraction strategies. Through training on a garbage classification dataset, the improved ResNet network’s advantages in accuracy and generalization performance for garbage classification were verified. The proposed model achieved higher classification accuracy for different garbage categories, demonstrating a 1.9% improvement in accuracy compared to ResNet50. This research provides a more effective solution for the problem of garbage classification and offers beneficial insights for other complex image classification problems.
文章引用:王世屹, 许静怡. 基于ResNet网络的垃圾分类模型研究[J]. 计算机科学与应用, 2024, 14(4): 368-382. https://doi.org/10.12677/csa.2024.144106

参考文献

[1] 刁艳杰. 什么是垃圾分类? [J]. 走向世界, 2019(32): 41.
[2] Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, Lake Tahoe, 1106-1114.
[3] He, K.M., Zhang, X.Y., Ren, S.Q., et al. (2016) Deep Residual Learning for Image Recognition. Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[4] Aral, R.A., Keskin, S.R., Kaya, M., et al. (2018) Classification of TrashNet Dataset Based on Deep Learning Models. IEEE International Conference on Big Data, Seattle, 10-13 December 2018, 2058-2062. [Google Scholar] [CrossRef
[5] Tan, M. and Le, Q.V. (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. International Conference on Machine Learning, Long Beach, 9-15 June 2019, 6105-6114.
[6] Szegedy, C., Liu, W., Jia, Y.Q., et al. (2015) Going deeper with Convolutions. IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 1-9. [Google Scholar] [CrossRef
[7] Lin, T.Y., Goyal, P., Girshick, R., et al. (2017) Focal Loss for Dense Object Detection. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2999-3007. [Google Scholar] [CrossRef
[8] Woo, S., Park, J., Lee, J.Y., et al. (2018) CBAM: Convolutional Block Attention Module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, 8-14 September 2018, 3-19. [Google Scholar] [CrossRef