基于改进轻量级MobileNetV2的辣椒病虫害图像识别
Identification of Pepper Diseases and Pests Based on Improved Lightweight MobileNetV2
摘要: 针对真实环境下辣椒病虫害识别准确率不高以及深度卷积网络参数多、模型内存大等问题,本文提出了一种基于改进MobileNetV2的辣椒病虫害图像识别算法。首先在基线模型的基础上引入通道注意力和空间注意力机制,提升模型对特征信息的敏感度;同时将L2正则化加入到损失函数中,平滑损失函数的梯度,以缓解模型过拟合。最后实验结果表明,改进后模型识别准确率达到94.43%,相较于基线模型,改进后模型精确率提升了4.38%,召回率提升了3.38%,F1值提升了3.88%,同时改进后模型参数量仅为2.43 M,与基线模型相比只增加了约0.2 M。本研究方法在保持较低模型参数量的同时达到了较高的识别准确率,具有一定的优势。
Abstract: To address the challenges of low recognition accuracy for pepper diseases and pests in real-world environments, as well as the large parameter size and high memory consumption of deep convolutional networks, this paper proposes an improved MobileNetV2-based image recognition algorithm for pepper disease and pest identification. First, channel attention and spatial attention mechanisms are introduced into the baseline model to enhance its sensitivity to feature information. Additionally, L2 regularization is incorporated into the loss function to smooth the gradient and mitigate model overfitting. Experimental results demonstrate that the improved model achieves an accuracy of 94.43%, with a 4.38% increase in precision, a 3.38% improvement in recall, and a 3.88% boost in F1-score compared to the baseline model. Furthermore, the enhanced model has only 2.43 M parameters, an increase of merely 0.2 M over the original model. The proposed method maintains a low parameter count while achieving high recognition accuracy, demonstrating significant advantages in practical applications.
文章引用:李艳美. 基于改进轻量级MobileNetV2的辣椒病虫害图像识别[J]. 应用数学进展, 2025, 14(10): 78-88. https://doi.org/10.12677/aam.2025.1410421

参考文献

[1] 张子峰. 我国辣椒产业发展现状、主要挑战与应对之策[J]. 北方园艺, 2023(14): 153-158.
[2] 林巧, 辛竹琳, 孔令博, 等. 我国辣椒产业发展现状及育种应对措施[J]. 中国农业大学学报, 2023, 28(5): 82-95.
[3] Huang, M., Chuang, T. and Liao, Y. (2022) Application of Transfer Learning and Image Augmentation Technology for Tomato Pest Identification. Sustainable Computing: Informatics and Systems, 33, Article 100646. [Google Scholar] [CrossRef
[4] 李玲, 王红, 刘庆生, 等. 基于纹理特征和支持向量机的ALOS图像土地覆被分类[J]. 国土资源遥感, 2011(4): 58-63.
[5] 魏丽冉, 岳峻, 李振波, 等. 基于核函数支持向量机的植物叶部病害多分类检测方法[J]. 农业机械学报, 2017, 48(S1): 166-171.
[6] Han, F., Guan, X. and Xu, M. (2024) Method of Intelligent Agricultural Pest Image Recognition Based on Machine Vision Algorithm. Discover Applied Sciences, 6, Article No. 536. [Google Scholar] [CrossRef
[7] Chen, S., Gao, Q. and He, Y. (2024) A Pest Image Recognition Method for Long-Tail Distribution Problem. Frontiers in Environmental Science, 12, Article ID: 1391770. [Google Scholar] [CrossRef
[8] 武宗渊, 刘振, 张宗明. 人工智能在医学教育领域的现状、未来治理研究[J]. 中国医学伦理学, 2024, 37(9): 1093-1100.
[9] Hirasawa, T., Aoyama, K., Tanimoto, T., Ishihara, S., Shichijo, S., Ozawa, T., et al. (2018) Application of Artificial Intelligence Using a Convolutional Neural Network for Detecting Gastric Cancer in Endoscopic Images. Gastric Cancer, 21, 653-660. [Google Scholar] [CrossRef] [PubMed]
[10] 王刚, 陈红, 马敬玲, 等. 基于多尺度1D-CNN和注意力机制的汇率多步预测研究[J]. 系统工程理论与实践, 2024, 44(6): 1934-1949.
[11] 王美华, 吴振鑫, 周祖光. 基于注意力改进CBAM的农作物病虫害细粒度识别研究[J]. 农业机械学报, 2021, 52(4): 239-247.
[12] Juan, C., Palacio, R., Bunn, C., et al. (2025) Factors Affecting Deep Learning Model Performance in Citizen Science-Based Image Data Collection for Agriculture: A Case Study on Coffee Crops. Computers and Electronics in Agriculture, 232, Article 110096.
[13] Alruwaili, M., Alanazi, S., Abd, S. and Shehab, A. (2019) An Efficient Deep Learning Model for Olive Diseases Detection. International Journal of Advanced Computer Science and Applications, 10, 486-492. [Google Scholar] [CrossRef
[14] 赵法川, 徐晓辉, 宋涛, 等. 融合多头注意力的轻量级作物病虫害识别[J]. 华南农业大学学报, 2023, 44(6): 986-94.
[15] 戴敏, 孙文靖, 缪宏. 基于轻量化CBAM-GoogLeNet的辣椒病虫害识别[J]. 中国农机化学报, 2025, 46(2): 224-229, 252.
[16] Ferentinos, K.P. (2018) Deep Learning Models for Plant Disease Detection and Diagnosis. Computers and Electronics in Agriculture, 145, 311-318. [Google Scholar] [CrossRef
[17] 谢琬, 崔艳荣. 基于改进EfficientNetv2模型的玉米叶片病害识别[J/OL]. 江苏农业科学, 2025, 2025-04-10.
https://link.cnki.net/urlid/32.1214.S.20250409.1406.004, 2025-04-15.
[18] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L. (2018) Mobilenetv2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520. [Google Scholar] [CrossRef
[19] Wang, Z., Zhou, W. and Li, Y. (2025) GFN: A Garbage Classification Fusion Network Incorporating Multiple Attention Mechanisms. Electronics, 14, Article 75. [Google Scholar] [CrossRef
[20] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Biomedical physics & engineering express, Springer International Publishing, 3-19. [Google Scholar] [CrossRef
[21] Perumal, A., Nithiyanantham, J. and Nagaraj, J. (2024) An Improved Alexnet Deep Learning Method for Limb Tumor Cancer Prediction and Detection. Biomedical Physics & Engineering Express, 11, Article 015004. [Google Scholar] [CrossRef] [PubMed]
[22] Simonyan, K. and Zisserman, A. (2015) Very Deep Convolutional Networks for Large-Scale Image Recognition. International Conference on Learning Representations, San Diego, 7-9 May 2015.
[23] Shin, H., Roth, H.R., Gao, M., Lu, L., Xu, Z., Nogues, I., et al. (2016) Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE Transactions on Medical Imaging, 35, 1285-1298. [Google Scholar] [CrossRef] [PubMed]
[24] Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D. and Batra, D. (2020) Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. International Journal of Computer Vision, 128, 336-359. [Google Scholar] [CrossRef