ResNet和EfficientNet遥感图像场景分类研究
Study on ResNet and EfficientNet Remote Sensing Image Scene Classification
DOI: 10.12677/CSA.2022.125130, PDF,  被引量    国家自然科学基金支持
作者: 江 文:无锡商业职业技术学院,江苏 无锡
关键词: 遥感图像分类ResNet模型EfficientNet模型Ranger优化器Remote Sensing Image Classification ResNet Model EfficientNet Model Ranger Optimizer
摘要: 针对遥感图像分类问题,首先采用了ResNet模型和EfficientNet模型进行训练,其中,前者采用18层网络结构,并在测试集得到了59.8%的准确度;后者采用了EfficientNet-B0和EfficientNet-B1模型,最高分类精度为92.6%。在此基础上,使用Ranger优化器替换了SGD方法,虽然测试集精度与原方法近似,但使用Ranger优化器训练EfficientNet模型具有更强的稳定性和更快的训练速度。最后,对测试的结果进行了分析,列举出每一类场景的分类精度,分析并总结了改进方法。
Abstract: The ResNet and EfficientNet are used to classify the remote sensing image. Among ResNet and Effi-cientNet, the former uses an 18-layer network structure and has a classification accuracy of 59.8% in the test set; the latter uses EfficientNet-B0 and EfficientNet-B1, with the highest classification accuracy of 92.6%. After that, we replace the SGD optimizer with Ranger optimizer. Although the new classification accuracy is similar to the original one, using the Ranger optimizer to train the EfficientNet model has stronger stability and faster training speed. Finally, we analyzed the test results, listed the classification accuracy of each type of scene, analyzed and summarized the improvement methods.
文章引用:江文. ResNet和EfficientNet遥感图像场景分类研究[J]. 计算机科学与应用, 2022, 12(5): 1301-1313. https://doi.org/10.12677/CSA.2022.125130

参考文献

[1] 姬腾飞. 基于卷积神经网络的遥感图像场景分类研究[D]: [硕士学位论文]. 开封: 河南大学, 2019.
[2] 刘源. 基于卷积神经网络的卫星图像场景分类技术研究[D]: [硕士学位论文]. 西安: 西安电子科技大学, 2020.
[3] 钱晓亮, 李佳, 程塨, 等. 特征提取策略对高分辨率遥感图像场景分类性能影响的评估[J]. 遥感学报, 2018, 22(5): 758-776.
[4] 朱圣, 邓继忠, 张亚莉, 等. 基于无人机低空遥感的水稻田间杂草分布图研究[J]. 华南农业大学学报, 2020, 41(6): 67-74.
[5] 鞠洪润, 左丽君, 张增祥, 等. 中国土地利用空间格局刻画方法研究[J]. 地理学报, 2020, 75(1): 143-159.
[6] 彭晏飞, 宋晓男, 訾玲玲, 等. 基于卷积神经网络和改进模糊C均值的遥感图像检索[J]. 激光与光电子学进展, 2018, 55(9): 173-182.
[7] 张顺, 龚怡宏, 王进军. 深度卷积神经网络的发展及其在计算机视觉领域的应用[J]. 计算机学报, 2019, 42(3): 453-482.
[8] 奚雪峰, 周国栋. 面向自然语言处理的深度学习研究[J]. 自动化学报, 2016, 42(10): 1445-1465.
[9] 余萍, 曹洁. 深度学习在故障诊断与预测中的应用[J]. 计算机工程与应用, 2020, 56(3): 1-18.
[10] 李小薪, 梁荣华. 有遮挡人脸识别综述: 从子空间回归到深度学习[J]. 计算机学报, 2018, 41(1): 177-207.
[11] 赵志成, 罗泽, 王鹏彦, 等. 基于深度残差网络图像分类算法研究综述[J]. 计算机系统应用, 2020, 29(1): 14-21.
[12] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learn-ing for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[13] Tan, M. and Le, Q.V. (2019) Efficient Net: Rethinking Model Scaling for Convolutional Neural Networks.
[14] Zhang, M., Lucas, J., Ba, J., et al. (2019) Lookahead Optimizer: K Steps for-ward, 1 Step Back. 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, 8-14 De-cember 2019, 9597-9608.
[15] Liu, L., Jiang, H., He, P., et al. (2019) On the Variance of the Adaptive Learning Rate and Beyond.
[16] Li, J., Fang, C. and Lin, Z. (2019) Lifted Proximal Operator Machines. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 4181-4188. [Google Scholar] [CrossRef