基于Transformer模型的高光谱图像分类算法研究
Research on Hyperspectral Image Classification Algorithm Based on Transformer Model
DOI: 10.12677/MOS.2024.131077, PDF,    国家自然科学基金支持
作者: 赵尚子欣*, 袁嘉豪, 董 岩:上海理工大学光电信息与计算机工程学院,上海;陈 倩#:上海理工大学管理学院,上海
关键词: 高光谱图像分类Transformer光谱特征Hyperspectral Images Classification Transformer Spectral Features
摘要: 高光谱图像(Hyperspectral image, HSI)分类在遥感领域扮演着关键角色。然而,处理高光谱图像分类任务时,遇到了光谱相同但物质不同、光谱不同但物质相同的复杂情况。尽管现有基于卷积神经网络(Convolutional Neural Network, CNN)的方法在局部信息处理方面表现出色,但它们在表示能力上存在一定限制。为了应对这一挑战,本文提出了一种综合考虑光谱信息和空间信息的Transformer方法(Spatial Spectral Transformer Network, SSTN),即本文引入了Transformer结构,在光谱和空间中倾向于捕捉全局信息。通过构建Transformer,模型将空间光谱特征有机结合。通过规范的实验研究,本文发现:在IndianPines和Houston2013数据集的分类任务中,本文的方法相较于其它Transformer网络表现更为优越,并在与其他骨干网络的对比中具有显著的改进。
Abstract: Hyperspectral image (HSI) classification plays a crucial role in the field of remote sensing. When dealing with HSI classification tasks, there are complex situations where the spectra are the same but the substances are different, and the spectra are different but the substances are the same. Although existing methods based on Convolutional Neural Networks (CNN) perform well in local in-formation processing, they have certain limitations in representation ability. To address this chal-lenge, a Transformer method that comprehensively considers spectral and spatial information (Spatial Spectral Transformer Network, SSTN) has been proposed. To overcome this limitation, this article introduces a Transformer structure that tends to capture global information in both spec-trum and space. By constructing a Transformer, the model organically combines spatial spectral features. The rigorous experimental results show that in the classification task of the Indian Pines and Houston2013 datasets, our method performs better than other Transformer networks and shows significant improvement compared to other backbone networks.
文章引用:赵尚子欣, 袁嘉豪, 董岩, 陈倩. 基于Transformer模型的高光谱图像分类算法研究[J]. 建模与仿真, 2024, 13(1): 799-806. https://doi.org/10.12677/MOS.2024.131077

参考文献

[1] Carrino, T., Crósta, A., Toledo, C. and Silva, A. (2018) Hyperspectral Remotesensing Applied to Mineral Exploration in Southern Peru: A Multiple Dataintegration Approach in the Chapi Chiara Gold Prospect. International Journal of Applied Earth Observation and Geoinformation, 64, 287-300. [Google Scholar] [CrossRef
[2] Camino, C., Gon-zález-Dugo, V., Hernández, P., Sillero, J. and Tejada, P. (2018) Improved Nitrogen Retrievals with Airborne-Derived Fluo-rescence and Planttraits Quantified from VNIR-SWIR Hyperspectral Imagery in the Context of Precision Agriculture. Interna-tional Journal of Applied Earth Observation and Geoinformation, 70, 105-117. [Google Scholar] [CrossRef
[3] Stuart, M.B., McGonigle, A.J.S. and Willmott, J.R. (2019) Hyperspectral Imaging in Environmental Monitoring: A Review of Recent Developments and Technological Advances in Compact Field De-ployable Systems. Sensors, 19, 3071. [Google Scholar] [CrossRef] [PubMed]
[4] Kanthi, M., Sarma, T.H. and Bindu, C.S. (2020) A 3D-Deep CNN Based Feature Extraction and Hyperspectral Image Classification. 2020 IEEE India Geoscience and Remote Sensing Symposium (InGARSS), Ahmedabad, 1-4 December 2020, 229-232. [Google Scholar] [CrossRef
[5] Xu, Z., Yu, H., Zheng, K., Gao, L. and Song, M. (2021) A Novel Classification Framework for Hyperspectral Image Classification Based on Multiscale Spectral-Spatial Convolutional Network. 2021 11th Workshop on Hyperspectral Imaging and Signal Processing: Evolution in Remote Sensing (WHISPERS), Amsterdam, 24-26 March 2021, 1-5. [Google Scholar] [CrossRef
[6] Firat, H. and Hanbay, D. (2021) Classification of Hy-perspectral Images Using 3D CNN Based ResNet50. 2021 29th Signal Processing and Communications Applications Con-ference (SIU), Istanbul, 9-11 June 2021, 1-4. [Google Scholar] [CrossRef
[7] Chhapariya, K., Buddhiraju, K.M. and Kumar, A. (2022) Spec-tral-Spatial Classification of Hyperspectral Images with Multi-Level Cnn. 2022 12th Workshop on Hyperspectral Imaging and Signal Processing: Evolution in Remote Sensing (WHISPERS), Rome, 13-16 September 2022, 1-5. [Google Scholar] [CrossRef
[8] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., et al. (2020) An Image Is Worth 16x16 Words: Transformers for Image Recognitionat Scale. arXiv: 2010.11929, 2020.
[9] He, X., Chen, Y. and Lin, Z. (2021) Spatial-Spectral Transformer for Hyperspectral Image Classification. Remote Sensing, 13, 498. [Google Scholar] [CrossRef
[10] Simonyan, K. and Zisserman, A. (2014) Very Deep Convo-lutional Networks Forlarge-Scale Image Recognition. arXiv: 1409.1556, 2014.
[11] Qing, Y., Liu, W., Feng, L. and Gao, W. (2021) Improved Transformer Net for Hyperspectral Image Classification. Remote Sensing, 13, 2216. [Google Scholar] [CrossRef
[12] Hong, D., Han, Z., Yao, J., Gao, L., Zhang, B., et al. (2022) SpectralFormer: Rethinking Hyperspectral Image Classification With Transformers. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-15. [Google Scholar] [CrossRef