基于上下文感知调制的高光谱图像深层特征提取网络
Going Deeper with Context-Aware Modulation for Hyperspectral Image Feature Extraction
摘要: 高光谱图像(HSI)由数百个连续窄带组成,具有丰富的光谱–空间信息。然而,波段间高度相关性与复杂依赖关系使得判别性光谱特征的高效建模仍具有挑战。为此,本文提出CAM-HSNet,旨在增强光谱局部特征与长距离依赖的联合建模能力的同时提升参数利用效率。具体而言,我们引入改进的卷积调制模块和卷积前馈网络,对卷积提取的局部光谱特征进行进一步编码,并构建能够捕获长距离依赖并实现信息交互的结构。在获得更具判别力的光谱表征基础上,模型能够以更少的谱维表示完成全局语义汇聚,从而压缩原有局部至全局映射的参数规模,实现参数减少且保持分类性能。实验结果表明,与现有方法相比,所提出的CAM-HSNet在整体精度(OA)、平均精度(AA)和Kappa系数等指标上均取得了更优的分类表现。
Abstract: Hyperspectral images (HSIs) consist of hundreds of contiguous narrow spectral bands and contain rich spectral-spatial information. However, the strong inter-band correlations and complex depen- dencies pose challenges for efficiently modeling discriminative spectral features. To address this issue, we propose CAM-HSNet, which aims to enhance the joint modeling of local spectral features and long-range dependencies while improving parameter efficiency. Specifically, we introduce an improved convolutional modulation module and a convolutional feed-forward network to further encode the locally extracted spectral features and to build structures capable of capturing long-range dependencies and facilitating information interaction. Based on the resulting more discriminative spectral representations, the model is able to achieve global semantic aggregation with fewer spectral dimensions, thereby reducing the parameter scale required for local-to-global mapping while maintaining classification performance. Experimental results demonstrate that, compared with existing methods, the proposed CAM-HSNet achieves superior performance in overall accuracy (OA), average accuracy (AA), and the Kappa coefficient.
文章引用:蔡士威. 基于上下文感知调制的高光谱图像深层特征提取网络[J]. 计算机科学与应用, 2026, 16(1): 215-229. https://doi.org/10.12677/csa.2026.161018

参考文献

[1] Peyghambari, S. and Zhang, Y. (2021) Hyperspectral Remote Sensing in Lithological Mapping, Mineral Exploration, and Environmental Geology: An Updated Review. Journal of Applied Remote Sensing, 15, Article ID: 031501. [Google Scholar] [CrossRef
[2] Zhang, X., Sun, Y., Shang, K., Zhang, L. and Wang, S. (2016) Crop Classification Based on Feature Band Set Construction and Object-Oriented Approach Using Hyperspectral Images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 9, 4117-4128. [Google Scholar] [CrossRef
[3] Karalas, K., Tsagkatakis, G., Zervakis, M. and Tsakalides, P. (2016) Land Classification Using Remotely Sensed Data: Going Multilabel. IEEE Transactions on Geoscience and Remote Sensing, 54, 3548-3563. [Google Scholar] [CrossRef
[4] Melgani, F. and Bruzzone, L. (2004) Classification of Hyperspectral Remote Sensing Images with Support Vector Machines. IEEE Transactions on Geoscience and Remote Sensing, 42, 1778-1790. [Google Scholar] [CrossRef
[5] Gislason, P.O., Benediktsson, J.A. and Sveinsson, J.R. (2006) Random Forests for Land Cover Classification. Pattern Recognition Letters, 27, 294-300. [Google Scholar] [CrossRef
[6] Qiao, T., Yang, Z., Ren, J., Yuen, P., Zhao, H., Sun, G., et al. (2018) Joint Bilateral Filtering and Spectral Similarity-Based Sparse Representation: A Generic Framework for Effective Feature Extraction and Data Classification in Hyperspectral Imaging. Pattern Recognition, 77, 316-328. [Google Scholar] [CrossRef
[7] Hughes, G. (1968) On the Mean Accuracy of Statistical Pattern Recognizers. IEEE Transactions on Information Theory, 14, 55-63. [Google Scholar] [CrossRef
[8] Ben Hamida, A., Benoit, A., Lambert, P. and Ben Amar, C. (2018) 3-D Deep Learning Approach for Remote Sensing Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 56, 4420-4434. [Google Scholar] [CrossRef
[9] Hang, R., Liu, Q., Hong, D. and Ghamisi, P. (2019) Cascaded Recurrent Neural Networks for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 57, 5384-5394. [Google Scholar] [CrossRef
[10] Zou, Y., Fu, Y., Zheng, Y. and Li, W. (2020) CSR-Net: Camera Spectral Response Network for Dimensionality Reduction and Classification in Hyperspectral Imagery. Remote Sensing, 12, Article 3294. [Google Scholar] [CrossRef
[11] Ahmad, M., Shabbir, S., Roy, S.K., Hong, D., Wu, X., Yao, J., et al. (2022) Hyperspectral Image Classification—Traditional to Deep Models: A Survey for Future Prospects. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15, 968-999. [Google Scholar] [CrossRef
[12] Farrell, M.D. and Mersereau, R.M. (2005) On the Impact of PCA Dimension Reduction for Hyperspectral Detection of Difficult Targets. IEEE Geoscience and Remote Sensing Letters, 2, 192-195. [Google Scholar] [CrossRef
[13] Moussaoui, S., Hauksdóttir, H., Schmidt, F., Jutten, C., Chanussot, J., Brie, D., et al. (2008) On the Decomposition of Mars Hyperspectral Data by ICA and Bayesian Positive Source Separation. Neurocomputing, 71, 2194-2208. [Google Scholar] [CrossRef
[14] Hou, B., Wang, K., Yao, M., et al. (2017) Semi-Supervised Spatial-Spectral Discriminant Analysis for Hyperspectral Image Classification. Acta Geodaetica et Cartographica Sinica, 46, 1098-1106.
[15] Jia, S., Zhao, Q., Zhuang, J., Tang, D., Long, Y., Xu, M., et al. (2021) Flexible Gabor-Based Superpixel-Level Unsupervised LDA for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 59, 10394-10409. [Google Scholar] [CrossRef
[16] Fabiyi, S.D., Murray, P., Zabalza, J. and Ren, J. (2021) Folded LDA: Extending the Linear Discriminant Analysis Algorithm for Feature Extraction and Data Reduction in Hyperspectral Remote Sensing. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 14, 12312-12331. [Google Scholar] [CrossRef
[17] Maggiori, E., Tarabalka, Y., Charpiat, G. and Alliez, P. (2017) Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 55, 645-657. [Google Scholar] [CrossRef
[18] Zhu, X.X., Tuia, D., Mou, L., Xia, G., Zhang, L., Xu, F., et al. (2017) Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geoscience and Remote Sensing Magazine, 5, 8-36. [Google Scholar] [CrossRef
[19] Hu, W., Huang, Y., Wei, L., Zhang, F. and Li, H. (2015) Deep Convolutional Neural Networks for Hyperspectral Image Classification. Journal of Sensors, 2015, Article ID: 258619. [Google Scholar] [CrossRef
[20] Lee, H. and Kwon, H. (2017) Going Deeper with Contextual CNN for Hyperspectral Image Classification. IEEE Transactions on Image Processing, 26, 4843-4855. [Google Scholar] [CrossRef] [PubMed]
[21] Shen, Y. (2020) Efficient Deep Learning of Nonlocal Features for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 59, 6029-6043.
[22] Paoletti, M.E., Haut, J.M., Fernandez-Beltran, R., Plaza, J., Plaza, A.J. and Pla, F. (2019) Deep Pyramidal Residual Networks for Spectral-Spatial Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 57, 740-754. [Google Scholar] [CrossRef
[23] He, M., Li, B. and Chen, H. (2017) Multi-Scale 3D Deep Convolutional Neural Network for Hyperspectral Image Classification. 2017 IEEE International Conference on Image Processing (ICIP), Beijing, 17-20 September 2017, 3904-3908. [Google Scholar] [CrossRef
[24] Roy, S.K., Manna, S., Song, T. and Bruzzone, L. (2021) Attention-Based Adaptive Spectral-Spatial Kernel Resnet for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 59, 7831-7843. [Google Scholar] [CrossRef
[25] Luo, W., Li, Y., Urtasun, R., et al. (2016) Understanding the Effective Receptive Field in Deep Convolutional Neural Networks. arXiv: 1701.04128.
[26] Li, X., Wang, W., Hu, X. and Yang, J. (2019) Selective Kernel Networks. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 510-519. [Google Scholar] [CrossRef
[27] Vaswani, A. (2017) Attention Is All You Need. arXiv: 1706.03762.
[28] Hong, D., Han, Z., Yao, J., Gao, L., Zhang, B., Plaza, A., et al. (2022) SpectralFormer: Rethinking Hyperspectral Image Classification with Transformers. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-15. [Google Scholar] [CrossRef
[29] Sun, L., Zhao, G., Zheng, Y. and Wu, Z. (2022) Spectral-Spatial Feature Tokenization Transformer for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-14. [Google Scholar] [CrossRef
[30] Liu, Z., Mao, H., Wu, C., Feichtenhofer, C., Darrell, T. and Xie, S. (2022) A Convnet for the 2020s. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 11966-11976. [Google Scholar] [CrossRef
[31] Vasu, P.K.A., Gabriel, J., Zhu, J., et al. (2023) FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization. arXiv: 2303.14189.
[32] Guo, M.H., Lu, C.Z., Liu, Z.N., et al. (2022) Visual Attention Network. arXiv: 2202.09741.
[33] Hou, Q., Lu, C.Z., Cheng, M.M., et al. (2022) Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition. arXiv: 2211.11943.
[34] Yang, J., Li, C., Dai, X., et al. (2022) Focal Modulation Networks. arXiv: 2203.11926
[35] Li, S., Song, W., Fang, L., Chen, Y., Ghamisi, P. and Benediktsson, J.A. (2019) Deep Learning for Hyperspectral Image Classification: An Overview. IEEE Transactions on Geoscience and Remote Sensing, 57, 6690-6709. [Google Scholar] [CrossRef
[36] Gong, Z., Zhong, P., Yu, Y., Hu, W. and Li, S. (2019) A CNN with Multiscale Convolution and Diversified Metric for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 57, 3599-3618. [Google Scholar] [CrossRef
[37] Zhao, W. and Du, S. (2016) Spectral–spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach. IEEE Transactions on Geoscience and Remote Sensing, 54, 4544-4554. [Google Scholar] [CrossRef
[38] Zhang, H., Li, Y., Zhang, Y. and Shen, Q. (2017) Spectral-Spatial Classification of Hyperspectral Imagery Using a Dual-Channel Convolutional Neural Network. Remote Sensing Letters, 8, 438-447. [Google Scholar] [CrossRef
[39] Chen, Y., Jiang, H., Li, C., Jia, X. and Ghamisi, P. (2016) Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks. IEEE Transactions on Geoscience and Remote Sensing, 54, 6232-6251. [Google Scholar] [CrossRef
[40] Li, Y., Zhang, H. and Shen, Q. (2017) Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network. Remote Sensing, 9, Article 67. [Google Scholar] [CrossRef
[41] Zhong, Z., Li, J., Luo, Z. and Chapman, M. (2018) Spectral-Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework. IEEE Transactions on Geoscience and Remote Sensing, 56, 847-858. [Google Scholar] [CrossRef
[42] Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W. and Hu, Q. (2020) ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 11531-11539. [Google Scholar] [CrossRef
[43] Foody, G.M. (2002) Status of Land Cover Classification Accuracy Assessment. Remote Sensing of Environment, 80, 185-201. [Google Scholar] [CrossRef
[44] Kingma, D.P. and Ba, J.L. (2015) Adam: A Method for Stochastic Optimization. arXiv: 1412.6980.
[45] Smith, L.N. (2018) A Disciplined Approach to Neural Network Hyper-Parameters: Part 1—Learning Rate, Batch Size, Mo-mentum, and Weight Decay. arXiv: 1803.09820.