基于KPConv的全局上下文感知点云语义分割方法
Global Context-Aware Point Cloud Semantic Segmentation Method Based on KPConv
DOI: 10.12677/aam.2026.154149, PDF,    科研立项经费支持
作者: 余明珂:中国地质大学(武汉)数学与物理学院,湖北 武汉
关键词: 点云语义分割KPConv全局上下文感知特征融合Point Cloud Semantic Segmentation KPConv Global Context Awareness Feature Fusion
摘要: 针对KPConv网络在解码阶段对全局语义信息建模的局限性,提出了一种全局上下文感知点云语义分割模型(GCA-Net)。该模型在解码阶段逐层引入一种轻量化的全局–局部特征引导机制,使局部特征在上采样恢复过程中受到全局语义信息的约束,从而增强输出特征的语义一致性。具体而言,通过全局注意力池化获得全局语义特征表示,采用瓶颈结构增强特征通道间的依赖关系,融合输入点云的局部特征,据此增强全局与局部语义特征的协同表达能力。为了验证所提模型的有效性,在ISPRS数据集上进行了测试,相比于基准网络KPConv,所提GCA-Net的OA和F1分别提高了0.9%和2.7%。
Abstract: To address the limitation of KPConv in modeling global semantic information during the decoding stage, a global context-aware point cloud semantic segmentation model named GCA-Net is proposed. The model incorporates a lightweight global-local feature guidance mechanism into the decoder, enabling global semantic information to effectively constrain local feature recovery during upsampling and enhancing the semantic consistency of the output features. Global semantic representations are constructed through global attention pooling and refined by a bottleneck structure that strengthens inter-channel dependencies, and are subsequently fused with encoder features to improve the collaborative representation of global and local semantics. Experimental results on the ISPRS dataset show that, compared with the baseline KPConv network, GCA-Net achieves improvements of 0.9% in overall accuracy (OA) and 2.7% in F1 score.
文章引用:余明珂. 基于KPConv的全局上下文感知点云语义分割方法[J]. 应用数学进展, 2026, 15(4): 192-200. https://doi.org/10.12677/aam.2026.154149

参考文献

[1] 艾达, 张晓阳, 胥策, 等. 大规模点云深度学习语义分割方法新进展[J]. 激光与光电子学进展, 2024, 61(12): 43-60.
[2] Qi, C.R., Su, H., Mo, K., et al. (2017) PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 652-660.
[3] Lin, C.H., Chen, J.Y., Su, P.L., et al. (2014) Eigen-Feature Analysis of Weighted Covariance Matrices for LiDAR Point Cloud Classification. ISPRS Journal of Photogrammetry and Remote Sensing, 94, 70-79. [Google Scholar] [CrossRef
[4] Zhang, J., Lin, X. and Ning, X. (2013) SVM-Based Classification of Segmented Airborne LiDAR Point Clouds in Urban Areas. Remote Sensing, 5, 3749-3775. [Google Scholar] [CrossRef
[5] Chehata, N., Guo, L. and Mallet, C. (2009) Airborne LiDAR Feature Selection for Urban Classification Using Random Forests. Laserscanning, Paris, 1-2 September 2009, 207-212.
[6] Qi, C.R., Yi, L., Su, H., et al. (2017) PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 5105-5114.
[7] Thomas, H., Qi, C.R., Deschaud, J., Marcotegui, B., Goulette, F. and Guibas, L. (2019) KPConv: Flexible and Deformable Convolution for Point Clouds. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27-28 October 2019, 6411-6420. [Google Scholar] [CrossRef
[8] Zeng, T., Luo, F., Guo, T., Gong, X., Xue, J. and Li, H. (2023) Multilevel Context Feature Fusion for Semantic Segmentation of ALS Point Cloud. IEEE Geoscience and Remote Sensing Letters, 20, Article ID: 5506605. [Google Scholar] [CrossRef
[9] Zeng, T., Luo, F., Guo, T., Gong, X., Xue, J. and Li, H. (2023) Recurrent Residual Dual Attention Network for Airborne Laser Scanning Point Cloud Semantic Segmentation. IEEE Transactions on Geoscience and Remote Sensing, 61, Article ID: 5702614. [Google Scholar] [CrossRef
[10] Rottensteiner, F., Sohn, G., Jung, J., Gerke, M., Baillard, C., Benitez, S., et al. (2012) The ISPRS Benchmark on Urban Object Classification and 3D Building Reconstruction. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 3, 293-298. [Google Scholar] [CrossRef
[11] Zhao, J., Zhou, H. and Pan, F. (2024) A Dual Attention KPConv Network Combined with Attention Gates for Semantic Segmentation of ALS Point Clouds. IEEE Transactions on Geoscience and Remote Sensing, 62, Article ID: 5107914. [Google Scholar] [CrossRef
[12] Liu, C., Liu, Z. and Wang, X. (2025) Bidirectional Feature Aggregation and Adaptive Fusion Network for ALS Point Cloud Semantic Segmentation. IEEE Geoscience and Remote Sensing Letters, 22, Article ID: 6500705. [Google Scholar] [CrossRef