基于改进PointNet 模型的毫米波点云语义分割
Semantic Segmentation of Millimeter Wave Point Clouds Based on Improved PointNet
摘要: 毫米波成像技术在安检领域得到了普遍应用,研究基于毫米波点云图像的语义分割技术具有重要的意义。PointNet 采用了与任务无关的最远点采样(FPS)来逐步下采样点云,导致毫米波点云中为数不多的前景点信息丢失。因此,本文提出了一种基于自注意力机制的实例感知下采样点云语义分割网络。具体来说,本文结合自注意力机制实现面向任务的下采样策略来保留前景点,防止前景点信息的丢失。最后,由于毫米波点云图像中人体点云数量与前景点云数量极不平衡,改进使用Focal Loss作为语义分割损失函数以提升性能。实验结果表明,本文提出的语义分割模型相对于基准模型PointNet 在平均交并比(mIoU)方面有6.19%的提升,同时准确率有5.57%的提升。
Abstract: Millimeter-wave imaging technology has been widely applied in the security screening field, making the study of semantic segmentation techniques based on millimeter-wave point cloud images of paramount importance. PointNet employs a task-agnostic farthest point sampling (FPS) method for progressively downsampling the point cloud, resulting in the loss of sparse foreground information in the millimeter-wave point clouds. Consequently, this paper introduces an instance-aware downsampling point cloud semantic segmentation network based on the self-attention mechanism. Specifically, this work integrates the self-attention mechanism to implement a task-oriented downsampling strategy to preserve foreground points and prevent the loss of foreground information. Lastly, due to the significant imbalance between the number of human body points and foreground points in millimeter-wave point cloud images, an improved focal loss is utilized as the semantic segmentation loss function to enhance performance. Experimental results demonstrate that the semantic segmentation model proposed in this paper achieves a 6.19% improvement in mean Intersection over Union (mIoU) and a 5.57% increase in accuracy compared to the baseline model PointNet .
文章引用:柳荧, 王韬. 基于改进PointNet 模型的毫米波点云语义分割[J]. 建模与仿真, 2024, 13(3): 3653-3662. https://doi.org/10.12677/mos.2024.133333

参考文献

[1] Sheen, D.M., McMakin, D.L. and Hall, T.E. (2001) Three-Dimensional Millimeter-Wave Imaging for Concealed Weapon Detection. IEEE Transactions on Microwave Theory and Techniques, 49, 1581-1592. [Google Scholar] [CrossRef
[2] Appleby, R. and Anderton, R.N. (2007) Millimeter-Wave and Submillimeter-Wave Imaging for Security and Surveillance. Proceedings of the IEEE, 95, 1683-1690. [Google Scholar] [CrossRef
[3] Wang, Z., Chang, T. and Cui, H.-L. (2019) Review of Active Millimeter Wave Imaging Techniques for Personnel Security Screening. IEEE Access, 7, 148336-148350. [Google Scholar] [CrossRef
[4] 陈国平, 彭之玲, 黄超意, 等. 基于改进YOLOv3-Tiny的毫米波图像目标检测[J]. 电子测量技术, 2021(21): 44.
[5] 张格菲, 李春宇, 刘金坤, 等. 基于YOLOv5的毫米波图像目标检测方法研究[J]. 宇航计测技术, 2021, 41(5): 5. [Google Scholar] [CrossRef
[6] 程秋菊, 陈国平, 王璐, 等. 基于卷积神经网络的毫米波图像目标检测[J]. 科学技术与工程, 2020, 20(13): 6. [Google Scholar] [CrossRef
[7] 丁俊华, 袁明辉. 基于双分支多尺度融合网络的毫米波SAR图像多目标语义分割方法[J]. 光电工程, 2023, 50(12): 75-86. [Google Scholar] [CrossRef
[8] Qi, C.R., Yi, L., Su, H., et al. (2017) Pointnet : Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Advances in Neural Information Processing Systems, 30, 5105-5114.
[9] Qi, C.R., Su, H., Mo, K., et al. (2017) Pointnet: Deep Learning on Point Sets for 3D Classification and Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Venice, 21-26 July 2017, 652-660.
[10] Zhang, Y., Hu, Q., Xu, G., et al. (2022) Not All Points Are Equal: Learning Highly Efficient Point-Based Detectors for 3D Lidar Point Clouds. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, 18-24 June 2022, 18953-18962. [Google Scholar] [CrossRef
[11] Cao, Y., Xu, J., Lin, S., et al. (2020) Global Context Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 6881-6895. [Google Scholar] [CrossRef
[12] Wang, X., Girshick, R., Gupta, A., et al. (2018) Non-Local Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7794-7803. [Google Scholar] [CrossRef
[13] Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7132-7141. [Google Scholar] [CrossRef
[14] Lin, T.Y., Goyal, P., Girshick, R., et al. (2017) Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef