基于可微拓扑搜索与可信动量优化的半监督3D牙齿分割方法

doi:10.12677/csa.2026.164148

期刊菜单

基于可微拓扑搜索与可信动量优化的半监督3D牙齿分割方法
Semi-Supervised 3D Tooth Segmentation via Differentiable Topology Search and Trusted Momentum Optimization

DOI: 10.12677/csa.2026.164148, PDF,
作者: 林泽峰：广东工业大学数学与统计学院，广东广州
关键词: 3D牙齿分割；深度学习；神经架构搜索；半监督学习；可信动量优化；3D Tooth Segmentation； Deep Learning； Neural Architecture Search； Semi-Supervised Learning； Trusted Momentum Optimization

摘要: 本文提出DiNTS-TMO，一种结合可微拓扑搜索与可信动量优化的半监督3D牙齿分割框架。针对现有NAS方法在半监督场景下易受伪标签噪声干扰的问题，DiNTS-TMO采用DiNTS可微拓扑搜索自动发现网络结构，并引入可信动量优化(TMO)机制，通过对齐门控筛选无标签梯度，在搜索阶段同时约束网络权重与架构参数的更新。基于STS-Tooth数据集两个子集的五折交叉验证结果，DiNTS-TMO在ROI子集上取得了最优的DSC和HD95，在Integrity子集上取得了最优的DSC，但在边界距离指标上仍存在改进空间。实验结果表明，该方法在低标注3D牙齿分割任务中具有较强的竞争力，并为结合NAS与半监督学习提供了一种可行思路。

Abstract: We propose DiNTS-TMO, a semi-supervised 3D tooth segmentation framework that combines differentiable topology search with Trusted Momentum Optimization (TMO). To address the sensitivity of NAS to noisy pseudo labels, DiNTS-TMO uses DiNTS to automatically discover network topologies and introduces an alignment-gated TMO mechanism to filter unlabeled gradients, jointly constraining the updates of network weights and architecture parameters during search. Five-fold cross-validation on two STS-Tooth subsets shows that DiNTS-TMO achieves the best DSC and HD95 on the ROI subset and the best DSC on the Integrity subset, while still leaving room for improvement on boundary-distance metrics in the full-scan setting. These results indicate that DiNTS-TMO is a competitive approach for low-label 3D tooth segmentation and provide a practical route to combining NAS with semi-supervised learning.

文章引用：林泽峰. 基于可微拓扑搜索与可信动量优化的半监督3D牙齿分割方法[J]. 计算机科学与应用, 2026, 16(4): 498-510. https://doi.org/10.12677/csa.2026.164148

参考文献

[1]	Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Lecture Notes in Computer Science, Springer, 234-241. [Google Scholar] [CrossRef]
[2]	Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T. and Ronneberger, O. (2016) 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation. In: Lecture Notes in Computer Science, Springer, 424-432. [Google Scholar] [CrossRef]
[3]	Milletari, F., Navab, N. and Ahmadi, S. (2016) V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. 2016 Fourth International Conference on 3D Vision (3DV), Stanford, 25-28 October 2016, 565-571. [Google Scholar] [CrossRef]
[4]	Isensee, F., Jaeger, P.F., Kohl, S.A.A., Petersen, J. and Maier-Hein, K.H. (2021) NNU-Net: A Self-Configuring Method for Deep Learning-Based Biomedical Image Segmentation. Nature Methods, 18, 203-211. [Google Scholar] [CrossRef] [PubMed]
[5]	Liu, H., Simonyan, K. and Yang, Y. (2019) DARTS: Differentiable Architecture Search. Proceedings of the International Conference on Learning Representations, New Orleans, 6-9 May 2019, 1-13.
[6]	He, Y., Yang, D., Roth, H., Zhao, C. and Xu, D. (2021) DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 5837-5846. [Google Scholar] [CrossRef]
[7]	Laine, S. and Aila, T. (2017) Temporal Ensembling for Semi-Supervised Learning. Proceedings of the International Conference on Learning Representations, Toulon, 24-26 April 2017, 1-13.
[8]	Tarvainen, A. and Valpola, H. (2017) Mean Teachers are Better Role Models: Weight-Averaged Consistency Targets Improve Semi-Supervised Deep Learning. Advances in Neural Information Processing Systems, 30, 1195-1204.
[9]	Yu, L., Wang, S., Li, X., Fu, C.W. and Heng, P.A. (2019) Uncertainty-Aware Self-Ensembling Model for Semi-Supervised 3D Left Atrium Segmentation. In: Lecture Notes in Computer Science, Springer, 605-613. [Google Scholar] [CrossRef]
[10]	Chen, X., Yuan, Y., Zeng, G. and Wang, J. (2021) Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 2613-2622. [Google Scholar] [CrossRef]
[11]	Zhao, W., Zhong, L., Liao, X., Liao, W., Zhang, S., Zhang, S., et al. (2026) MetaSSL: A General Heterogeneous Loss for Semi-Supervised Medical Image Segmentation. IEEE Transactions on Medical Imaging, 45, 751-763. [Google Scholar] [CrossRef]
[12]	Huang, H., Chen, Z., Chen, C., Lu, M. and Zou, Y. (2023) Complementary Consistency Semi-Supervised Learning for 3D Left Atrial Image Segmentation. Computers in Biology and Medicine, 165, Article 107368. [Google Scholar] [CrossRef] [PubMed]
[13]	Liang, K., Chen, L., Liu, B. and Liu, Q. (2026) Cautious Optimizers: Improving Training with One Line of Code. Proceedings of the International Conference on Learning Representations, Rio de Janeiro, 23-27 April 2026, 1-26.
[14]	Pauletto, L., Amini, M. and Winckler, N. (2022) Se²NAS: Self-Semi-Supervised Architecture Optimization for Semantic Segmentation. 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, 21-25 August 2022, 54-60. [Google Scholar] [CrossRef]
[15]	Chen, R., Nian, F., Cen, Y., Peng, Y., Wang, H., Yu, Z., et al. (2025) L-SSHNN: A Larger Search Space of Semi-Supervised Hybrid NAS Network for Echocardiography Segmentation. Expert Systems with Applications, 276, Article 127084. [Google Scholar] [CrossRef]
[16]	Yu, T., Kumar, S., Gupta, A., Levine, S., Hausman, K. and Finn, C. (2020) Gradient Surgery for Multi-Task Learning. Advances in Neural Information Processing Systems, 33, 5824-5836.
[17]	Qiu, Z., Yao, T. and Mei, T. (2017). Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 5534-5542.[CrossRef]
[18]	Liu, C., Chen, L., Schroff, F., Adam, H., Hua, W., Yuille, A.L., et al. (2019) Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 82-92. [Google Scholar] [CrossRef]
[19]	Wang, Y., Ye, F., Chen, Y., Wang, C., Wu, C., Xu, F., et al. (2025) A Multi-Modal Dental Dataset for Semi-Supervised Deep Learning Image Segmentation. Scientific Data, 12, Article No. 117. [Google Scholar] [CrossRef] [PubMed]
[20]	Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., et al. (2019) PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems, 32, 8024-8035.
[21]	MONAI Contributors (2026) Modules—MONAI 1.5.0 Documentation. https://docs.monai.io/en/1.5.0/modules.html
[22]	Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas. Proceedings of the International Conference on Medical Imaging with Deep Learning, Amsterdam, 4-6 July 2018, 1-10.
[23]	Hatamizadeh, A., Tang, Y., Nath, V., Yang, D., Myronenko, A., Landman, B., et al. (2022) UNETR: Transformers for 3D Medical Image Segmentation. 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, 3-8 January 2022, 1748-1758. [Google Scholar] [CrossRef]
[24]	Tang, Y., Yang, D., Li, W., Roth, H.R., Landman, B., Xu, D., et al. (2022) Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 20698-20708. [Google Scholar] [CrossRef]
[25]	Myronenko, A. (2019) 3D MRI Brain Tumor Segmentation Using Autoencoder Regularization. In: Crimi, A., Bakas, S., Kuijf, H., Keyvan, F., Reyes, M. and van Walsum, T., Lecture Notes in Computer Science, Springer, 311-320. [Google Scholar] [CrossRef]

为你推荐

友情链接