基于Adapter微调的轻量型SAM在超声图像分割中的研究
Research on Lightweight SAM Based on Adapter Fine-Tuning for Ultrasound Image Segmentation
DOI: 10.12677/mos.2025.144358, PDF,   
作者: 张霖娜:上海理工大学光电信息与计算机工程学院,上海
关键词: 超声图像分割Adapter微调轻量化模型Ultrasound Image Segmentation Adapter Fine-Tuning Lightweight Model
摘要: 随着医学影像技术的快速发展,超声图像在疾病诊断和治疗中发挥着重要作用。然而,超声图像具有噪声大、对比度低和边界模糊等特点,给超声图像分割任务带来了巨大挑战。本文提出了一种基于Adapter微调的轻量超声图像分割模型LG-MedSeg,旨在解决计算资源受限和超声图像分割精度不足的问题。通过知识蒸馏技术将SAM-Med2d模型轻量化,并引入并行低秩分解Adapte (LRAdapter)模块,增强模型对超声图像特征的适应能力。实验结果表明,LG-MedSeg在胸腔积液和腹腔积液分割任务中表现出色,Dice系数达到90.85%,IoU达到84.5%,同时模型参数量仅为13.15 M,显著降低了计算复杂度,并在多个公开数据集上的实验验证了LRAdapter的有效性和通用性。本文的研究为医学图像分割提供了一种高效、轻量化的解决方案,特别适用于资源受限的边缘设备部署。
Abstract: With the rapid development of medical imaging technology, ultrasound images play a crucial role in disease diagnosis and treatment. However, ultrasound images are characterized by high noise, low contrast, and blurred boundaries, posing significant challenges to ultrasound image segmentation tasks. This paper proposes LG-MedSeg, a lightweight ultrasound image segmentation model based on Adapter fine-tuning, aiming to address the issues of insufficient computational resources and inadequate segmentation accuracy for ultrasound images. By applying knowledge distillation technology to streamline the SAM-Med2D model and introducing a parallel low-rank decomposition Adapter (LRAdapter) module, the model’s adaptability to ultrasound image features is enhanced. Experimental results demonstrate that LG-MedSeg achieves outstanding performance in thoracic effusion and abdominal effusion segmentation tasks, with a Dice coefficient of 90.85% and IoU of 84.5%, while maintaining only 13.15 million parameters. This significantly reduces computational complexity, and experiments on multiple public datasets validate the effectiveness and generalizability of the LRAdapter. Our research provides an efficient and lightweight solution for medical image segmentation, particularly suitable for deployment on resource-constrained edge devices.
文章引用:张霖娜. 基于Adapter微调的轻量型SAM在超声图像分割中的研究[J]. 建模与仿真, 2025, 14(4): 1109-1119. https://doi.org/10.12677/mos.2025.144358

参考文献

[1] Wei, W., Zhao, S. and Zou, D. (2023) Recommendation System: A Survey and New Perspectives. World Scientific Annual Review of Artificial Intelligence, 1, Article ID: 2330001. [Google Scholar] [CrossRef
[2] Xia, Q., Zheng, H., Zou, H., Luo, D., Tang, H., Li, L., et al. (2025) A Comprehensive Review of Deep Learning for Medical Image Segmentation. Neurocomputing, 613, Article ID: 128740. [Google Scholar] [CrossRef
[3] Zhang, C., Liu, L., Cui, Y., et al. (2023) A Comprehensive Survey on Segment Anything Model for Vision and Beyond. arXiv: 2305.08196.
[4] Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., et al. (2023) Segment Anything. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 1-6 October 2023, 3992-4003. [Google Scholar] [CrossRef
[5] Zhang, Y., Shen, Z. and Jiao, R. (2024) Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions. Computers in Biology and Medicine, 171, Article ID: 108238. [Google Scholar] [CrossRef] [PubMed]
[6] Cheng, J., Ye, J., Deng, Z., et al. (2023) SAM-Med2D. arXiv: 2308.16184.
[7] Hinton, G., Vinyals, O. and Dean, J. (2015) Distilling the Knowledge in a Neural Network. arXiv: 1503.02531.
[8] Beyer, L., Zhai, X., Royer, A., Markeeva, L., Anil, R. and Kolesnikov, A. (2022) Knowledge Distillation: A Good Teacher Is Patient and Consistent. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 10915-10924. [Google Scholar] [CrossRef
[9] Zhang, C., Han, D., Qiao, Y., et al. (2023) Faster Segment Anything: Towards Lightweight SAM for Mobile Applications. arXiv: 2306.14289.
[10] Xiong, Y., Varadarajan, B., Wu, L., Xiang, X., Xiao, F., Zhu, C., et al. (2024) EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 16111-16121. [Google Scholar] [CrossRef
[11] Xin, Y., Luo, S., Zhou, H., et al. (2024) Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey. arXiv: 2402.02242.
[12] Houlsby, N., Giurgiu, A., Jastrzebski, S., et al. (2019) Parameter-Efficient Transfer Learning for NLP. International Conference on Machine Learning, Long Beach, 10-15 June 2019, 2790-2799.
[13] Hu, E.J., Shen, Y., Wallis, P., et al. (2022) LoRA: Low-Rank Adaptation of Large Language Models. arXiv: 2106.09685.
[14] Al-Dhabyani, W., Gomaa, M., Khaled, H. and Fahmy, A. (2020) Dataset of Breast Ultrasound Images. Data in Brief, 28, Article ID: 104863. [Google Scholar] [CrossRef] [PubMed]
[15] Pedraza, L., Vargas, C., Narváez, F., Durán, O., Muñoz, E. and Romero, E. (2015) An Open Access Thyroid Ultrasound Image Database. 10th International Symposium on Medical Information Processing and Analysis, Cartagena de Indias, 14-16 October 2014, 188-193. [Google Scholar] [CrossRef
[16] Gong, H., Chen, G., Wang, R., Xie, X., Mao, M., Yu, Y., et al. (2021) Multi-Task Learning for Thyroid Nodule Segmentation with Thyroid Region Prior. 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), Nice, 13-16 April 2021, 257-261. [Google Scholar] [CrossRef