基于粒子群优化的注意力自适应U-Net架构搜索
Attention-Adaptive U-Net Architecture Search via Particle Swarm Optimization
摘要: U-Net通过跳跃连接对多尺度特征的有效融合,已成为医学图像分割的主流方法。然而,其架构超参数(卷积核大小、通道数)和注意力机制选择高度依赖人工设计,缺乏自动化优化手段。若将注意力选择纳入搜索空间,会导致维度爆炸和计算开销激增。针对此问题,文章提出PSO-AttUNet,通过粒子群优化(PSO)搜索U-Net架构超参数,并结合基于计分的注意力机制为每层独立选择最合适的注意力类型(SE、CBAM、AG),实现架构搜索与注意力选择的并行优化。在BUSI乳腺超声数据集上的实验表明,PSO-AttUNet在Dice、IoU、HD95等指标上优于U-Net、Attention U-Net等基线模型,同时参数量降低至U-Net的1/47,验证了方法的有效性。
Abstract: U-Net has become the mainstream method for medical image segmentation due to its effective fusion of multi-scale features through skip connections. However, its architectural hyperparameters (kernel sizes, channel numbers) and attention mechanism selection heavily rely on manual design, lacking automated optimization methods. Incorporating attention selection into the search space would lead to dimensional explosion and significant computational overhead. To address this issue, this paper proposes PSO-AttUNet, which searches U-Net architectural hyperparameters through Particle Swarm Optimization (PSO) and independently selects the optimal attention type (SE, CBAM, AG) for each layer using a score-based attention evaluation mechanism, achieving parallel optimization of architecture search and attention selection. Experiments on the BUSI breast ultrasound dataset demonstrate that PSO-AttUNet outperforms baseline models such as U-Net and Attention U-Net in metrics including Dice, IoU, and HD95, while reducing the parameter count to 1/47 of U-Net, validating the effectiveness of the proposed method.
文章引用:向健. 基于粒子群优化的注意力自适应U-Net架构搜索[J]. 计算机科学与应用, 2026, 16(5): 472-482. https://doi.org/10.12677/csa.2026.165198

参考文献

[1] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Springer, 234-241. [Google Scholar] [CrossRef
[2] Oktay, O., Schlemper, J., Le Folgoc, L., Lee, M., Heinrich, M., Misawa, K., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas. arXiv: 1804.03999.
[3] Kennedy, J. and Eberhart, R. (1995) Particle Swarm Optimization. Proceedings of ICNN’95—International Conference on Neural Networks, Perth, 27 November-1 December 1995, 1942-1948. [Google Scholar] [CrossRef
[4] Fernandes Junior, F.E. and Yen, G.G. (2019) Particle Swarm Optimization of Deep Neural Networks Architectures for Image Classification. Swarm and Evolutionary Computation, 49, 62-74. [Google Scholar] [CrossRef
[5] Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N. and Liang, J. (2018) Unet++: A Nested U-Net Architecture for Medical Image Segmentation. In: Stoyanov, D., et al., Eds., Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer, 3-11. [Google Scholar] [CrossRef] [PubMed]
[6] Diakogiannis, F.I., Waldner, F., Caccetta, P. and Wu, C. (2020) ResUNet-a: A Deep Learning Framework for Semantic Segmentation of Remotely Sensed Data. ISPRS Journal of Photogrammetry and Remote Sensing, 162, 94-114. [Google Scholar] [CrossRef
[7] Khouy, M., Jabrane, Y., Ameur, M. and Hajjam El Hassani, A. (2023) Medical Image Segmentation Using Automatic Optimized U-Net Architecture Based on Genetic Algorithm. Journal of Personalized Medicine, 13, Article 1298. [Google Scholar] [CrossRef] [PubMed]
[8] Ying, W., Zheng, Q., Wu, Y., Yang, K., Zhou, Z., Chen, J., et al. (2023) Efficient Multi-Objective Evolutionary Neural Architecture Search for U-Nets with Diamond Atrous Convolution and Transformer for Medical Image Segmentation. Applied Soft Computing, 148, Article ID: 110869. [Google Scholar] [CrossRef
[9] Yu, C., Wang, Y., Tang, C., Feng, W. and Lv, J. (2023) EU-Net: Automatic U-Net Neural Architecture Search with Differential Evolutionary Algorithm for Medical Image Segmentation. Computers in Biology and Medicine, 167, Article ID: 107579. [Google Scholar] [CrossRef] [PubMed]
[10] Saifullah, S. and Dreżewski, R. (2025) Particle Swarm-Optimized U-Net Framework for Precise Multimodal Brain Tumor Segmentation. Proceedings of the Genetic and Evolutionary Computation Conference Companion, Malaga, 14-18 July 2025, 323-326. [Google Scholar] [CrossRef
[11] Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7132-7141. [Google Scholar] [CrossRef
[12] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 3-19. [Google Scholar] [CrossRef
[13] Zhang, B., Qiu, S. and Liang, T. (2024) Dual Attention-Based 3D U-Net Liver Segmentation Algorithm on CT Images. Bioengineering, 11, Article 737. [Google Scholar] [CrossRef] [PubMed]
[14] Gu, R., Wang, G., Song, T., Huang, R., Aertsen, M., Deprest, J., et al. (2021) CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation. IEEE Transactions on Medical Imaging, 40, 699-711. [Google Scholar] [CrossRef] [PubMed]
[15] Chen, J.N., Lu, Y.Y., Yu, Q.H., Luo, X.D., Adeli, E., Wang, Y., Lu, L., Yuille, A.L. and Zhou, Y.Y. (2021) TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation. arXiv: 2102.04306.
[16] Cao, H., Wang, Y., Chen, J., Jiang, D., Zhang, X., Tian, Q., et al. (2023) Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation. In: Karlinsky, L., Michaeli, T. and Nishino, K., Eds., Computer VisionECCV 2022 Workshops, Springer, 205-218. [Google Scholar] [CrossRef
[17] Hassanzadeh, T., Essam, D. and Sarker, R. (2021) Evolutionary Deep Attention Convolutional Neural Networks for 2D and 3D Medical Image Segmentation. Journal of Digital Imaging, 34, 1387-1404. [Google Scholar] [CrossRef] [PubMed]
[18] Alansari, M., Hay, O.A., Javed, S., Shoufan, A., Zweiri, Y. and Werghi, N. (2023) GhostFaceNets: Lightweight Face Recognition Model from Cheap Operations. IEEE Access, 11, 35429-35446. [Google Scholar] [CrossRef
[19] Wang, H., Li, Y., Wu, Z., Wang, H. and Zhang, Y. (2024) SASE: A Searching Architecture for Squeeze and Excitation Operations. arXiv: 2411.08333.
[20] Weng, Y., Zhou, T., Li, Y. and Qiu, X. (2019) Nas-Unet: Neural Architecture Search for Medical Image Segmentation. IEEE Access, 7, 44247-44257. [Google Scholar] [CrossRef
[21] Jha, D., Smedsrud, P.H., Riegler, M.A., Johansen, D., Lange, T.D., Halvorsen, P., et al. (2019) ResUNet++: An Advanced Architecture for Medical Image Segmentation. 2019 IEEE International Symposium on Multimedia (ISM), San Diego, 9-11 December 2019, 225-230. [Google Scholar] [CrossRef