DA-SwinUnet:基于方向注意力模块与SwinUnet的股骨头坏死分割模型
DA-SwinUnet: A Segmentation Model for Avascular Necrosis of the Femoral Head Based on Directional Attention Module and SwinUnet
摘要: 股骨头坏死是临床常见的骨科疾病,早期诊断与精准病灶定位对有效治疗至关重要。针对当前影像诊断主观性强、缺乏客观性的问题,以及现有分割方法未能充分利用图像解剖一致性的问题,本研究提出了一种融合定向注意力机制的DA-SwinUnet模型。该模型在Swin-Unet架构基础上引入定向注意力机制以增强方向性特征提取,并通过加权跳跃连接优化网络性能,有效降低了特征融合阶段的计算冗余,提升了对不同大小病灶的分割精度;同时整合方向连通向量概念以进一步强化解剖一致性约束。在股骨头坏死临床MRI数据集上的实验结果显示,DA-SwinUnet模型的Dice系数达到87.3,IoU达到78.7,性能优于传统CNN模型及其他基于Transformer的分割方法。研究表明,DA-SwinUnet能够在保持解剖一致性的前提下实现股骨头坏死病灶的精确分割,显著提升分割性能,为疾病的早期诊断和临床治疗提供了有力支持,具有广阔的前景。
Abstract: Osteonecrosis of the femoral head (ONFH) is a common orthopedic condition, where early diagnosis and precise lesion localization are crucial for effective treatment. To address the issues of subjectivity in current image-based diagnosis and the failure of existing segmentation methods to fully leverage the anatomical consistency of medical images, this study proposes a DA-SwinUnet model enhanced with a directional attention mechanism. Built upon the Swin-Unet Transformer architecture, this model integrates a directional attention mechanism to enhance directional feature extraction and employs weighted skip connections to optimize network performance, reducing computational complexity while improving segmentation accuracy for lesions of varying sizes. The concept of a Directional Connectivity Vector is also incorporated to further enforce anatomical consistency. Experimental results on a clinical MRI dataset for ONFH demonstrate that the DA-SwinUnet model achieves a Dice coefficient of 87.3 and an IoU of 78.7, outperforming traditional CNN models and other Transformer-based segmentation methods. This study indicates that the DA-SwinUnet can achieve precise segmentation of ONFH lesions while maintaining anatomical consistency, significantly enhancing segmentation performance. It provides robust support for the early diagnosis and clinical treatment of the disease and holds promising potential for application.
文章引用:刁瑞新, 王恒澳, 王琨, 计晓斐. DA-SwinUnet:基于方向注意力模块与SwinUnet的股骨头坏死分割模型[J]. 软件工程与应用, 2026, 15(2): 168-180. https://doi.org/10.12677/sea.2026.152017

参考文献

[1] Zhao, D., Yu, M., Hu, K., Wang, W., Yang, L., Wang, B., et al. (2015) Prevalence of Nontraumatic Osteonecrosis of the Femoral Head and Its Associated Risk Factors in the Chinese Population: Results from a Nationally Representative Survey. Chinese Medical Journal, 128, 2843-2850. [Google Scholar] [CrossRef] [PubMed]
[2] Mont, M.A. and Hungerford, D.S. (1995) Non-Traumatic Avascular Necrosis of the Femoral Head. The Journal of Bone & Joint Surgery, 77, 459-474. [Google Scholar] [CrossRef] [PubMed]
[3] Alkhatatbeh, T., Alkhatatbeh, A., Guo, Q., Chen, J., Song, J., Qin, X., et al. (2025) Interpretable Machine Learning and Radiomics in Hip MRI Diagnostics: Comparing ONFH and OA Predictions to Experts. Frontiers in Immunology, 16, Article ID: 1532248. [Google Scholar] [CrossRef] [PubMed]
[4] Zhang, Y., Cao, X., Li, X., Chen, J., Zhao, Y., Tian, Z., et al. (2018) Accuracy of MRI Diagnosis of Early Osteonecrosis of the Femoral Head: A Meta-Analysis and Systematic Review. Journal of Orthopaedic Surgery and Research, 13, Article No. 167. [Google Scholar] [CrossRef] [PubMed]
[5] Li, M., Jiang, Y., Zhang, Y. and Zhu, H. (2023) Medical Image Analysis Using Deep Learning Algorithms. Frontiers in Public Health, 11, Article ID: 1273253. [Google Scholar] [CrossRef] [PubMed]
[6] Litjens, G., Kooi, T., Bejnordi, B.E., Setio, A.A.A., Ciompi, F., Ghafoorian, M., et al. (2017) A Survey on Deep Learning in Medical Image Analysis. Medical Image Analysis, 42, 60-88. [Google Scholar] [CrossRef] [PubMed]
[7] Wang, P., Liu, X., Xu, J., Li, T., Sun, W., Li, Z., et al. (2021) Deep Learning for Diagnosing Osteonecrosis of the Femoral Head Based on Magnetic Resonance Imaging. Computer Methods and Programs in Biomedicine, 208, Article ID: 106229. [Google Scholar] [CrossRef] [PubMed]
[8] Gao, S., Zhu, H., Wen, M., He, W., Wu, Y., Li, Z., et al. (2024) Prediction of Femoral Head Collapse in Osteonecrosis Using Deep Learning Segmentation and Radiomics Texture Analysis of MRI. BMC Medical Informatics and Decision Making, 24, Article No. 320. [Google Scholar] [CrossRef] [PubMed]
[9] Yang, Z. and Farsiu, S. (2023) Directional Connectivity-Based Segmentation of Medical Images. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 18-22 June 2023, 11525-11535. [Google Scholar] [CrossRef] [PubMed]
[10] Cao, H., Wang, Y., Chen, J., Jiang, D., Zhang, X., Tian, Q., et al. (2023) Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation. In: Karlinsky, L., et al., Eds., Computer VisionECCV 2022 Workshops, Springer, 205-218. [Google Scholar] [CrossRef
[11] Shen, Y., Guo, P., Wu, J., Huang, Q., Le, N., Zhou, J., et al. (2023) MoViT: Memorizing Vision Transformers for Medical Image Analysis. In: Cao, X.H., et al., Eds., Machine Learning in Medical Imaging, Springer, 205-213. [Google Scholar] [CrossRef] [PubMed]
[12] Shen, X., He, Z., Shi, Y., Liu, T., Yang, Y., Luo, J., et al. (2024) Development and Validation of an Automated Classification System for Osteonecrosis of the Femoral Head Using Deep Learning Approach: A Multicenter Study. The Journal of Arthroplasty, 39, 379-386.e2. [Google Scholar] [CrossRef] [PubMed]
[13] Ruckli, A.C., Nanavati, A.K., Meier, M.K., Lerch, T.D., Steppacher, S.D., Vuilleumier, S., et al. (2023) A Deep Learning Method for Quantification of Femoral Head Necrosis Based on Routine Hip MRI for Improved Surgical Decision Making. Journal of Personalized Medicine, 13, Article No. 153. [Google Scholar] [CrossRef] [PubMed]
[14] Uemura, K., Takashima, K., Otake, Y., Li, G., Mae, H., Okada, S., et al. (2025) Development of a Deep Learning Model for the Volumetric Assessment of Osteonecrosis of the Femoral Head on Three-Dimensional Magnetic Resonance Imaging. The Journal of Arthroplasty, 40, S160-S166.e1. [Google Scholar] [CrossRef] [PubMed]
[15] Hatamizadeh, A., Nath, V., Tang, Y., Yang, D., Roth, H.R. and Xu, D. (2022) Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images. In: Crimi, A. and Bakas, S., Eds., Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, Springer International Publishing, 272-284. [Google Scholar] [CrossRef
[16] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., et al., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Springer International Publishing, 234-241. [Google Scholar] [CrossRef
[17] Badrinarayanan, V., Kendall, A. and Cipolla, R. (2017) Segnet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 2481-2495. [Google Scholar] [CrossRef] [PubMed]
[18] Chen, L., Papandreou, G., Kokkinos, I., Murphy, K. and Yuille, A.L. (2018) Deeplab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected Crfs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 834-848. [Google Scholar] [CrossRef] [PubMed]
[19] Sun, K., Zhao, Y., Jiang, B., Cheng, T., Xiao, B., Liu, D., et al. (2019) High-Resolution Representations for Labeling Pixels and Regions.
[20] Ma, X., Chen, M., Zhang, J., Song, L., Du, F. and Yu, Z. (2025) PARF-Net: Integrating Pixel-Wise Adaptive Receptive Fields into Hybrid Transformer-CNN Network for Medical Image Segmentation.
[21] Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., et al. (2021) Transunet: Transformers Make Strong Encoders for Medical Image Segmentation.
[22] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., et al. (2020) An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale.
[23] Hatamizadeh, A., Tang, Y., Nath, V., Yang, D., Myronenko, A., Landman, B., Roth, H.R. and Xu, D. (2021) Unetr: Transformers for 3D Medical Image Segmentation. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 5-9 January 2021, 574-584.
[24] Ou, Y., Yuan, Y., Huang, X., Wong, S.T.C., Volpi, J., Wang, J.Z., et al. (2022) Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation. In: Wang, L.W., et al., Eds., Medical Image Computing and Computer Assisted InterventionMICCAI 2022, Springer, 475-484. [Google Scholar] [CrossRef
[25] Li, Z., Lyu, H. and Wang, J. (2023) FusionU-Net: U-Net with Enhanced Skip Connection for Pathology Image Segmentation. Asian Conference on Machine Learning, PMLR, İstanbul, 11-14 November 2023, 694-706.
[26] Wang, H., Cao, P., Yang, J. and Zaiane, O. (2023) MCA-UNet: Multi-Scale Cross Co-Attentional U-Net for Automatic Medical Image Segmentation. Health Information Science and Systems, 11, Article No. 10. [Google Scholar] [CrossRef] [PubMed]
[27] Jian, M., Wu, R., Xu, W., Zhi, H., Tao, C., Chen, H., et al. (2024) Vascuconnet: An Enhanced Connectivity Network for Vascular Segmentation. Medical & Biological Engineering & Computing, 62, 3543-3554. [Google Scholar] [CrossRef] [PubMed]
[28] Azad, R., Heidary, M., Yilmaz, K., et al. (2023) Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook.
[29] Lin, T.-Y., Doll’ar, P., Girshick, R., He, K., Hariharan, B. and Belongie, S. (2016) Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 2117-2125.