基于改进MobileViT模型的脑肿瘤图像分类研究
Study on Brain Tumor Image Classification Based on Improved MobileViT Model
DOI: 10.12677/csa.2024.149184, PDF,    国家自然科学基金支持
作者: 向思宇, 冯慧芳*:西北师范大学数学与统计学院,甘肃 兰州
关键词: 脑肿瘤图像分类MobileViT轻量级CBAM迁移学习Brain Tumor Image Classification MobileViT Lightweight CBAM Transfer Learning
摘要: 针对目前基于深度学习的脑肿瘤分类算法参数多、计算复杂的问题,提出了一种基于改进MobileViT的轻量级脑肿瘤图像分类模型。首先,在轻量化模型MobileViT中加入卷积块注意力模块(CBAM)以有效地增强局部和全局特征。其次,采用迁移学习方法加快网络模型在脑肿瘤图像上的学习速度,并在训练过程中使用余弦退火算法来优化所提出的轻量化模型,使得模型更好地收敛。最后,在真实脑肿瘤数据集上对本文模型的有效性进行评估,并与现有的最新基线模型(ResNet、DenseNet121、ShuffleNet、EfficientNet、MobileNet和MobileViT)进行比较。实验结果表明,相比于基线模型,本文所提出的模型不仅显著提高了脑肿瘤图像分类的准确性,而且计算复杂度较低,符合在边缘计算中部署深度学习模型的需求。
Abstract: To address the challenges presented by current deep learning-based brain tumor classification algorithms, which involve numerous parameters and complex computations, we propose a lightweight brain tumor image classification model based on an enhanced version of MobileViT. Firstly, a convolutional block attention module (CBAM) is added to the lightweight model MobileViT to effectively enhance the local and global feature maps. Secondly, a transfer learning approach is used to accelerate the learning speed of the network model on brain tumor images. Additionally, we employ the cosine annealing algorithm to optimize the training process of our proposed lightweight model, facilitating better convergence. Finally, we evaluate the effectiveness of our proposed model on a real brain tumor dataset, comparing it with several state-of-the-art baselines including ResNet, DenseNet121, ShuffleNet, EfficientNet, MobileNet, and MobileViT. The experimental results show that compared to the baseline model, the proposed model in this paper not only significantly improves the accuracy of brain tumor image classification, but also has a lower computational complexity, which meets the requirements of deploying deep learning models in edge computing.
文章引用:向思宇, 冯慧芳. 基于改进MobileViT模型的脑肿瘤图像分类研究[J]. 计算机科学与应用, 2024, 14(9): 23-32. https://doi.org/10.12677/csa.2024.149184

参考文献

[1] Louis, D.N., Perry, A., Wesseling, P., et al. (2021) The 2021 WHO Classification of Tumors of the Central Nervous System: A Summary. Neuro-Oncology, 23, 1231-1251. [Google Scholar] [CrossRef] [PubMed]
[2] Mengash, H.A. and Mahmoud, H.A.H. (2021) Brain Cancer Tumor Classification from Motion-Corrected MRI Images Using Convolutional Neural Network. Computers, Materials & Continua, 68, 1551-1563. [Google Scholar] [CrossRef
[3] El Kader, I.A., Xu, G., Shuai, Z., et al. (2021) Brain Tumor Detection and Classification by Hybrid CNN-DWA Model Using MR Images. Current Medical Imaging, 17, 1248-1255. [Google Scholar] [CrossRef] [PubMed]
[4] Vankdothu, R., Hameed, M.A. and Fatima, H. (2022) A Brain Tumor Identification and Classification Using Deep Learning Based on CNN-LSTM Method. Computers and Electrical Engineering, 101, Article 107960. [Google Scholar] [CrossRef
[5] Simo, A.M.D., Kouanou, A.T., Monthe, V., et al. (2024) Introducing a Deep Learning Method for Brain Tumor Classification Using MRI Data towards Better Performance. Informatics in Medicine Unlocked, 44, Article 101423. [Google Scholar] [CrossRef
[6] Sandhiya, B. and Raja, S.K.S. (2024) Deep Learning and Optimized Learning Machine for Brain Tumor Classification. Biomedical Signal Processing and Control, 89, Article 105778. [Google Scholar] [CrossRef
[7] Lu, S.Y., Wang, S.H. and Zhang, Y.D. (2020) A Classification Method for Brain MRI via MobileNet and Feedforward Network with Random Weights. Pattern Recognition Letters, 140, 252-260. [Google Scholar] [CrossRef
[8] Vaiyapuri, T., Jaiganesh, M., Ahmad, S., et al. (2023) Ensemble Learning Driven Computer-Aided Diagnosis Model for Brain Tumor Classification on Magnetic Resonance Imaging. IEEE Access, 11, 91398-91406. [Google Scholar] [CrossRef
[9] Luo, H., Zhou, D., Cheng, Y., et al. (2024) MPEDA-Net: A Lightweight Brain Tumor Segmentation Network Using Multi-Perspective Extraction and Dense Attention. Biomedical Signal Processing and Control, 91, Article 106054. [Google Scholar] [CrossRef
[10] Mehta, S. and Rastegari, M. (2021) Mobilevit: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformer.
[11] Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. (2020) An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale.
[12] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L. (2018) Mobilenetv2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520. [Google Scholar] [CrossRef
[13] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 3-19. [Google Scholar] [CrossRef
[14] Loshchilov, I. and Hutter, F. (2016) SGDR: Stochastic Gradient Descent with Warm Restarts.
[15] Muezzinoglu, T., Baygin, N., Tuncer, I., Barua, P.D., Baygin, M., Dogan, S., et al. (2023) Patchresnet: Multiple Patch Division-Based Deep Feature Fusion Framework for Brain Tumor Classification Using MRI Images. Journal of Digital Imaging, 36, 973-987. [Google Scholar] [CrossRef] [PubMed]
[16] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[17] Huang, G., Liu, Z., Van Der Maaten, L. and Weinberger, K.Q. (2017) Densely Connected Convolutional Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 2261-2269. [Google Scholar] [CrossRef
[18] Zhang, X., Zhou, X., Lin, M. and Sun, J. (2018) Shufflenet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 6848-6856. [Google Scholar] [CrossRef
[19] Tan, M. and Le, Q.V. (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the 36th International Conference on Machine Learning, ICML 2019, Long Beach, 9-15 June 2019, 6105-6114.
[20] Howard, A., Sandler, M., Chen, B., Wang, W., Chen, L., Tan, M., et al. (2019) Searching for MobileNetV3. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, 27 October-2 November 2019, 1314-1324. [Google Scholar] [CrossRef