基于可变形卷积AlexNet与软注意力机制的皮肤病变识别算法
Skin Disease Identification Algorithm Based on Deformable Convolutional AlexNet and Soft Attention Mechanism
摘要: 为解决皮肤病识别领域中数据集类别不平衡、模型复杂度高以及准确率低的问题,提出了一种基于可变形卷积AlexNet与软注意力机制的皮肤病变识别算法。首先,提出改进的可变形卷积AlexNet网络模型,提高模型辨析力的同时,降低了模型的参数量,加快模型的训练和测试效率。然后,在改进的模型中集成了软注意力机制,使模型聚焦于皮肤病的关键特征区域,优化模型的特征提取和识别能力。最后,提出了一种联合损失函数,对焦点损失函数与交叉熵损失函数进行加权,聚焦于困难样本和易出错样本,解决因数据集类别不平衡而导致的网络朝着错误方向收敛的问题。在公开数据集进行实验,主观和客观的实验结果表明,提出算法在七种不同类别的皮肤病识别准确率高于对比算法,具有较强的鲁棒性和泛化能力。
Abstract: To address the issues of dataset class imbalance, high model complexity, and low accuracy in the field of skin disease identification, this paper proposes a skin lesion recognition algorithm based on deformable convolutional AlexNet and soft attention mechanisms. First, an improved deformable convolutional AlexNet network model is introduced, which enhances the model’s discriminative power while reducing the number of model parameters, thereby speeding up the model’s training and testing efficiency. Subsequently, a soft attention mechanism is integrated into the improved model, focusing the model on key feature areas of skin diseases to optimize its feature extraction and recognition capabilities. Finally, a joint loss function is proposed, which applies weights to the focal loss function and cross-entropy loss function, concentrating on difficult and error-prone samples to solve the issue of network convergence in the wrong direction due to dataset class imbalance. Experiments on public datasets, with both subjective and objective results, demonstrate that the proposed algorithm achieves higher accuracy in identifying seven different categories of skin diseases compared to benchmark algorithms, exhibiting strong robustness and generalizability.
文章引用:曹迅, 冯艳玲, 马昭鹏, 胡铭铭. 基于可变形卷积AlexNet与软注意力机制的皮肤病变识别算法[J]. 计算机科学与应用, 2024, 14(5): 229-238. https://doi.org/10.12677/csa.2024.145131

参考文献

[1] SCF (2022) Skin Cancer Foundation.
https://www.skincancer.org/
[2] CDC (2022) Centers for Disease Control and Prevention.
https://www.cdc.gov/cancer/skin/
[3] Medhat, S., Abdel-Galil, H., Aboutabl, A.E., et al. (2023) Iterative Magnitude Pruning-Based Light-Version of AlexNet for Skin Cancer Classification. Neural Computing and Applications, 36, 1413-1428. [Google Scholar] [CrossRef
[4] Hoang, L., Lee, S.H., Lee, E.J., et al. (2022) Multiclass Skin Lesion Classification Using a Novel Lightweight Deep Learning Framework for Smart Healthcare. Applied Sciences, 12, Article 2677. [Google Scholar] [CrossRef
[5] Raghavendra, P.V.S.P., Charitha, C., Begum, K.G., et al. (2023) Deep Learning-Based Skin Lesion Multi-Class Classification with Global Average Pooling Improvement. Journal of Digital Imaging, 36, 2227-2248. [Google Scholar] [CrossRef] [PubMed]
[6] Yan, P., Wang, G., Chen, J., et al. (2023) Skin Lesion Classification Based on the VGG‐16 Fusion Residual Structure. International Journal of Imaging Systems and Technology, 33, 53-68. [Google Scholar] [CrossRef
[7] Wei, Z., Li, Q. and Song, H. (2022) Dual Attention Based Network for Skin Lesion Classification with Auxiliary Learning. Biomedical Signal Processing and Control, 74, Article 103549. [Google Scholar] [CrossRef
[8] Zhang, J.P., et al. (2019) Attention Residual Learning for Skin Lesion Classification. IEEE Transactions on Medical Imaging, 38, 2092-2103. [Google Scholar] [CrossRef
[9] Carcagnì, P., Leo, M., Cuna, A., et al. (2019) Classification of Skin Lesions by Combining Multilevel Learnings in a DenseNet Architecture. Image Analysis and ProcessingICIAP 2019: 20th International Conference, Trento, Italy, 9-13 September 2019, 335-344. [Google Scholar] [CrossRef
[10] Nguyen, V.D., Bui, N.D. and Do, H.K. (2022) Skin Lesion Classification on Imbalanced Data Using Deep Learning with Soft Attention. Sensors, 22, Article 7530. [Google Scholar] [CrossRef] [PubMed]
[11] Dai, J.F., et al. (2017) Deformable Convolutional Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017. [Google Scholar] [CrossRef
[12] Shaikh, M.A., et al. (2020) Attention Based Writer Independent Verification. 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR). Dortmund, 8-10 September 2020. [Google Scholar] [CrossRef
[13] Philipp, T., Rosendahl, C. and Kittler. H. (2018) The HAM10000 Dataset, a Large Collection of Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions. Scientific Data, 5, Article No. 180161. [Google Scholar] [CrossRef] [PubMed]
[14] Selvaraju, R.R., et al. (2017) Grad-Cam: Visual Explanations from Deep Networks via Gradient-Based Localization. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017. [Google Scholar] [CrossRef