基于神经网络架构搜索的语义分割方法
Semantic Segmentation Method Based on Neural Architecture Search
DOI: 10.12677/AAM.2023.128357, PDF,    科研立项经费支持
作者: 朱 烜, 马中华*:天津职业技术师范大学理学院,天津
关键词: 神经网络架构搜索语义分割自动化深度学习Neural Architecture Search Semantic Segmentation Automated Deep Learning
摘要: 神经网络架构搜索旨在使用搜索策略在给定的搜索空间上让算法自动搜索出网络结构模型以减少人工设计网络的任务量,拓展神经网络架构搜索在语义分割领域的应用对自动化深度学习领域的研究有重要意义。通过设计U型搜索空间,将可微分神经网络架构搜索策略应用于语义分割模型。实验结果显示,在The Oxford-IIIT Pet数据集搜索得到的网络与基准网络UNet相比,搜索出的网络模型mIOU提高了14.1%,分割的效果更加显著,轮廓边界更加清晰。将搜索出来的网络迁移到Camvid数据集上进行测试,比基准网络实验精度提升了20.5%。研究表明,神经网络架构搜索与语义分割的结合在自动化深度学习领域的研究中具有重要意义,能够使语义分割模型获得更优秀的性能。
Abstract: The objective of Neural Architecture Search (NAS) is to use a search strategy to automatically find network structure models within a given search space, thereby reducing the task load of manually designing networks. Expanding the application of NAS in the field of semantic segmentation bears significant importance for research in automated deep learning. A U-shaped search space was de-signed, and a differentiable NAS strategy was applied to a semantic segmentation model. Experi-mental results showed that the network found on The Oxford-IIIT Pet dataset outperformed the benchmark UNet network model, with a Mean Intersection over Union (mIOU) increase of 14.1%, and produced more prominent segmentation results with clearer contour boundaries. When the discovered network was transferred to the Camvid dataset for testing, it surpassed the benchmark network experimental accuracy by 20.5%. This study demonstrated that the integration of NAS and semantic segmentation holds significant importance in the field of automated deep learning re-search. This approach enables semantic segmentation models to achieve superior performance.
文章引用:朱烜, 马中华. 基于神经网络架构搜索的语义分割方法[J]. 应用数学进展, 2023, 12(8): 3587-3597. https://doi.org/10.12677/AAM.2023.128357

参考文献

[1] Sengupta, A., Ye, Y., Wang, R., Liu, C. and Roy, K. (2019) Going Deeper in Spiking Neural Networks: VGG and Re-sidual Architectures. Frontiers in Neuroscience, 13, Article 95. [Google Scholar] [CrossRef] [PubMed]
[2] Targ, S., Almeida, D. and Lyman, K. (2016) Resnet in Resnet: Generalizing Residual Architectures. ArXiv Preprint ArXiv: 1603.08029.
[3] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science, Vol. 9351, Springer, Cham, 234-241. [Google Scholar] [CrossRef
[4] Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K. and Yuille, A.L. (2017) DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 834-848. [Google Scholar] [CrossRef
[5] Elsken, T., Metzen, J.H., Hutter, F. (2019) Neural Architecture Search: A Survey. The Journal of Machine Learning Research, 20, 1997-2017.
[6] Zoph, B. and Le, Q.V. (2016) Neural Architecture Search with Reinforcement Learning. ArXiv Preprint ArXiv: 1611.01578.
[7] Liu, H., Simonyan, K. and Yang, Y. (2018) DARTS: Differentiable Architecture Search. ArXiv Preprint ArXiv: 1806.09055.
[8] Ng, P.C. and Henikoff, S. (2003) SIFT: Predicting Amino Acid Changes That Affect Protein Function. Nucleic Acids Research, 31, 3812-3814. [Google Scholar] [CrossRef] [PubMed]
[9] Pang, Y., Yuan, Y., Li, X. and Pan, J. (2011) Efficient HOG Human Detection. Signal Processing, 91, 773-781. [Google Scholar] [CrossRef
[10] Villa, M., Dardenne, G., Nasan, M., et al. (2018) FCN-Based Approach for the Automatic Segmentation of Bone Surfaces in Ultrasound Images. International Journal of Computer Assisted Radiology and Surgery, 13, 1707-1716. [Google Scholar] [CrossRef] [PubMed]
[11] Badrinarayanan, V., Kendall, A. and Cipolla, R. (2017) SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 2481-2495. [Google Scholar] [CrossRef
[12] Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L.-C. (2018) Mobilenetv2: Inverted Residuals and Linear Bottlenecks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 4510-4520. [Google Scholar] [CrossRef
[13] Parkhi, O.M., Vedaldi, A., Zisserman, A. and Jawahar, C.V. (2012) Cats and Dogs. 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, 16-21 June 2012, 3498-3505. [Google Scholar] [CrossRef
[14] Brostow, G.J., Fauqueur, J. and Cipolla, R. (2009) Semantic Object Classes in Video: A High-Definition Ground Truth Database. Pattern Recognition Letters, 30, 88-97. [Google Scholar] [CrossRef
[15] Li, X., Sun, X., Meng, Y., et al. (2020) Dice Loss for Da-ta-Imbalanced NLP Tasks. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 5-10 July 2020, 465-476. [Google Scholar] [CrossRef
[16] Lin, T.-Y., Dollár, P., Girshick, R., et al. (2017) Feature Pyra-mid Networks for Object Detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Hon-olulu, 21-26 July 2017, 936-944. [Google Scholar] [CrossRef
[17] Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N. and Liang, J. (2019) UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation. IEEE Transactions on Medical Imaging, 39, 1856-1867. [Google Scholar] [CrossRef
[18] Zheng, S., Lu, J., Zhao, H., et al. (2021) Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 6877-6886. [Google Scholar] [CrossRef
[19] Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. and Ad-am, H. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer Vision—ECCV 2018. ECCV 2018. Lecture Notes in Com-puter Science, Vol. 11211, Springer, Cham, 833-851. [Google Scholar] [CrossRef