一种自适应的分布式深度神经网络推理框架

doi:10.12677/mos.2024.134402

期刊菜单

一种自适应的分布式深度神经网络推理框架
An Adaptive Distributed Deep Neural Network Inference Framework

DOI: 10.12677/mos.2024.134402, PDF, 国家自然科学基金支持
作者: 吴宾宾, 杨桂松^*：上海理工大学光电信息与计算机工程学院，上海
关键词: 深度神经网络；物联网；分布式深度神经网络；特征融合；Deep Neural Network； Internet of Things； Distributed Deep Neural Network； Feature Fusion

摘要: 近年来，随着深度学习的发展，深度神经网络(Deep Neural Network, DNN)模型变得越来越复杂，所需的内存和数据传输量也随之增大，这不仅降低了DNN的训练和推理速度，也限制了DNN在一些内存较小、计算能力较差的物联网(Internet of Things, IoT)设备上的部署。现有研究将基于云–边–端协同的分布式计算框架与深度神经网络相结合，组成了分布式深度神经网络(Distributed Deep Neural Network, DDNN)框架，该框架在IoT应用场景下有着显著的优势。然而，DDNN框架存在设备的计算能力有限、以及设备之间的传输成本较高等问题。针对上述问题，本文提出了自适应的分布式深度神经网络(Adaptive Distributed Deep Neural Network, ADA-DDNN)推理框架。ADA-DDNN框架采用了多个边缘出口，这些边缘出口允许ADA-DDNN框架中的模型在不同的深度层次上进行自适应地推理，以适应不同的任务需求和数据特性。此外，该框架增加了额外的边缘处理模块，边缘处理模块可以在边缘端进行特征融合之前，判断每个终端模块的输出结果是否可信，若可信，则直接输出分类结果，无需进行特征融合和后续计算。这大大增加了样本的边缘出口概率，减少了后续的计算成本。本文在开放的CIFAR-10数据集上进行验证，实验结果表明，ADD-DDNN框架在保证云端测试精度的前提下，显著提升了边缘测试精度。

Abstract: In recent years, the development of deep learning has led to an increase in the complexity of deep neural network (DNN) models, accompanied by a proportional increase in the amount of memory and data transfer required. This has a detrimental effect on the training and inference speed of DNNs, as well as limiting their deployment on some Internet of Things (IoT) devices with limited memory and computational capabilities. Existing research combines a distributed computing framework based on cloud-edge-end collaboration with deep neural networks to form the Distributed Deep Neural Network (DDNN) framework, which has significant advantages in IoT application scenarios. However, the DDNN framework suffers from the problems of limited computational power of devices, as well as high transmission cost between devices. To address these issues, this paper proposes the Adaptive Distributed Deep Neural Network (ADA-DDNN) inference framework. The ADA-DDNN framework employs multiple edge exits, which allow the models in the ADA-DDNN framework to perform different depth levels of adaptive reasoning to accommodate different task requirements and data characteristics. Furthermore, the framework incorporates an additional edge processing module, which is responsible for evaluating the credibility of the output results produced by each terminal module. If the results are deemed credible, the classification results are directly outputted without feature fusion and subsequent computation. This significantly increases the probability of the sample being outputted at the edge and reduces the subsequent computation cost. The paper validates the ADD-DDNN framework on the open CIFAR-10 dataset. The experimental results demonstrate that the framework significantly improves edge testing accuracy while maintaining the same level of testing accuracy in the cloud.

文章引用：吴宾宾, 杨桂松. 一种自适应的分布式深度神经网络推理框架[J]. 建模与仿真, 2024, 13(4): 4449-4459. https://doi.org/10.12677/mos.2024.134402

参考文献

[1]	Khan, A., Sohail, A., Zahoora, U. and Qureshi, A.S. (2020) A Survey of the Recent Architectures of Deep Convolutional Neural Networks. Artificial Intelligence Review, 53, 5455-5516. [Google Scholar] [CrossRef]
[2]	郑远攀, 李广阳, 李晔. 深度学习在图像识别中的应用研究综述[J]. 计算机工程与应用, 2019, 55(12): 20-36.
[3]	圣文顺, 孙艳文. 卷积神经网络在图像识别中的应用[J]. 软件工程, 2019, 22(2): 13-16.
[4]	Lauriola, I., Lavelli, A. and Aiolli, F. (2022) An Introduction to Deep Learning in Natural Language Processing: Models, Techniques, and Tools. Neurocomputing, 470, 443-456. [Google Scholar] [CrossRef]
[5]	Hema, C. and Garcia Marquez, F.P. (2023) Emotional Speech Recognition Using CNN and Deep Learning Techniques. Applied Acoustics, 211, Article ID: 109492. [Google Scholar] [CrossRef]
[6]	Li, J. (2022) Recent Advances in End-to-End Automatic Speech Recognition. APSIPA Transactions on Signal and Information Processing, 11, e8. [Google Scholar] [CrossRef]
[7]	张瑞珍, 韩跃平, 张晓通. 基于深度LSTM的端到端的语音识别[J]. 中北大学学报: 自然科学版, 2020, 41(3): 244-248.
[8]	Qi, C., Shen, S., Li, R., Zhao, Z., Liu, Q., Liang, J., et al. (2021) An Efficient Pruning Scheme of Deep Neural Networks for Internet of Things Applications. EURASIP Journal on Advances in Signal Processing, 2021, Article No. 21. [Google Scholar] [CrossRef]
[9]	Xu, H., Ho, C.Y., Abdelmoniem, A.M., et al. (2020) Compressed Communication for Distributed Deep Learning: Survey and Quantitative Evaluation. http://hdl.handle.net/10754/662495
[10]	Ren, J., He, Y., Yu, G. and Li, G.Y. (2019) Joint Communication and Computation Resource Allocation for Cloud-Edge Collaborative System. 2019 IEEE Wireless Communications and Networking Conference (WCNC), Marrakesh, 15-18 April 2019, 1-6. [Google Scholar] [CrossRef]
[11]	Heigold, G., Vanhoucke, V., Senior, A., Nguyen, P., Ranzato, M., Devin, M., et al. (2013) Multilingual Acoustic Models Using Distributed Deep Neural Networks. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, 26-31 May 2013, 8619-8623. [Google Scholar] [CrossRef]
[12]	Leroux, S., Bohez, S., De Coninck, E., Verbelen, T., Vankeirsbilck, B., Simoens, P., et al. (2017) The Cascading Neural Network: Building the Internet of Smart Things. Knowledge and Information Systems, 52, 791-814. [Google Scholar] [CrossRef]
[13]	Ding, C., Zhou, A., Liu, Y., Chang, R.N., Hsu, C. and Wang, S. (2022) A Cloud-Edge Collaboration Framework for Cognitive Service. IEEE Transactions on Cloud Computing, 10, 1489-1499. [Google Scholar] [CrossRef]
[14]	Chen, J. and Ran, X. (2019) Deep Learning with Edge Computing: A Review. Proceedings of the IEEE, 107, 1655-1674. [Google Scholar] [CrossRef]
[15]	Ali, M., Anjum, A., Yaseen, M.U., Zamani, A.R., Balouek-Thomert, D., Rana, O., et al. (2018) Edge Enhanced Deep Learning System for Large-Scale Video Stream Analytics. 2018 IEEE 2nd International Conference on Fog and Edge Computing (ICFEC), Washington DC, 1-3 May 2018, 1-10. [Google Scholar] [CrossRef]
[16]	Ongati, F. and Muchemi, D.E. (2019) Big Data Intelligence Using Distributed Deep Neural Networks. arXiv: 1909.02873.
[17]	Yang, S., Zhang, Z., Zhao, C., Song, X., Guo, S. and Li, H. (2022) CNNPC: End-Edge-Cloud Collaborative CNN Inference with Joint Model Partition and Compression. IEEE Transactions on Parallel and Distributed Systems, 33, 4039-4056. [Google Scholar] [CrossRef]
[18]	Teerapittayanon, S., McDanel, B. and Kung, H.T. (2016) BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks. 2016 23rd International Conference on Pattern Recognition (ICPR), Cancun, 4-8 December 2016, 2464-2469. [Google Scholar] [CrossRef]
[19]	Teerapittayanon, S., McDanel, B. and Kung, H.T. (2017) Distributed Deep Neural Networks over the Cloud, the Edge and End Devices. 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS), Atlanta, 5-8 June 2017, 328-339. [Google Scholar] [CrossRef]
[20]	Krizhevsky, A. and Hinton, G. (2009) Learning Multiple Layers of Features from Tiny Images. Handbook of Systemic Autoimmune Diseases, 1. https://api.semanticscholar.org/CorpusID:18268744

为你推荐

友情链接