Computer Science ›› 2022, Vol. 49 ›› Issue (11): 266-276.doi: 10.11896/jsjkx.211000067

• Computer Network • Previous Articles     Next Articles

Workload Characteristics Based Performance Optimization for Edge Intelligence

HU Zhao-xia1, HU Hai-zhou1, JIANG Cong-feng1and WAN Jian2   

  1. 1 School of Computer Science and Technology,Hangzhou Dianzi University,Hangzhou 310018,China
    2 School of Information and Electronic Engineering,Zhejiang University of Science and Technology,Hangzhou 310023,China
  • Received:2021-10-10 Revised:2022-06-22 Online:2022-11-15 Published:2022-11-03
  • About author:HU Zhao-xia,born in 1997,postgra-duate.Her main research interests include data center scheduling,and edge computing.
    JIANG Cong-feng,born in 1980,Ph.D,professor,is a member of China Computer Federation.His main research interests include edge computing,system optimization,performance evaluation and distributed system benchmarking.
  • Supported by:
    General Program of National Natural Science Foundation of China(61972118,61972358).

Abstract: Edge intelligence refers to a form of service that uses artificial intelligence algorithms to provide data analysis capabilities for network edge devices.However,the edge computing environment is more complex and changeable than cloud computing.There are many problems in the process of building edge intelligence,such as the lack of quantitative evaluation standards,heterogeneous computing platforms,complex network topologies,and changing user needs.Among them,the more prominent is the contradiction between the high resource demand of the algorithm model and the low resource reserve of edge devices.Machine lear-ning is the main workload of edge intelligence.It requires a lot of computing resources.However,the computing resources of edge devices are limited,and the supply and demand between the two do not match.The deployment and optimization of edge intelligent load has become a problem.Therefore,in response to the problem of edge intelligent load performance optimization,this paper proposes cloud-edge collaborative inference(CECI) based on load characteristics,which is optimized for different machine learning loads in terms of model selection,batch adaptive adjustment and cloud-side collaboration.In terms of model selection,a model adaptive selection strategy based on target weights is used to comprehensively weigh the effects of multiple performance optimization targets under multiple constraints.In the aspect of batch adaptive adjustment,a batch adaptive adjustment algorithm based on overhead feedback is proposed,so that the model can achieve better performance at runtime.In terms of cloud-side collaboration,a cloud-side collaboration strategy is designed by combining network status and user delay requirements to achieve the effect of dynamic utilization of cloud computing resources.Experimental results show that compared with cloud intelligence,the edge intelligence based on load characteristics proposed in this paper can reduce program running time by 50.79%,reduce system energy consumption by 42.46%,and improve model accuracy by 4.52%.

Key words: Edge intelligence, Cloud-edge collaborative, Edge computing, Workload recognition, Model selection

CLC Number: 

  • TP391
[1]LU Y,CHEN Y,LI T,et al.Embedded FPGA convolutionalneuralnetwork construction method for edge computing[J].Journal of Computer Research and Development,2018,55(3):551-562.
[2]DWORK C,MCSHERRY F,NISSIM K,et al.Calibrating noise to sensitivity in private data analysis[C]//Theory of cryptography conference.2006:265-284.
[3]FREDRIKSON M,JHA S,RISTENPART T.Model inversionattacks that exploit confidence information and basic countermeasures[C]// Proceedings of the 22nd ACM SIGSAC Confe-rence on Computer and Communications Security.2015:1322-1333.
[4]DU M,WANG K,XIA Z,et al.Differential privacy preserving of training model in wireless big data with edge computing[J].IEEE transactions on big data,2018,6(2):283-295.
[5]LIU T T,YANG C Y,SUO S Q,et al.Edge Intelligence inWireless Communication[J].Signal Processing,2020,36(11):1789-1803.
[6]REN J,GAO L,YU J L,et al.Energy-efficient deep learning task scheduling strategy for edge devices[J].Chinese Journal of Computers,2020,43(3):440-452.
[7]BHATTACHARYA S,LANE N D.Sparsification and separation of deep learning layers for constrained resource inference on wearables[C]//Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems CD-ROM.2016:176-189.
[8]VALERIO L,PASSARELLA A,CONTI M.A communication efficient distributed learning framework for smart environments [J].Pervasive and Mobile Computing,2017,41:46-68.
[9]HSIEH K,HARLAP A,VIJAYKUMAR N,et al.Gaia:Geo-distributed machine learning approaching speeds[C]//14th Symposium on Networked Systems Design and Implementation(NSD).2017:629-647.
[10]OGDEN S S,GUO T.MODI:Mobile deep inference made efficient by edge computing[C]// Workshop on Hot Topics in Edge Computing(HotEdge 18).2018:30-36.
[11]HU C,BAO W,WANG D,et al.Dynamic adaptive DNN surgeryfor inference acceleration on the edge[C]//IEEE INFOCOM 2019-IEEE Conference on Computer Communications.2019:1423-1431.
[12]ZHAO Z,BARIJOUGH K M,GERSTLAUER A.DeepThings:Distributed adaptive deep learning inference on resource-constrained IoT edge clusters[J].IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2018,37(11):2348-2359.
[13]SHENG T,FENG C,ZHUO S,et al.A quantization-friendlyseparable convolution for mobilenets[C]//2018 1st Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications(EMC2).2018:14-18.
[14]SANTOS A G,DE SOUZA C O,ZANCHETTIN C,et al.Reducing squeezenet storage size with depthwise separable convolutions[C]//2018 International Joint Conference on Neural Networks(IJCNN).2018:1-6.
[15]CERUTTI G,PRASAD R,BRUTTI A,et al.Neural network distillation on IoT platforms for sound event detection[C]//Interspeech.2019:3609-3613.
[16]YAO S,ZHAO Y,ZHANG A,et al.DeepIoT:Compressing deep neural network structures for sensing systems with a compressor-critic framework[C]//Proceedings of the 15th ACM Confe-rence on Embedded Network Sensor Systems.2017:1-14.
[17]HAN S,KANG J,MAO H,et al.Ese:Efficient speech recognition engine with sparse lstm on fpga[C]//Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.2017:75-84.
[18]LIU S,LIN Y,ZHOU Z, et al.On-demand deep model compression for mobile devices:A usage-driven model selection framework[C]//Proceedings of the 16th Annual International Conference on Mobile Systems,Applications,and Services.2018:389-400.
[19]HARDY C,LE MERRER E,SERICOLA B.Distributed deep learning on edge-devices:Feasibility via adaptive compression[C]//2017 IEEE 16th International Symposium on Network Computing and Applications(NCA).2017:1-8.
[20]DEAN J,CORRADO G,MONGA R,et al.Large scale distributed deep networks[C]//Advances in Neural Information Processing Systems.2012:1223-1231.
[21]YU Z,HU J,MIN G,et al.Federated learning based proactive content caching in edge computing[C]// 2018 IEEE Global Communications Conference(GLOBECOM).2018:1-6.
[22]WANG S,TUOR T,SALONIDIS T,et al.When edge meets learning:Adaptive control for resource-constrained distributed machine learning[C]// IEEE INFOCOM 2018-IEEE Confe-rence on Computer Communications.2018:63-71.
[23]FANG B,ZENG X,ZHANG M.Nestdnn:Resource-aware multi-tenant on-device deep learning for continuous mobile vision[C]//Proceedings of the 24th Annual International Conference on Mobile Computing and Networking.2018:115-127.
[24]TAYLOR B,MARCO V S,WOLFF W,et al.Adaptive deeplearning model selection on embedded systems[J].ACM SIGPLAN Notices,2018,53(6):31-43.
[25]KANG Y,HAUSWALD J,GAO C,et al.Neurosurgeon:Colla-borative intelligence between the cloud and mobile edge[J].ACM SIGARCH Computer Architecture News,2017,45(1):615-629.
[26]MAO J,CHEN X,NIXON K W,et al.MODNN:Local distributed mobile computing system for deep neural network[C]// Design,Automation & Test in Europe Conference & Exhibition(DATE).2017:1396-1401.
[27]ImageNet website update[OL].2021-06-28.https://www.image-net.org/.
[1] LIU Xing-guang, ZHOU Li, LIU Yan, ZHANG Xiao-ying, TAN Xiang, WEI Ji-bo. Construction and Distribution Method of REM Based on Edge Intelligence [J]. Computer Science, 2022, 49(9): 236-241.
[2] SUN Hui-ting, FAN Yan-fang, MA Meng-xiao, CHEN Ruo-yu, CAI Ying. Dynamic Pricing-based Vehicle Collaborative Computation Offloading Scheme in VEC [J]. Computer Science, 2022, 49(9): 242-248.
[3] YU Bin, LI Xue-hua, PAN Chun-yu, LI Na. Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning [J]. Computer Science, 2022, 49(7): 248-253.
[4] LI Meng-fei, MAO Ying-chi, TU Zi-jian, WANG Xuan, XU Shu-fang. Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient [J]. Computer Science, 2022, 49(7): 271-279.
[5] YUAN Hao-nan, WANG Rui-jin, ZHENG Bo-wen, WU Bang-yan. Design and Implementation of Cross-chain Trusted EMR Sharing System Based on Fabric [J]. Computer Science, 2022, 49(6A): 490-495.
[6] FANG Tao, YANG Yang, CHEN Jia-xin. Optimization of Offloading Decisions in D2D-assisted MEC Networks [J]. Computer Science, 2022, 49(6A): 601-605.
[7] LIU Zhang-hui, ZHENG Hong-qiang, ZHANG Jian-shan, CHEN Zhe-yi. Computation Offloading and Deployment Optimization in Multi-UAV-Enabled Mobile Edge Computing Systems [J]. Computer Science, 2022, 49(6A): 619-627.
[8] XIE Wan-cheng, LI Bin, DAI Yue-yue. PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing [J]. Computer Science, 2022, 49(6): 3-11.
[9] ZHANG Hai-bo, ZHANG Yi-feng, LIU Kai-jian. Task Offloading,Migration and Caching Strategy in Internet of Vehicles Based on NOMA-MEC [J]. Computer Science, 2022, 49(2): 304-311.
[10] LIN Chao-wei, LIN Bing, CHEN Xing. Study on Scientific Workflow Scheduling Based on Fuzzy Theory Under Edge Environment [J]. Computer Science, 2022, 49(2): 312-320.
[11] YUAN Xin-wang, XIE Zhi-dong, TAN Xin. Survey of Resource Management Optimization of UAV Edge Computing [J]. Computer Science, 2022, 49(11): 234-241.
[12] LI Xiao-bo, CHEN Peng, SHUAI Bin, XIA Yun-ni, LI Jian-qi. Novel Predictive Approach to Trajectory-aware Online Edge Service Allocation in Edge Environment [J]. Computer Science, 2022, 49(11): 277-283.
[13] LIANG Jun-bin, ZHANG Hai-han, JIANG Chan, WANG Tian-shu. Research Progress of Task Offloading Based on Deep Reinforcement Learning in Mobile Edge Computing [J]. Computer Science, 2021, 48(7): 316-323.
[14] XUE Yan-fen, GAO Ji-mei, FAN Gui-sheng, YU Hui-qun, XU Ya-jie. Energy-aware Fault-tolerant Collaborative Task Execution Algorithm in Edge Computing [J]. Computer Science, 2021, 48(6A): 374-382.
[15] SONG Hai-ning, JIAO Jian, LIU Yong. Research on Mobile Edge Computing in Expressway [J]. Computer Science, 2021, 48(6A): 383-386.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!