Computer Science ›› 2020, Vol. 47 ›› Issue (10): 256-262.doi: 10.11896/jsjkx.190800159

• Computer Network • Previous Articles     Next Articles

Inference Task Offloading Strategy Based on Differential Evolution

WANG Xuan, MAO Ying-chi, XIE Zai-peng, HUANG Qian   

  1. School of Computer and Information,Hohai University,Nanjing 211100,China
  • Received:2019-08-30 Revised:2019-11-21 Online:2020-10-15 Published:2020-10-16
  • About author:WANG Xuan,born in 1996,master candidate,is a student member of CCF.Her main research interests include edge computing and cloud computing.
    MAO Ying-chi,born in 1976,Ph.D,professor,is a senior member ofChina Computer Federation.Her main research interests include cloud computing and edge computing,mobile sensing systems and internet of things.
  • Supported by:
    National Key Technology Research and Development Program of the Ministry of Science and Technology of China (2018YFC0407105),Key Project of National Natural Science Foundation of China (61832005),Fundamental Research Funds for the Central Universities (2017B20914) and Key Technology Project of China Huaneng Group (HNKJ17-21)

Abstract: As an important technology of deep learning,convolutional Neural Network (CNN) has been widely used in intelligence applications.Due to the demand of CNN inference task for high computer memories and computation,most of the existing solutions are to offload tasks to the cloud for execution,which are hard to adapt to the time-delay sensitive mobile applications.To solve the above problem,this paper proposes a CNN inference task offloading strategy based on improved differential evolution algorithm,which can efficiently deploy computing tasks between cloud and edge devices using end-cloud collaboration mode.This strategy studies the task unloading scheme that minimizes the time delay under cost constraint.transforms the CNN inference process into a task graph and constructs it into a 0-1 integer programming problem,and finally uses the improved binary differential evolution algorithm to solve the problem so as to infer the optimal offloading policy.The experimental results show that,compared with mobile inference and cloud inference schemes,averagely,the proposed strategy can reduce the task response time by 33.60% and 6.06% respectively with cost constraints.

Key words: Collaborative inference, Computing offloading, Convolutional neural network, Differential evolution algorithm, Mobile cloud computing

CLC Number: 

  • TP393
[1]ZHOU F Y,JIN L P,DONG J.Review of Convolutional Neural Network[J].Chinese Journal of Computers,2017,40(6):1229-1251.
[2]PLASTIRAS G,TERZI M,KYRKOU C,et al.Edge Intelli-gence:Challenges and Opportunities of Near-Sensor Machine Learning Applications[C]//2018 IEEE 29th International Conference on Application-specific Systems,Architectures and Processors (ASAP).IEEE,2018:1-7.
[3]GUO T.Cloud-based or On-device:An Empirical Study of Mobile Deep Inference[C]//2018 IEEE International Conference on Cloud Engineering (IC2E).IEEE,2018:184-190.
[4]KANG Y,HAUSWALD J,GAO C,et al.Neurosurgeon:Collaborative intelligence between the cloud and mobile edge[J].ACM SIGplan Notices,2017,52(4):615-629.
[5]HUANG Y T,MA Y Q,FAN X Y,et al.When Deep Learning Meets Edge Computing[C]//2017 IEEE 25th International Conference on Network Protocols (ICNP).IEEE Computer Society,2017:1-2.
[6]LI E,ZHOU Z,CHEN X.Edge intelligence:On-demand deeplearning model co-inference with device-edge synergy[C]//Proceedings of the 2018 Workshop on Mobile Edge Communications.ACM,2018:31-36.
[7]ZHANG Q,ZHANG H L.Research progress of task offloading technologies in mobile cloud computing[J].Intelligent Computer and Applications,2016,6(6):1-4.
[8]GOYAL S,CARTER J.A lightweight secure cyber foraging infrastructure for resource-constrained devices[C]//Sixth IEEE Workshop on Mobile Computing Systems and Applications.IEEE,2004:186-195.
[9]HUANG D,WANG P,NIYATO D.A dynamic offloading algorithm for mobile computing[J].IEEE Transactions on Wireless Communications,2012,11(6):1991-1995.
[10]BARRAMEDA J,SAMAAN N.A novel application model and an offloading mechanism for efficient mobile computing[C]//2014 IEEE 10th International Conference on Wireless and Mobile Computing,Networking and Communications (WiMob).IEEE,2014:419-426.
[11]ESHRATIFAR A E,PEDRAM M.Energy and performance efficient computation offloading for deep neural networks in a mobile cloud computing environment[C]//Proceedings of the 2018 on Great Lakes Symposium on VLSI.ACM,2018:111-116.
[12]ZANNAT H,HOSSAIN M S.A hybrid framework using Markov decision process for mobile code offloading[C]//2016 19th International Conference on Computer and Information Technology (ICCIT).IEEE,2016:31-35.
[13]ENZAI N I M,TANG M.A heuristic algorithm for multi-site computation offloading in mobile cloud computing[J].Procedia Computer Science,2016,80:1232-1241.
[14]JEONG H J,JEONG I C,LEE H J,et al.Computation Offloading for Machine Learning Web Apps in the Edge Server Envi-
ronment[C]//2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).IEEE Computer Society,2018:1492-1499.
[15]GUO K,YANG M,ZHANG Y,et al.An Efficient Dynamic Offloading Approach based on Optimization Technique for Mobile Edge Computing[C]//2018 6th IEEE International Conference on Mobile Cloud Computing,Services,and Engineering (MobileCloud).IEEE,2018:29-36.
[16]QIAN H,ANDREAEN D.Reducing mobile device energy consumption with computation offloading[C]//2015 IEEE/ACIS 16th International Conference on Software Engineering,Artificial Intelligence,Networking and Parallel/Distributed Computing (SNPD).IEEE,2015:1-8.
[17]ESHRATIFAR A E,ABRISHAMI M S,PEDRAM M.JointDNN:An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services[J/OL].https://arxiv.org/pdf/1801.08618.
[18]HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[19]SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:1-9.
[20]ZHANG W,WEN Y,WU D O.Energy-efficient scheduling policy for collaborative execution in mobile cloud computing[C]//2013 Proceedings Ieee Infocom.IEEE,2013:190-194.
[21]CHU C A,LIN W,XIAO H.An adaptive genetic algorithmbased on weighted hamming distance[J].Journal of South Normal University (Natural Science Edition),2015,47(6):121-127.
[22]CHEN H,FAN Y R,DENG S G.Adaptive differential evolution algorithm based on logistic model[J].Control and Decision,2011,26(7):1105-1108.
[23]WU L H,WANG Y N,YUAN X F,et al.Differential Evolution Algorithm with Adaptive Second Mutation[J].Control and Decision,2006,21(8):898-902.
[24]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[J].Communications of the ACM,2017,60(6):84-90.
[1] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[2] CHEN Yong-quan, JIANG Ying. Analysis Method of APP User Behavior Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(8): 78-85.
[3] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[4] DAI Zhao-xia, LI Jin-xin, ZHANG Xiang-dong, XU Xu, MEI Lin, ZHANG Liang. Super-resolution Reconstruction of MRI Based on DNGAN [J]. Computer Science, 2022, 49(7): 113-119.
[5] LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.
[6] XU Ming-ke, ZHANG Fan. Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition [J]. Computer Science, 2022, 49(7): 132-141.
[7] LI Dan-dan, WU Yu-xiang, ZHU Cong-cong, LI Zhong-kang. Improved Sparrow Search Algorithm Based on A Variety of Improved Strategies [J]. Computer Science, 2022, 49(6A): 217-222.
[8] LIU Bao-bao, YANG Jing-jing, TAO Lu, WANG He-ying. Study on Prediction of Educational Statistical Data Based on DE-LSTM Model [J]. Computer Science, 2022, 49(6A): 261-266.
[9] YANG Yue, FENG Tao, LIANG Hong, YANG Yang. Image Arbitrary Style Transfer via Criss-cross Attention [J]. Computer Science, 2022, 49(6A): 345-352.
[10] YANG Jian-nan, ZHANG Fan. Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure [J]. Computer Science, 2022, 49(6A): 353-357.
[11] ZHANG Jia-hao, LIU Feng, QI Jia-yin. Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer [J]. Computer Science, 2022, 49(6A): 370-377.
[12] WANG Jian-ming, CHEN Xiang-yu, YANG Zi-zhong, SHI Chen-yang, ZHANG Yu-hang, QIAN Zheng-kun. Influence of Different Data Augmentation Methods on Model Recognition Accuracy [J]. Computer Science, 2022, 49(6A): 418-423.
[13] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[14] WU Zi-bin, YAN Qiao. Projected Gradient Descent Algorithm with Momentum [J]. Computer Science, 2022, 49(6A): 178-183.
[15] ZHAO Zheng-peng, LI Jun-gang, PU Yuan-yuan. Low-light Image Enhancement Based on Retinex Theory by Convolutional Neural Network [J]. Computer Science, 2022, 49(6): 199-209.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!