计算机科学 ›› 2025, Vol. 52 ›› Issue (4): 85-93.doi: 10.11896/jsjkx.241000097
李洲诚, 张毅, 孙晋
LI Zhoucheng, ZHANG Yi, SUN Jin
摘要: 边缘智能作为一种新型的智能计算范式,能够有效提升智能推理任务在嵌入式边缘设备中的响应速度。而信息年龄(AoI)作为衡量数据时效性的重要指标,对于边缘智能应用的计算资源开销和实时响应至关重要。针对多出口深度神经网络(DNN)的资源配置优化问题,考虑出口退出概率造成的AoI随机不确定性,引入系统AoI的概率约束,基于随机优化理论对出口设置进行决策,以最小化多出口DNN的资源开销。文中提出了一种基于布谷鸟搜索的元启发式算法对所构建的具有概率约束的随机优化问题进行求解,基于各出口的退出概率预测系统AoI的统计分布,根据给定的AoI阈值计算相应的资源消耗量并将其作为布谷鸟个体的适应度值,迭代更新布谷鸟种群并搜索得到最小计算资源开销的出口设置方案。针对多种DNN模型的实验结果表明,与确定性的优化方法相比,随机优化方法能够获得更佳的出口设置决策,在满足AoI概率约束的前提下显著降低了DNN的计算开销。
中图分类号:
[1]ZHOU Z,CHEN X,LI E,et al.Edge intelligence:Paving the last mile of artificial intelligence with edge computing[J].Proceedings of the IEEE,2019,107(8):1738-1762. [2]DENG S,ZHAO H,FANG W,et al.Edge intelligence:The confluence of edge computing and artificial intelligence[J].IEEE Internet of Things Journal,2020,7(8):7457-7469. [3]WANG X,HAN Y,WANG C,et al.In-edge AI:Intelligentizing mobile edge computing,caching and communication by federated learning[J].IEEE Network,2019,33(5):156-165. [4]SHUVO M M H,ISLAM S K,CHENG J,et al.Efficient acceleration of deep learning inference on resource-constrained edge devices:A review[J].Proceedings of the IEEE,2022,111(1):42-91. [5]CUI W,ZHAO H,CHEN Q,et al.DVABatch:Diversity-aware multi-entry multi-exit batching for efficient processing of DNN services on GPUs[C]//USENIX Annual Technical Conference.USENIX,2022:183-198. [6]DONG F,WANG H,SHEN D,et al.Multi-exit DNN inference acceleration based on multi-dimensional optimization for edge intelligence[J].IEEE Transactions on Mobile Computing,2022,22(9):5389-5405. [7]ZHANG S,CUI W,CHEN Q,et al.PAME:Precision-awaremulti-exit DNN serving for reducing latencies of batched inferences[C]//ACM International Conference on Supercomputing.ACM,2022:1-12. [8]KAUL S,YATES R,GRUTESER M.Real-time status:Howoften should one update?[C]//International Conference on Computer Communications.IEEE,2012:2731-2735. [9]XU C,XU Q,WANG J,et al.AoI-centric task scheduling for autonomous driving systems[C]//International Conference on Computer Communications.IEEE,2022:1019-1028. [10]SORKHOH I,ASSI C,EBRAHIMI D,et al.Optimizing infor-mation freshness for MEC-enabled cooperative autonomous driving[J].IEEE Transactions on Intelligent Transportation Systems,2021,23(8):13127-13140. [11]MA M,WANG Z,GUO S,et al.Cloud-edge framework for AoI-efficient data processing in multi-UAV-assisted sensor networks[J].IEEE Internet of Things Journal,2024,11(14):25251-25267. [12]KADOTA I,SINHA A,MODIANO E.Scheduling algorithmsfor optimizing age of information in wireless networks with throughput constraints[J].IEEE/ACM Transactions on Networking,2019,27(4):1359-1372. [13]ZHANG J,XIN W,LV D,et al.Multi-exit DNN inference acceleration for intelligent terminal with heterogeneous processors[J].Sustainable Computing:Informatics and Systems,2023,40:100906. [14]KAYA Y,HONG S,DUMITRAS T.Shallow-deep networks:Understanding and mitigating network overthinking[C]//International Conference on Machine Learning.ACM,2019:3301-3310. [15]LASKARIDIS S,VENIERIS S I,ALMEIDA M,et al.SPINN:Synergistic progressive inference of neural networks over device and cloud[C]//International Conference on Mobile Computing and Networking.ACM,2020:1-15. [16]JU W,BAO W,YUAN D,et al.Learning early exit for deepneural network inference on mobile devices through multi-armed bandits[C]//International Symposium on Cluster,Cloud and Internet Computing.IEEE/ACM,2021:11-20. [17]JU W,BAO W,GE L,et al.Dynamic early exit scheduling for deep neural network inference through contextual bandits[C]//International Conference on Information & Knowledge Management.ACM,2021:823-832. [18]WANG X,NING Z,GUO S,et al.Minimizing the age-of-critical-information:An imitation learning-based scheduling approach under partial observations[J].IEEE Transactions on Mobile Computing,2021,21(9):3225-3238. [19]LI C,LIU Q,LI S,et al.Scheduling with age of information guarantee[J].IEEE/ACM Transactions on Networking,2022,30(5):2046-2059. [20]LI C,LI S,LIU Q,et al.Eywa:A general approach for scheduler design in AoI optimization [C]//International Conference on Computer Communications.2023:1-9. [21]LIN L,JU L,XUE C J,et al.Work or sleep:Freshness-aware energy scheduling for wireless powered communication networks with interference consideration[C]//Design Automation Conference.ACM/IEEE,2023:1-6. [22]YANG X S,DEB S.Cuckoo search via Lévy flights[C]//World Congress on Nature & Biologically Inspired Computing.IEEE,2009:210-214. [23]VISWANATHAN G M,AFANASYEV V,BULDYREV S V,et al.Lévy flights in random searches[J].Physica A:Statistical Mechanics and its Applications,2000,282(1/2):1-12. [24]MANTEGNA R N.Fast,accurate algorithm for numerical simulation of Lévy stable stochastic processes[J].Physical Review E,1994,49(5):4677. [25]GREFENSTETTE J J.Optimization of control parameters forgenetic algorithms[J].IEEE Transactions on Systems Man & Cybernetics,1986,16(1):122-128. [26]MO Y B,MA Y Z,ZHENG Q Y.Optimal choice of parameters for firefly algorithm[C]//International Conference on Digital Manufacturing & Automation.2013:887-892. [27]WANG D F,MENG L.Performance analysis and parameter selection of PSO algorithms[J].ACTA AUTOMATICA SINICA,2016,42(10):1552-1561. [28]STAHLEL,WOLD S.Analysis of variance(ANOVA)[J].Chemometrics and Intelligent Laboratory Systems,1989,6(4):259-272. |
|