计算机科学 ›› 2020, Vol. 47 ›› Issue (11): 316-321.doi: 10.11896/jsjkx.200400075
张昊, 管昕洁, 白光伟
ZHANG Hao, GUAN Xin-jie, BAI Guang-wei
摘要: 无线传感器网络在环境感知、目标跟踪等方面占据了重要地位。为了能够及时地为传感器节点补充能量,提出了一种基于强化学习的低功耗、高能效的移动路径充电算法。无线传感器网络采用移动充电车对传感器节点进行充电,将Q-Learning算法与epsilon-greedy算法相结合,以最短路径依次完成所有传感器节点的充电。现有的相关研究通常忽略了传感器节点自身所能承受电量的最大值,容易导致传感器节点因充电过程中电量超出最大值而暂停工作,因此限制了移动充电车的充电时间。结果表明,所提移动充电策略的效用更高,与传统的Q-Learning算法和贪心算法相比,训练周期大幅度下降且实现了能量利用率最大化。
中图分类号:
[1] LÓPEZ RIQUELME J A,SOTO F,SUARDÍAZ J,et al.Wireless Sensor Networks for precision horticulture in Southern Spain[J].Computers & Electronics in Agriculture,2009,68(1):25-35. [2] GIUSEPPE A,MARCO C,MARIO D F,et al.Energy conservation in wireless sensor networks:A survey[J].Ad Hoc Networks,2009,7(3):537-568. [3] FAFOUTIS X,VUCKOVIC D,DI M A,DRAGON N,et al.Energy-Harvesting wireless sensor networks[C]//Proc.of the 9th European Conf.on Wireless Sensor Networks(EWSN).Trento:University of Trento,2012:84-85. [4] KURS A,KARALIS A,MOFFATT R,et al.Wireless power transfer via strongly coupled magnetic resonances[J].Science,2007,317(5834):83-86. [5] KURS A,MOFFATT R,SOLJACIC M.Simultaneous midrange power transfer to multiple devices[J].Applied Physics Letters,2010,96(4):34. [6] DAI H P,CHEN G H,XU L J,et al.Effective Algorithm for Placement of Directional Wireless Chargers[J].Ruan Jian Xue Bao,2015,26(7):1711-1729. [7] SHI Y,XIE L HOU Y T,et al.On Renewable Sensor Networks with Wireless Energy Transfer[C]//INFOCOM,2011 Procee-dings IEEE.IEEE,2012:1350-1358. [8] XIE L,SHI Y,HOU Y T,et al.Multi-node wireless energy charging in sensor networks[J].IEEE/ACM Transactions on Networking,2015,23(2):437-450. [9] HE L,KONG L,GU Y,et al.Evaluating the On-Demand Mobile Charging in Wireless Sensor Networks[J].IEEE Transactions on Mobile Computing,2015,14(9):1861-1875. [10] JIANG F C,HE S B,CHENG P,et al.On optimal scheduling in wireless rechargeable sensor networks for stochastic event capture[C]//IEEE International Conference on Mobile Adhoc & Sensor Systems.IEEE,2011. [11] SU Z Z.Research and Improvement on Routing Protocols ofWireless Sensor Networks Based on Clustering[D].Changchun:Jilin University,2016. [12] XIE L,SHI Y,HOU Y T,et al.Multi-Node Wireless Energy Charging in Sensor Networks[J].IEEE/ACM Transactions on Networking,2015,23(2):437-450. [13] HE S B,CHEN J M,JIANG F C,et al.Energy provisioning in wireless rechargeable sensor networks[J].IEEE Transactions on Mobile Computing,2013,12(10):1931-1942. [14] YU L C,LV H F,HE L,et al.Optimization of Charging Path for Wireless Rechargeable Sensor Networks[J].Journal of Shanghai DianJi University,2018(4):25-30. [15] FU L K,CHENG P,GU Y,et al.Minimizing charging delay in wireless rechargeable sensor networks[C]//2013 Proceedings IEEE INFOCOM.IEEE,2013:2922-2930. [16] VARTIAINEN E M,INO Y,SHIMANO R,et al.Numericalphase correction method for terahertz time-domain reflection spectroscopy[J].Journal of Applied Physics,2004,96(8):4171-4176. [17] GU S,LILLICRAP T,SUTSKEVER I,et al.Continuous deep q-learning with model-based acceleration[C]//International Conference on Machine Learning.2016:2829-2838. [18] SONG Y C,ZHANG Y Y,MENG H D.Research Based on Euclid Distance with Weights of Clustering Method[J].Computer Engineering and Applications,2007,43(4):179-180,226. |
[1] | 刘兴光, 周力, 刘琰, 张晓瀛, 谭翔, 魏急波. 基于边缘智能的频谱地图构建与分发方法 Construction and Distribution Method of REM Based on Edge Intelligence 计算机科学, 2022, 49(9): 236-241. https://doi.org/10.11896/jsjkx.220400148 |
[2] | 黄丽, 朱焱, 李春平. 基于异构网络表征学习的作者学术行为预测 Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning 计算机科学, 2022, 49(9): 76-82. https://doi.org/10.11896/jsjkx.210900078 |
[3] | 吕晓锋, 赵书良, 高恒达, 武永亮, 张宝奇. 基于异质信息网的短文本特征扩充方法 Short Texts Feautre Enrichment Method Based on Heterogeneous Information Network 计算机科学, 2022, 49(9): 92-100. https://doi.org/10.11896/jsjkx.210700241 |
[4] | 熊丽琴, 曹雷, 赖俊, 陈希亮. 基于值分解的多智能体深度强化学习综述 Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization 计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112 |
[5] | 史殿习, 赵琛然, 张耀文, 杨绍武, 张拥军. 基于多智能体强化学习的端到端合作的自适应奖励方法 Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning 计算机科学, 2022, 49(8): 247-256. https://doi.org/10.11896/jsjkx.210700100 |
[6] | 袁唯淋, 罗俊仁, 陆丽娜, 陈佳星, 张万鹏, 陈璟. 智能博弈对抗方法:博弈论与强化学习综合视角对比分析 Methods in Adversarial Intelligent Game:A Holistic Comparative Analysis from Perspective of Game Theory and Reinforcement Learning 计算机科学, 2022, 49(8): 191-204. https://doi.org/10.11896/jsjkx.220200174 |
[7] | 王兵, 吴洪亮, 牛新征. 基于改进势场法的机器人路径规划 Robot Path Planning Based on Improved Potential Field Method 计算机科学, 2022, 49(7): 196-203. https://doi.org/10.11896/jsjkx.210500020 |
[8] | 于滨, 李学华, 潘春雨, 李娜. 基于深度强化学习的边云协同资源分配算法 Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning 计算机科学, 2022, 49(7): 248-253. https://doi.org/10.11896/jsjkx.210400219 |
[9] | 李梦菲, 毛莺池, 屠子健, 王瑄, 徐淑芳. 基于深度确定性策略梯度的服务器可靠性任务卸载策略 Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient 计算机科学, 2022, 49(7): 271-279. https://doi.org/10.11896/jsjkx.210600040 |
[10] | 高文龙, 周天阳, 朱俊虎, 赵子恒. 基于双向蚁群算法的网络攻击路径发现方法 Network Attack Path Discovery Method Based on Bidirectional Ant Colony Algorithm 计算机科学, 2022, 49(6A): 516-522. https://doi.org/10.11896/jsjkx.210500072 |
[11] | 谭任深, 徐龙博, 周冰, 荆朝霞, 黄向生. 海上风电场通用运维路径规划模型优化及仿真 Optimization and Simulation of General Operation and Maintenance Path Planning Model for Offshore Wind Farms 计算机科学, 2022, 49(6A): 795-801. https://doi.org/10.11896/jsjkx.210400300 |
[12] | 杨浩雄, 高晶, 邵恩露. 考虑一单多品的外卖订单配送时间的带时间窗的车辆路径问题 Vehicle Routing Problem with Time Window of Takeaway Food ConsideringOne-order-multi-product Order Delivery 计算机科学, 2022, 49(6A): 191-198. https://doi.org/10.11896/jsjkx.210400005 |
[13] | 王永, 崔源. 基于四边形最优圈内最短路径的旅行商问题割边方法 Cutting Edge Method for Traveling Salesman Problem Based on the Shortest Paths in Optimal Cycles of Quadrilaterals 计算机科学, 2022, 49(6A): 199-205. https://doi.org/10.11896/jsjkx.210400065 |
[14] | 陈钧吾, 余华山. 面向无尺度图的Δ-stepping算法改进策略 Strategies for Improving Δ-stepping Algorithm on Scale-free Graphs 计算机科学, 2022, 49(6A): 594-600. https://doi.org/10.11896/jsjkx.210400062 |
[15] | 谢万城, 李斌, 代玥玥. 空中智能反射面辅助边缘计算中基于PPO的任务卸载方案 PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing 计算机科学, 2022, 49(6): 3-11. https://doi.org/10.11896/jsjkx.220100249 |
|