基于情节经验回放的深度确定性策略梯度方法
张建行, 刘全
Deep Deterministic Policy Gradient with Episode Experience Replay
ZHANG Jian-hang, LIU Quan
计算机科学 . 2021, (10): 37 -43 .  DOI: 10.11896/jsjkx.200900208