基于Meta平衡的多Agent Q学习算法研究

Computer Science ›› 2012, Vol. 39 ›› Issue (Z6): 261-264.

Research on Multi-agent Q Learning Algorithm Based on Meta Equilibrium

Online:2018-11-16 Published:2018-11-16

Abstract

Abstract: Multi-agent reinforcement learning algorithms aim at cooperation strategy, while NashQ is frectuently menboned as a pivotal algorithm to the study of non-cooperative strategics. In multi agent systems, Nash equilibrium can not ensure the solutions obtained Pareto optimal, besides, the algorithm with high computation complexity. MetaQ algorithm was proposed in this paper. It is different from NashQ that MetaQ finds out the optimal solution by the pretreatment of its own behavior and the prediction of the others behavior. In the end,a game-climate cooperation strategy was used in this paper, and the results shows that MetaQ algorithm, with impressive performance, is fit for non-cooperative problem.

Key words: Reinforcement learning, Meta ectuilibrium, NashQ, Multi-agent system