MAS中基于多奖惩标准的Q学习算法研究

计算机科学 ›› 2012, Vol. 39 ›› Issue (Z6): 235-237.

MAS中基于多奖惩标准的Q学习算法研究

乔林，罗杰

(南京邮电大学自动化学院南京 210046)

出版日期:2018-11-16 发布日期:2018-11-16

Research on任learning Algorithm Based

Online:2018-11-16 Published:2018-11-16

摘要/Abstract

摘要： 传统的Q学习算法是基于单奖惩标准的。基于单奖惩标准的Q学习算法往往不能适应multi-agent system

关键词: Q学习算法，多奖惩标准，MAS，三维围捕

Abstract: Traditional C}learning algorithm is based on a single standard of reward, when the environments and the state is changed, the single standard of reward may not be able to adapt to new environments and state in multi agent system(MAS) , instead, it may restrict the learning efficiency. hhis paper proposed a method of multi agent "lcarning algorithm with multi-standard of reward. It adapt well to the changing environment and the state, complete the task in stages, different stages use different standards, so it can quickly complete the stage goal. In this paper, the simulation platform is pursuit problem in threcdimensional world. We increased the difficulty of rounding up and the complexity of the environment and state. Simulation results show that "lcarning algorithm based on multi-standard of reward can flexibly adapt to different environments and state,and efficiently complete learning tasks.

Key words: "learning algorithm, Multi-standard of reward, MAS, Pursuit problem in threcdimensional world

乔林，罗杰. MAS中基于多奖惩标准的Q学习算法研究[J]. 计算机科学, 2012, 39(Z6): 235-237. https://doi.org/

参考文献

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed