Multi-Agent Fuzzy Rewards and Policy Learning

Computer Science, 2005, Vol. 32, Issue (8): 128-130.


张化祥, 黄上腾

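School of Information Management, Shandong Normal University, Jinan 250014; Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200030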

Published: 2018-11-17; Online: 2018-11-17

Zhang HuaXiang; Huang ShangTeng


Abstract

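Abstract: This paper studies multi-agent decision problems based on fuzzy knowledge. By modeling the fuzzy knowledge of each agent's decision goals, we present a multi-agent decision model based on fuzzy rewards and study a gradient-based algorithm for learning agent policies.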

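Keywords: fuzzy set, game, gradient learning, multi-agent, reward, fuzzy knowledge, decision problem, decision goal, decision model, learning algorithm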

Abstract: The multi-agent decision based on fuzzy knowledge is discussed. The agent's fuzzy reward is proposed under the fuzzy knowledge of different decision goals, and a gradient learning algorithm is described to learn the agent's action policy under fuzzy rewar

Key words: Fuzzy set, Game, Gradient learning
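
The abstract gives no formulas, so the following is only a minimal sketch of how gradient-based policy learning under fuzzy rewards might look in a two-agent, two-action game. Everything in it is an assumption made for illustration and is not taken from the paper: the ramp-shaped membership function, the payoff tables, the step size, and all function names (`ramp_membership`, `fuzzy_reward_table`, `gradient_step`, `learn`). Each agent's fuzzy reward is taken to be the degree to which a crisp payoff satisfies its goal, and each agent ascends the gradient of its own expected fuzzy reward.

```python
# Illustrative sketch only: the membership function, payoff tables, and
# step size are assumptions, not the method described in the paper.

def ramp_membership(payoff, low, high):
    """Degree in [0, 1] to which a crisp payoff satisfies a fuzzy goal."""
    if payoff <= low:
        return 0.0
    if payoff >= high:
        return 1.0
    return (payoff - low) / (high - low)


def fuzzy_reward_table(payoffs, low, high):
    """Map each crisp payoff in a 2x2 table to its membership degree."""
    return [[ramp_membership(v, low, high) for v in row] for row in payoffs]


def expected_reward(table, p, q):
    """Expected reward when the row agent plays action 0 with probability p
    and the column agent plays action 0 with probability q."""
    return (p * q * table[0][0] + p * (1 - q) * table[0][1]
            + (1 - p) * q * table[1][0] + (1 - p) * (1 - q) * table[1][1])


def clip(x):
    """Keep a probability inside [0, 1]."""
    return min(1.0, max(0.0, x))


def gradient_step(row_table, col_table, p, q, step=0.05):
    """One simultaneous gradient-ascent step on each agent's expected fuzzy reward."""
    # Partial derivative of the row agent's expected reward with respect to p.
    dp = (q * (row_table[0][0] - row_table[1][0])
          + (1 - q) * (row_table[0][1] - row_table[1][1]))
    # Partial derivative of the column agent's expected reward with respect to q.
    dq = (p * (col_table[0][0] - col_table[0][1])
          + (1 - p) * (col_table[1][0] - col_table[1][1]))
    return clip(p + step * dp), clip(q + step * dq)


def learn(row_table, col_table, p=0.5, q=0.5, steps=500):
    """Repeat gradient steps from an initial mixed policy and return (p, q)."""
    for _ in range(steps):
        p, q = gradient_step(row_table, col_table, p, q)
    return p, q


if __name__ == "__main__":
    # Hypothetical crisp payoffs; both agents' goal is "payoff close to 4".
    row = fuzzy_reward_table([[4, 3], [1, 0]], low=0, high=4)
    col = fuzzy_reward_table([[4, 1], [3, 0]], low=0, high=4)
    print(learn(row, col))
```

In this hypothetical game, action 0 yields the higher membership degree for each agent regardless of the opponent's choice, so both probabilities are pushed toward 1 and clipped there. Games without a dominant action can cycle under plain gradient ascent, which this sketch does not address.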

张化祥, 黄上腾. Multi-Agent Fuzzy Rewards and Policy Learning [J]. Computer Science, 2005, 32(8): 128-130. https://doi.org/

Zhang HuaXiang; Huang ShangTeng. [J]. Computer Science, 2005, 32(8): 128-130. https://doi.org/
