Computer Science ›› 2017, Vol. 44 ›› Issue (1): 53-59.doi: 10.11896/j.issn.1002-137X.2017.01.010

Previous Articles     Next Articles

Optimized Negotiation Model Based on Reinforcement Learning of Medium Agent

ZHANG Jing-min and DONG Hong-bin   

  • Online:2018-11-13 Published:2018-11-13

Abstract: This paper proposed reinforcement learning bilateral optimized negotiation model based on reinforcement learning.The medium agent was introduced.It uses different parameters in the reinforcement learning strategy to produce proposals,and selects the best parameters to negotiate.The purpose is to further improve the performance of negotiation,and then the article presented the learning ability of adaptive based on medium agent.The simulation results show the effectiveness of the proposed method of negotiation and that it can improve the performance of negotiation.

Key words: Multi-agent system,Reinforcement learning,Adaptive learning,Medium agent

[1] BAARLAG T,HENDRIKX M J C,HINDRIKS K V,et al.Learning about the opponent in automated bilateral negotiation:a comprehensive survey of opponent modeling techniques[J].Autonomous Agents and Multi-Agent Systems,2015,30(5):849-898.
[2] ILANY L,GAL Y.Algorithm selection in bilateral negotiation[J].Autonomous Agents and Multi-Agent Systems,2016,30(4):697-723.
[3] ZHANG X S,KLEIN M,MARSAMAESTRE I.Scalable Complex Contract Negotiation with Structured Search and Agenda Management[C]∥AAAI Conference on Artificial Intelligence.2014.
[4] ZHANG Hua-xiang,HUANG Shang-teng.Agent NegotiationModel Based on Reinforcement learning[J].Computer Enginee-ring,2004,30(10):137-139.(in Chinese) 张化祥,黄上腾.基于增强学习的代理谈判模型[J].计算机工程,2004,30(10):137-139.
[5] SUN T H,CHEN F,ZHU Q S.Reinforcement learning negotiation strategy based on Bayesian classification[J].Chinese Journal of Computer Science,2011,38(9):227-229.(in Chinese) 孙天昊,陈飞,朱庆生.基于贝叶斯分类的增强学习协商策略[J].计算机科学,2011,38(9):227-229.
[6] ZHANG Lin-lan,SONG Hai-gang,CHEN Xue-guang,et al.Asimultaneous multi-issue negotiation through autonomous agents[J].European Journal of Operational Research,2010,210(1):95-105.
[7] SUN Tian-hao,DENG Jun-kun,CAO Feng,et al.Reinforcement Learning Negotiation Strategy based on Opponent Classification[C]∥The International Conference on Computer Science and Service System (CSSS 2011).Nanjing,China,2011:3987-3989.
[8] SUI Xin,CAI Guo-yong,SHI Lei.Multi-agent negotiation strategy and algorithm based on Q-Learning[J].Chinese Journal of computer engineering,2010,36(17):198-200.(in Chinese) 隋新,蔡国永,史磊.基于Q-强化学习的多Agent协商策略及算法[J].计算机工程,2010,36(17):198-200.
[9] CHEN Li-hong,DONG Hong-bin,HAN Qi-long,et al.Bilateral Multi-issue Parallel Negotiation Model Based on Reinforcement Learning[C]∥The 14th International Conference on Intelligent Data Engineering and Automated Learning (IDEAL 2013,LNCS 8206).2013.
[10] DIAMAH A,MOHAMMADIAN M,BALACHANDRAN B.Fuzzy utility and inference system for bilateral negotiation[C]∥2012 International Conference on Uncertainty Reasoning and Knowledge Engineering.2012:115-118.
[11] CHEN Li-hong,DONG Hong-bin,ZHOU Yang.A reinforce-ment learning optimized negotiation method based on mediator agent[J].Expert Systems with Applications,2014(41):7630-7640.
[12] 普尔.人工智能:计算Agent基础[M].董红斌,等译,北京:机械工业出版社,2015.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75, 88 .
[2] XIA Qing-xun and ZHUANG Yi. Remote Attestation Mechanism Based on Locality Principle[J]. Computer Science, 2018, 45(4): 148 -151, 162 .
[3] LI Bai-shen, LI Ling-zhi, SUN Yong and ZHU Yan-qin. Intranet Defense Algorithm Based on Pseudo Boosting Decision Tree[J]. Computer Science, 2018, 45(4): 157 -162 .
[4] WANG Huan, ZHANG Yun-feng and ZHANG Yan. Rapid Decision Method for Repairing Sequence Based on CFDs[J]. Computer Science, 2018, 45(3): 311 -316 .
[5] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[6] ZHANG Jia-nan and XIAO Ming-yu. Approximation Algorithm for Weighted Mixed Domination Problem[J]. Computer Science, 2018, 45(4): 83 -88 .
[7] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[8] LIU Qin. Study on Data Quality Based on Constraint in Computer Forensics[J]. Computer Science, 2018, 45(4): 169 -172 .
[9] ZHONG Fei and YANG Bin. License Plate Detection Based on Principal Component Analysis Network[J]. Computer Science, 2018, 45(3): 268 -273 .
[10] SHI Wen-jun, WU Ji-gang and LUO Yu-chun. Fast and Efficient Scheduling Algorithms for Mobile Cloud Offloading[J]. Computer Science, 2018, 45(4): 94 -99, 116 .