基于改进增强学习算法的双边多协议协商策略

计算机科学 ›› 2014, Vol. 41 ›› Issue (1): 290-292.

基于改进增强学习算法的双边多协议协商策略

张科,罗军,邓俊昆

重庆大学计算机学院重庆400044;重庆大学计算机学院重庆400044;重庆大学计算机学院重庆400044

出版日期:2018-11-14 发布日期:2018-11-14
基金资助:
本文受中央高校基本科研业务费科研专项项目(CDJZR10180014)资助

Bilateral Multi-protocol Negotiation Strategies Based on Reinforcement Learning

ZHANG Ke,LUO Jun and DENG Jun-kun

Online:2018-11-14 Published:2018-11-14

摘要/Abstract

摘要： 针对传统增强学习算法存在妥协过快导致自身效用降低的缺点,通过设计改进增强学习算法的双边多议题协商模型,引入期望还原率,还原Agent的期望,从而提高协商解的质量。通过实验分析了期望还原率不同取值对协商的影响,并对传统增强学习协商策略、基于时间的协商策略和改进增强学习协商策略的协商效果做了对比。实验表明,在协商次数允许的范围之内,基于期望还原率的改进增强学习算法在双边多议题协商中能够提升双方的效用。

关键词: 协商策略,增强学习,期望还原率,双边多议题

Abstract: Traditional reinforcement learning negotiation strategy has the shortcoming of compromising too fast and reduces the utility of agent．Aiming at this problem,improved reinforcement learning bilateral multi-issue negotiation strategy which imports expectation restoration rate to restore the expectation of agent can improve the quality of the negotiation result．This paper analysed the influence of different expectation reduction rate on negotiation and contrasted traditional reinforcement learning negotiation strategies,time-based negotiation strategy and the proposed enhance learning negotiation strategy consultation．The result shows that negotiation strategy can get higher bilateral utility within allowing negotiation turns．

Key words: Negotiation strategy,Reinforcement learning,Expectation restoration rate,Bilateral multi-issue negotiation

张科,罗军,邓俊昆. 基于改进增强学习算法的双边多协议协商策略[J]. 计算机科学, 2014, 41(1): 290-292. https://doi.org/

ZHANG Ke,LUO Jun and DENG Jun-kun. Bilateral Multi-protocol Negotiation Strategies Based on Reinforcement Learning[J]. Computer Science, 2014, 41(1): 290-292. https://doi.org/

参考文献

[1] Park S,Yang Sung-Bong．An efficient multilateral negotiationsystem for pervasive computing environments [J]．Engineering Applications of Artificial Intelligence,2008,21:633-643
[2] Zeng D,Sycara K．Bayesian learning in negotiation[J].Int’l J．Human-Computer Studies,1998,8:125-141
[3] 李剑,牛少彰．一种基于混合遗传算法的双边多议题协商[J]．北京邮电大学学报,2009,2(2):1-4
[4] 程昱,高济,古华茂,等．基于对手态度学习的协商决策模型[J].浙江大学学报:工学版,2008,2(10):1676-1
[5] Mitchell T M．机器学习[M]．曾华军,等译.北京:机械工业出版社,2009
[6] 张化祥,黄上腾．基于增强学习的代理谈判模型[J]．计算机工程,2004,30(10):137-139
[7] Chen Pei-you,Li Yi-jun,Li Xing．The research on E-business-oriented Automatic Negotiation System based on faithful and dynamic Q-study[C]∥Chinese Control and Decision Conference．Shenyang,2008
[8] 孙天昊,邓俊昆,陈飞,等．基于增强学习协商策略的研究及优化[J]．计算机工程与应用,2012,48(23):44-46
[9] 艾解清．双边多议题自动协商研究[D]．杭州:浙江大学,2011:46-47
[10] 罗志伟.协同设计系统中图形协同与网络协商的实现[J].重庆理工大学学报:自然科学版,2012,6(7):27-33

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed