计算机科学 ›› 2021, Vol. 48 ›› Issue (8): 253-262.doi: 10.11896/jsjkx.200700032
张杰1, 岳韶华2, 王刚2, 刘家义1, 姚小强2
ZHANG Jie1, YUE Shao-hua2, WANG Gang2, LIU Jia-yi1, YAO Xiao-qiang2
摘要: 针对分布式环境下多智能体系统的交互模型存在效率低、局部冲突消解困难、缺少实际应用场景等问题,基于Stac-kelberg博弈设计了多主多从的交互模型,并将其应用于指挥控制流程中指控方与参与方之间的交互博弈问题。首先通过对Stackelberg博弈模型的优化与多属性决策,设计出多主多从Stackelberg博弈的多智能体系统,并利用半正定的二次型性能指标的最优化正则性,引入一个正则Riccati方程来对Stackelberg博弈下的闭环解问题进行求解;然后基于图论相关知识建立基于边拉普拉斯矩阵的多智能体系统模型以降低复杂问题的求解难度;最后经过数值推导仿真与实验分析,从多个角度验证了模型的高效性与强鲁棒性,证明了所提模型的真实性与高效性。
中图分类号:
[1]WANG F Y.CC 5.0:Intelligent Command and Control Systems in the Parallel Age[J].Journal of Command and Control,2015,1(1):107-120. [2]CHEN L,SHEN Y.Stochastic Stackelberg differential reinsu-rance games under time-inconsistent mean-variance framework[J].Insurance Mathematics and Economics,2019,88(4):409-444. [3]ZHANG M Y,WANG M Y,WANG X D,et al.Research onReal-Time Task Allocation of UAV Group Collaboration Based on Improved Contract Network [J].Aero Weaponry,2019,26(4):38-46. [4]LI X Q.Research on the Control of Hybrid H_2/H_∞ Based on Stackelberg Game Method [D].Jinan:Shandong University,2019. [5]GALARZA-JIMENEZ F,TELLEZ-CASTRO D,SOFRONY J,et al.Cooperative Output Regulation for Multi-Agent Systems with EDMD Leader Approximation[J].IFAC Papers OnLine,2019,52(20):91-96. [6]PIVOVARCHIKV.On multiplicity of eigenvalues in quantumgraph theory[J].Journal of Mathematical Analysis and Applications,2019,480(2):977-986. [7]FAN Y Z,WANG Y,BAO Y H,et al.Eigenvectors of Laplacian or signless Laplacian of hypergraphs associated with zero eigenvalue[J].Linear Algebra and Its Applications,2019,579(6):244-261. [8]OKANO T,NODA I.Adaptation Method of the ExplorationRatio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments[J].Journal of Advanced Computational Intelligence & Intelligent Informatics,2017,21(5):939-947. [9]WU L L,JING Z X,WU Q H,et al.Equilibrium Interaction Strategy of Integrated Energy System Based on Stackelberg Game Model[J].Automation of Electrics Power Systems,2018,42(4):142-150,207. [10]ZACKSENHOUSE M,NEMETS S,LEBEDEV M A,et al.Robust satisficing linear regression:Performance/robustness trade-off and consistency criterion[J].Mechanical Systems and Signal Processing,2008,23(6):1954-1964. [11]CRETTEZ B.On Hobbes's state of nature and game theory[J].Theory and Decision,2017,83(4):499-511. [12]WU Q,WANG F,ZHOU L G,et al.Method of Multiple Attri-bute Group Decision Making Based on 2-Dimension Interval Type-2 Fuzzy Aggregation Operators with Multi-granularity Linguistic Information[J].International Journal of Fuzzy Systems,2017,19(6):1880-1903. [13]SOLMEYER N,DIXON R,BALU R.Characterizing the Nashequilibria of a three-player Bayesian quantum game[J].Quantum Information Processing,2017,16(6):146. [14]GREINER D,PERIAUX J,EMPERADOR M J,et al.GameTheory Based Evolutionary Algorithms:A Review with Nash Applications in Structural Engineering Optimization Problems[J].Archives of Computational Methods in Engineering,2017,24(4):703-750. [15]HOLMBERG E,THORE C J,KLARBRING A.Game theoryapproach to robust topology optimization with uncertain loading[J].Structural and Multidisciplinary Optimization,2017,55(4):1383-1397. [16]LIU X K,GE Y Y,LI Y.Stackelberg games for model-free continuous-time stochastic systems based on adaptive dynamic programming[J].Applied Mathematics and Computation,2019,363:301-306. [17]CHUGH T,SINDHYA K,HAKANEN J,et al.Handling computationally expensive multiobjective optimization problems with evolutionary algorithms:A survey[J].Soft Computing,2019,23(9):3137-3166. [18]ZENG X L.Research on Aircraft Route Planning Method Based on Multi-Agent Co-Evolution [D].Harbin:Harbin Engineering University,2014. [19]YAGER R R.Multi-Criteria Decision Making with Interval Criteria Satisfactions Using the Golden Rule Representative Value[J].IEEE Transactions on Fuzzy Systems,2018,26(2):1023-1031. [20]CONWAY T J.More indefinite integrals from Riccati equations[J].Integral Transforms and Special Functions,2019,30(12):33-44. [21]AZAD M,BABICˇ J,MISTRY M.Effects of the weighting matrix on dynamic manipulability of robots[J].Autonomous Robots,2019,43(7):1867-1879. [22]ZHANG H,LI L,XU J,et al.Linear quadratic regulation and stabilization of discrete-time systems with delay and multiplicative noise[J].IEEE Transactions on Automatic Control,2015,60(10):2599-2613. [23]REN Y.Research on First-Order Multi-Agent System with Va-riable Parameter Quantizer[D].Harbin:Harbin Industrial University,2013. [24]KONG F Q,WANG D D,SHEN Q.Robust target tracking of l_1-l_2 norm combined constraint [J].Chinese Journal of Scienti-fic Instrument,2016,37(3):690-697. [25]MARCO A,MARTíNEZ J J,VIAÑA R.Least squares prob-lems involving generalized Kronecker products and application to bivariate polynomial regression[J].Numerical Algorithms,2019,82(1):21-39. [26]ZHANG H.Structural Characteristics of Strongly Connected K Quasi-Transitive Directed Graphs [D].Taiyuan:Shanxi University,2017. [27]LUO W,SONG C N,XU Q X.Perturbation estimation for the parallel sum of Hermitian positive semi-definite matrices[J].Linear and Multilinear Algebra,2019,67(10):1971-1984. [28]REN W,BEARD R W.Consensus seeking in multiagent systems under dynam-ically changing interaction topologies [J].Automatic Control,IEEE Transactionson,2005,50(5):655-661. [29]LI C Y,QU Z H,QI D L,et al.Distributed finite-time estimation of the bounds on algebraic connectivity for directed graphs[J].Automatica,2019,107:289-295. [30]STEGAGNO P,YUAN C Z.Distributed cooperative adaptive state estimation and system identification for multi-agent systems[J].Control Theory & Applications,IET,2019,13(6):815-822. [31]ZHOU B,XU X T,LIU J G,et al.Information interaction model for the mobile communication networks[J].Physica A:Statistical Mechanics and its Applications,2019,525:1170-1176. [32]MA Y M.Research on Feedback Mechanism of Online Learning Based on Metacognition[D].Chendu:Sichuan Normal University,2016. [33]ZHANG J,WANG G,SONG Y F,et al.Optimization of Air-Defense Resource Deployment Based on Adaptive SGD-Multiagent[J].Systems Engineering and Electronic Technology,2019,41(7):1536-1543. [34]WANG G,LI W M,HE J.Research on Resource Management of Distributed Air Defense Battlefield Based on Multi-Agent[J].Fire Control & Command Control,2003,65(2):32-34. [35]SU J H.The Existence And Uniqueness of Solutions to Stochastic Differential Equations with Non-Lipschitz Coefficients[D].Hefei:University of Science and Technology of China,2017. |
[1] | 鲁晨阳, 邓苏, 马武彬, 吴亚辉, 周浩浩. 基于分层抽样优化的面向异构客户端的联邦学习 Federated Learning Based on Stratified Sampling Optimization for Heterogeneous Clients 计算机科学, 2022, 49(9): 183-193. https://doi.org/10.11896/jsjkx.220500263 |
[2] | 傅丽玉, 陆歌皓, 吴义明, 罗娅玲. 区块链技术的研究及其发展综述 Overview of Research and Development of Blockchain Technology 计算机科学, 2022, 49(6A): 447-461. https://doi.org/10.11896/jsjkx.210600214 |
[3] | 杨亚红, 王海瑞. 基于Renyi熵和BiGRU算法实现SDN环境下的DDoS攻击检测方法 DDoS Attack Detection Method in SDN Environment Based on Renyi Entropy and BiGRU Algorithm 计算机科学, 2022, 49(6A): 555-561. https://doi.org/10.11896/jsjkx.210800095 |
[4] | 孙浩, 毛瀚宇, 张岩峰, 于戈, 徐石成, 何光宇. 区块链跨链技术发展及应用 Development and Application of Blockchain Cross-chain Technology 计算机科学, 2022, 49(5): 287-295. https://doi.org/10.11896/jsjkx.210800132 |
[5] | 冯了了, 丁滟, 刘坤林, 马科林, 常俊胜. 区块链BFT共识算法研究进展 Research Advance on BFT Consensus Algorithms 计算机科学, 2022, 49(4): 329-339. https://doi.org/10.11896/jsjkx.210700011 |
[6] | 谭双杰, 林宝军, 刘迎春, 赵帅. 基于机器学习的分布式星载RTs系统负载调度算法 Load Scheduling Algorithm for Distributed On-board RTs System Based on Machine Learning 计算机科学, 2022, 49(2): 336-341. https://doi.org/10.11896/jsjkx.201200126 |
[7] | 王如斌, 李瑞远, 何华均, 刘通, 李天瑞. 面向海量空间数据的分布式距离连接算法 Distributed Distance Join Algorithm for Massive Spatial Data 计算机科学, 2022, 49(1): 95-100. https://doi.org/10.11896/jsjkx.210100060 |
[8] | 钱甜甜, 张帆. 基于分布式边缘计算的情绪识别系统 Emotion Recognition System Based on Distributed Edge Computing 计算机科学, 2021, 48(6A): 638-643. https://doi.org/10.11896/jsjkx.201000010 |
[9] | 唐飞, 陈云龙, 冯卓. 基于区块链和代理重加密的电子处方共享方案 Electronic Prescription Sharing Scheme Based on Blockchain and Proxy Re-encryption 计算机科学, 2021, 48(6A): 498-503. https://doi.org/10.11896/jsjkx.201000143 |
[10] | 张航, 唐聃, 蔡红亮. 分布式存储系统中的预测式纠删码研究 Study on Predictive Erasure Codes in Distributed Storage System 计算机科学, 2021, 48(5): 130-139. https://doi.org/10.11896/jsjkx.200300124 |
[11] | 高枫越, 王琰, 朱铁兰. 有适应力的分布式状态估计方法 Resilient Distributed State Estimation Algorithm 计算机科学, 2021, 48(5): 308-312. https://doi.org/10.11896/jsjkx.200300117 |
[12] | 张晓, 张思蒙, 石佳, 董聪, 李战怀. Ceph分布式存储系统性能优化技术研究综述 Review on Performance Optimization of Ceph Distributed Storage System 计算机科学, 2021, 48(2): 1-12. https://doi.org/10.11896/jsjkx.201000149 |
[13] | 王锡龙, 李鑫, 秦小麟. 电力物联网下分布式状态感知的源网荷储协同调度 Collaborative Scheduling of Source-Grid-Load-Storage with Distributed State Awareness UnderPower Internet of Things 计算机科学, 2021, 48(2): 23-32. https://doi.org/10.11896/jsjkx.200900209 |
[14] | 苑晨宇, 谢在鹏, 朱晓瑞, 屈志昊, 徐媛媛. 一种基于分布式编码的卷积优化算法 Convolutional Optimization Algorithm Based on Distributed Coding 计算机科学, 2021, 48(2): 47-54. https://doi.org/10.11896/jsjkx.200800187 |
[15] | 丁诗铭, 王天荆, 沈航, 白光伟. 基于能量分类器的抗SSDF攻击协作频谱感知算法 Energy Classifier Based Cooperative Spectrum Sensing Algorithm for Anti-SSDF Attack 计算机科学, 2021, 48(2): 282-288. https://doi.org/10.11896/jsjkx.191100124 |
|