论文

考虑通信和梯度时延的联盟博弈分布式对偶平均算法及在编队控制中的应用

  • 刘加勋 ,
  • 陈明飞 ,
  • 徐晓鹏 ,
  • 刘帅 ,
  • 王东
展开
  • 大连理工大学 控制科学与工程学院 工业装备智能控制与优化教育部重点实验室,大连 116024
.E-mail: dwang@dlut.edu.cn

收稿日期: 2024-09-30

  修回日期: 2024-10-18

  录用日期: 2024-11-04

  网络出版日期: 2024-11-25

基金资助

国家自然科学基金(61973050);国家自然科学基金(62173061);辽宁省科技合作项目(2023JH2/101700362);辽宁省科技合作项目(2023JH2/101300200)

Distributed dual average algorithm with communication and gradient delays for coalition games and its application in formation control

  • Jiaxun LIU ,
  • Mingfei CHEN ,
  • Xiaopeng XU ,
  • Shuai LIU ,
  • Dong WANG
Expand
  • Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education,School of Control Science and Engineering,Dalian University of Technology,Dalian 116024,China
E-mail: dwang@dlut.edu.cn

Received date: 2024-09-30

  Revised date: 2024-10-18

  Accepted date: 2024-11-04

  Online published: 2024-11-25

Supported by

National Natural Science Foundation of China(61973050);Liaoning Province Science and Technology Cooperation Programs(2023JH2/101700362)

摘要

针对通信时延与梯度时延共存下的联盟博弈,提出基于对偶平均技术和时延梯度的分布式对偶平均算法来求解纳什均衡。采用增广图方法表征通信时延以及利用布雷格曼散度度量时延梯度与当前梯度之间的误差,理论分析表明,提出的分布式对偶平均算法以次线性收敛率收敛至纳什均衡。同时,研究结果阐明了通信时延与梯度时延对算法收敛误差的影响。最后,将所提出的分布式对偶平均算法应用到无人机集群的编队控制中验证算法的有效性。

本文引用格式

刘加勋 , 陈明飞 , 徐晓鹏 , 刘帅 , 王东 . 考虑通信和梯度时延的联盟博弈分布式对偶平均算法及在编队控制中的应用[J]. 航空学报, 2025 , 46(11) : 531322 -531322 . DOI: 10.7527/S1000-6893.2024.31322

Abstract

To address coalition games with communication and gradient delays, this paper proposes a distributed algorithm based on dual averaging and delayed gradient to seek the Nash equilibrium. With the help of augmented graphs and Bregman divergence, it is demonstrated that the proposed algorithm converges to the Nash equilibrium at a sub-linear rate, and the effect of communication and gradient delays on the convergence error is also clarified. Simulations in formation of unmanned aerial vehicle swarms verify the effectiveness of the proposed algorithm.

参考文献

[1] 杨加秀, 李新凯, 张宏立, 等. 切换拓扑下异构集群的强化学习时变编队控制[J]. 航空学报202445(10): 329166.
  YANG J X, LI X K, ZHANG H L, et al. Time-varying formation control for heterogeneous clusters with switching topologies via reinforcement learning[J]. Acta Aeronautica et Astronautica Sinica202445(10): 329166 (in Chinese).
[2] 郭洪振, 陈谋, 戴永东, 等. 分布式自适应事件触发四旋翼无人机编队控制[J]. 航空学报202344(S2): 491-500.
  GUO H Z, CHEN M, DAI Y D, et al. Distributed adaptive event-triggered formation control for QUAVs?[J]. Acta Aeronautica et Astronautica Sinica202344(S2): 729917 (in Chinese).
[3] ZIMMERMANN J, TATARENKO T, WILLERT V, et al. Solving leaderless multi-cluster games over directed graphs[J]. European Journal of Control202162: 14-21.
[4] TAN S L, FANG Z H, WANG Y N, et al. A timestamp-based inertial best-response dynamics for distributed Nash equilibrium seeking in weakly acyclic games?[J]. IEEE Transactions on Neural Networks and Learning Systems202435(1): 1330-1340.
[5] LI Z K, JIAO J J, CHEN X. Distributed optimal control with recovered robustness for uncertain network systems: A complementary design approach?[J]. IEEE Transactions on Automatic Control202469(4): 2484-2491.
[6] XU H, LU K H, WANG T B, et al. Online distributed seeking for first-order Nash equilibria of nonconvex noncooperative games with multiple clusters?[J]. IEEE Transactions on Circuits and Systems II: Express Briefs202370(2): 621-625.
[7] YE M J, HU G Q, LEWIS F L. Nash equilibrium seeking for N-coalition noncooperative games?[J]. Automatica201895: 266-272.
[8] NIAN X H, NIU F X, YANG Z. Distributed Nash equilibrium seeking for multicluster game under switching communication topologies?[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems202252(7): 4105-4116.
[9] ZHOU J L, LV Y Z, WEN G H, et al. Distributed Nash equilibrium seeking in consistency-constrained multicoalition games?[J]. IEEE Transactions on Cybernetics202353(6): 3675-3687.
[10] LIU F, DONG X W, YU J L, et al. Distributed Nash equilibrium seeking of N-coalition noncooperative games with application to UAV swarms[J]. IEEE Transactions on Network Science and Engineering20229(4): 2392-2405.
[11] LI Y M, ZHU Y N, LI T, et al. Distributed Nash equilibrium seeking in a multi-coalition noncooperative game under incomplete decision information[J]. IEEE Transactions on Circuits and Systems Ⅱ: Express Briefs202269(8): 3400-3404.
[12] GHARESIFARD B, CORTéS J. Distributed convergence to Nash equilibria in two-network zero-sum games[J]. Automatica201349(6): 1683-1692.
[13] MENG M, LI X X. On the linear convergence of distributed Nash equilibrium seeking for multi-cluster games under partial-decision information[J]. Automatica2023151: 110919.
[14] CAO Y C, YU W W, REN W, et al. An overview of recent progress in the study of distributed multi-agent coordination?[J]. IEEE Transactions on Industrial Informatics20139(1): 427-438.
[15] HADJICOSTIS C N, CHARALAMBOUS T. Average consensus in the presence of delays in directed graph topologies?[J]. IEEE Transactions on Automatic Control201459(3): 763-768.
[16] TSIANOS K I, RABBAT M G. Distributed dual averaging for convex optimization under communication delays[C]∥2012 American Control Conference (ACC). Piscataway: IEEE Press, 2012: 1067-1072.
[17] NEDI? A, OZDAGLAR A. Convergence rate for consensus with delays?[J]. Journal of Global Optimization201047(3): 437-456.
[18] WANG X F, SUN X M, YE M J, et al. Robust distributed Nash equilibrium seeking for games under attacks and communication delays[J]. IEEE Transactions on Automatic Control202267(9): 4892-4899.
[19] LI J Y, CHEN G, DONG Z Y, et al. Distributed mirror descent method for multi-agent optimization with delay[J]. Neurocomputing2016177: 643-650.
[20] DUCHI J C, AGARWAL A, WAINWRIGHT M J. Dual averaging for distributed optimization: Convergence analysis and network scaling[J]. IEEE Transactions on Automatic Control201257(3): 592-606.
[21] BREGMAN L M. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming?[J]. USSR Computational Mathematics and Mathematical Physics19677(3): 200-217.
[22] NESTEROV Y. Primal-dual subgradient methods for convex problems[J]. Mathematical Programming2009120(1): 221-259.
[23] WANG H W, LIAO X F, HUANG T W, et al. Cooperative distributed optimization in multiagent networks with delays[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems201545(2): 363-369.
[24] NEDI? A, OLSHEVSKY A. Distributed optimization over time-varying directed graphs?[J]. IEEE Transactions on Automatic Control201560(3): 601-615.
文章导航

/