导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (10): 329166-329166.doi: 10.7527/S1000-6893.2023.29166

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Time-varying formation control for heterogeneous clusters with switching topologies via reinforcement learning

Jiaxiu YANG, Xinkai LI(), Hongli ZHANG, Hao WANG   

  1. School of Electrical Engineering,Xinjiang University,Urumqi 830017,China
  • Received:2023-06-13 Revised:2023-06-26 Accepted:2023-08-23 Online:2024-05-25 Published:2023-09-01
  • Contact: Xinkai LI E-mail:lxk@xju.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62263030);Youth Project of Natural Science Foundation of Xinjiang Uygur Autonomous Region(2022D01C86)

Abstract:

To address the problem of time-varying formation control of high-order heterogeneous unmanned cluster systems with uncertain system model dynamics and switching communication topology, an optimal distributed hierarchical formation control method is proposed based on integral reinforcement learning. The time-varying formation control problem for heterogeneous cluster systems is transformed into a stabilization problem by using time-varying formation switching vectors to construct an augmented system of multi-quadrotor Unmanned Aircraft System (UAS) with multi-unmanned vehicle systems. The value function with discount factor is introduced to transform the stabilization problem of the heterogeneous clustered system into an optimal control problem. Only the feedback gain parameters are replaced and averaged to obtain the optimal time-varying formation switching control protocol for the whole heterogeneous cluster without destroying the consistent distributed formation control protocol. The feedback gain of the distributed time-varying formation switching controller is updated online in real time using a single-network “actor network-critic network” structure, combined with the integral reinforcement learning algorithm and the distributed control method. The effectiveness and superiority of the proposed control scheme are verified by theoretical proof and simulation experiments.

Key words: integral reinforcement learning, heterogeneous cluster, time-varying formation control, distributed control, switching topology, optimal control

CLC Number: