导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2020, Vol. 41 ›› Issue (12): 324152-324152.doi: 10.7527/S1000-6893.2020.24152

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Cooperative attack-defense game of multiple UAVs with asymmetric maneuverability

CHEN Can1,2, MO Li1,2, ZHENG Duo1,2, CHENG Ziheng1,2, LIN Defu1,2   

  1. 1. School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China;
    2. Beijing Key Laboratory of UAV Autonomous Control, Beijing Institute of Technology, Beijing 100081, China
  • Received:2020-04-29 Revised:2020-05-22 Published:2020-06-24
  • Supported by:
    National Natural Science Foundation of China (61903350); Key Program of National Natural Science Foundation of China (U1613225)

Abstract: The attack-defense game is an important combat scenario of future military Unmanned Aerial Vehicles (UAVs). This paper studies an attack-defense game between groups of UAVs with different maneuverability, establishing a multi-UAV cooperative attack and defense evolution model. Based on the multi-agent reinforcement learning theory, the autonomous decision-making method of multi-UAV cooperative attack-defense game is studied, and a centralized critic and distributed actor algorithm structure is proposed based on the actor-critic algorithm, guaranteeing the convergence of the algorithm and improving the efficiency of decision-making. The critic module of UAVs uses the global information to evaluate the decision-making quality during training, while the actor module only needs to rely on the local perception information to make autonomous decisions during execution, hence improving the effectiveness of the multi-UAV attack-defense game. The simulation results show that the proposed multi-UAV reinforcement learning method has a strong self-evolution property, endowing the UAV certain intelligence, that is, the stable autonomous learning ability. Through continuous training, the UAVs can autonomously learn cooperative attack or defense policies to improve the effectiveness of decision-making.

Key words: multi-UAV coordination, attack-defense games, reinforcement learning, centralized critic, distributed actors

CLC Number: