Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (4): 328723-328723. doi: 10.7527/S1000-6893.2023.28723

• Electronics and Electrical Engineering and Control

Air combat intelligent decision-making method based on self-play and deep reinforcement learning

Shengzhe SHAN1,2, Weiwei ZHANG1

  1. School of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, China
  2. 93995 Unit of the Chinese People’s Liberation Army, Xi’an 710306, China
  • Received: 2023-03-21 Revised: 2023-06-12 Accepted: 2023-08-29 Online: 2023-09-01 Published: 2023-09-01
  • Contact: Weiwei ZHANG E-mail: aeroelastic@nwpu.edu.cn
  • Supported by:
    Science and Technology Foundation of National Defense Key Laboratory (6142219190302)

Abstract:

Air combat is a key component of modern three-dimensional warfare, and intelligent air combat has become a research focus in the military field both domestically and internationally. Deep reinforcement learning is an important technical approach to achieving air combat intelligence. To address the difficulty of constructing high-level opponents in single-agent training, a self-play-based training method for air combat agents is proposed, and a visualization research platform is built to develop a decision-making agent for close-range air combat. Pilots’ domain knowledge is embedded in the design of the agent’s observations, actions, and rewards, and the agent is trained to convergence. Simulation experiments show that the agent’s air combat tactics gradually improve through self-play training: the agent achieves a win rate of over 70% against an agent trained with the single-agent method, and strategies similar to the human “single/double loop” tactics emerge.
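
The general idea of self-play, in which an agent improves by competing against its own earlier behaviour rather than a hand-built opponent, can be illustrated with a toy sketch. This is not the method of the paper: as an assumption for illustration, rock-paper-scissors stands in for the air combat simulator, and a simple best-response update (fictitious play) stands in for the deep reinforcement learning agent. The names `PAYOFF`, `best_response`, and `self_play` are hypothetical.

```python
# Toy illustration of self-play via fictitious play.
# Assumption: rock-paper-scissors replaces the air combat environment,
# and a best-response rule replaces the deep RL policy update.

# PAYOFF[i][j] is the reward for playing action i against action j.
PAYOFF = [
    [0, -1, 1],   # rock
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]


def best_response(opponent_counts):
    """Return the action with the highest total payoff against past play."""
    values = [
        sum(PAYOFF[a][b] * n for b, n in enumerate(opponent_counts))
        for a in range(3)
    ]
    # Ties are broken in favour of the lowest action index.
    return max(range(3), key=lambda a: values[a])


def self_play(iterations):
    """Train against the agent's own history; return action frequencies."""
    counts = [1, 0, 0]  # arbitrary starting behaviour: always rock
    for _ in range(iterations):
        # The opponent is the agent's own historical mixture of actions.
        action = best_response(counts)
        counts[action] += 1
    total = sum(counts)
    return [n / total for n in counts]


if __name__ == "__main__":
    print(self_play(5000))
```

In this toy game the action frequencies drift toward an even mix of the three actions, which is the equilibrium of rock-paper-scissors. The opponent becomes harder to exploit as training proceeds, which is the property that motivates self-play when strong opponents are difficult to construct by hand.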

Key words: air combat, artificial intelligence, deep reinforcement learning, self-play, agent

CLC Number: