导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (17): 530053-530053.doi: 10.7527/S1000-6893.2024.30053

• Articles • Previous Articles    

Hierarchical decision algorithm for air combat with hybrid action based on deep reinforcement learning

Zuolong LI1, Jihong ZHU1(), Minchi KUANG1, Jie ZHANG2, Jie REN2   

  1. 1.Department of Precision Instrument,Tsinghua University,Beijing 100084,China
    2.AVIC Chengdu Flight Design and Research Institute,Chengdu 610091,China
  • Received:2024-01-02 Revised:2024-01-11 Accepted:2024-04-22 Online:2024-04-26 Published:2024-04-25
  • Contact: Jihong ZHU E-mail:jhzhu@tsinghua.edu.cn

Abstract:

Intelligent air combat is a hot research topic among countries with strong military power in the world. To solve the maneuver decision problem of air combat Beyond Visual Range (BVR), we propose the hierarchical decision algorithm based on deep reinforcement learning. In the decision algorithm, we use the maneuver set appropriate to the BVR air combat to control the trajectory and the attitude of the aircraft. To expand the action space of the model and increase its decision-making ability, we hierarchize the action space and model it as the multi-discrete one. To solve the problem of sparse reward in air combat, we design a set of reward function taking into consideration the factors including the position advantage, weapon launching, and weapon threat, which can guide the agent to converge to the optimal policy. We also build a complete digital-twin simulation environment for air combat and an expert system. The decision algorithm is trained in the simulation environment, and is evaluated by fighting with the expert system. The experiment results indicate that the decision algorithm proposed has the ability to make autonomous and flexible decisions in BVR air combat based on current situations, and has some advantages against the expert system.

Key words: air combat beyond visual range, intelligent decision, deep reinforcement learning, proximal policy optimization, maneuver, hierarchical decision

CLC Number: