导航

Acta Aeronautica et Astronautica Sinica ›› 2023, Vol. 44 ›› Issue (S2): 729400-729400.doi: 10.7527/S1000-6893.2023.29400

• Near Space Technology • Previous Articles     Next Articles

Cooperative game guidance method for hypersonic vehicles based on reinforcement learning

Weilin NI1, Yonghai WANG2, Cong XU2, Fenghua CHI2, Haizhao LIANG1()   

  1. 1.School of Aeronautics and Astronautics,Sun Yat-sen University,Shenzhen  518107,China
    2.Science and Technology on Space Physics Laboratory,Beijing  100076,China
  • Received:2023-08-02 Revised:2023-08-03 Accepted:2023-09-04 Online:2023-09-15 Published:2023-09-13
  • Contact: Haizhao LIANG E-mail:lianghch5@mail.sysu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62003375)

Abstract:

The intelligent cooperative game guidance method for hypersonic vehicle active defense attack and defense confrontation in multiple interception scenarios is studied. Aiming at the game problem in which a hypersonic vehicle and an active defense vehicle cooperate against multiple interceptor attacks, we propose an intelligent cooperative game guidance method for a hypersonic vehicle based on a double-delay deep deterministic policy gradient algorithm. It can achieve a high success rate game for multi-interceptors in the case of insufficient maneuverability and response speed of hypersonic aircraft and active defense aircraft. By constructing a class of heuristic continuous reward functions and designing an adaptive progressive curriculum learning method, we propose a fast and stable convergence training method to solve the sparse reward problem in the training process of deep reinforcement learning, and realize the stable and fast convergence of intelligent game algorithms. Finally, the effectiveness of the proposed method is verified by numerical simulation. The simulation results show that the proposed theoretical method can improve the training convergence efficiency and stability, and has a higher game success rate than the traditional game guidance method.

Key words: game theory, reward shaping, curriculum learning, reinforcement learning, hypersonic vehicles

CLC Number: