导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2022, Vol. 43 ›› Issue (9): 325960-325960.doi: 10.7527/S1000-6893.2021.25960

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Generating new quality flight corridor for reentry aircraft based on reinforcement learning

HUI Junpeng, WANG Ren, YU Qidong   

  1. Research and Development Center, China Academy of Launch Vehicle Technology, Beijing 100076, China
  • Received:2021-06-11 Revised:2021-07-05 Online:2022-09-15 Published:2021-08-03

Abstract: The breakthrough of artificial intelligence provides a new technical approach for the research on aircraft reentry guidance. In both of the reference trajectory tracking guidance and predictor-corrector guidance, flight corridor parameters need to be designed based on manual experience in advance. In this paper, we propose to break through the constraint of "conical" flight path envelope, which is common in traditional guidance methods, by taking the natural advantage of reinforcement learning in intelligent decision making. Under the premise of satisfying dynamic equations and hard conditions such as heating rate, load factor and dynamic pressure, a large number of "trial-and-error" interactions can be taken between the aircraft and environment. Effective reward is set by referring to the human's idea of adjusting learning strategies based on feedback. Proximal Policy Optimization (PPO) algorithm in reinforcement learning is employed to train the bank angle guidance model, so as to generate bank angle instruction online based on real-time state information. The "new quality" flight corridor is explored, which is completely different from the traditional guidance method. Monte Carlo simulation analysis verifies that the intelligent guidance technology based on reinforcement learning can fully exploit the advantage of wide range flight of aircraft and further expand the flight profile of aircraft.

Key words: intelligent guidance, new quality flight corridor, reinforcement learning, PPO algorithm, artifical intelligence

CLC Number: