导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2023, Vol. 44 ›› Issue (11): 327416-327416.doi: 10.7527/S1000-6893.2022.27416

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Intelligent guidance for no⁃fly zone avoidance based on reinforcement learning

Junpeng HUI1(), Ren WANG2, Jifeng GUO1   

  1. 1.School of Astronautics,Harbin Institute of Technology,Harbin 150006,China
    2.China Academy of Aerospace Science and Innovation,Beijing 100176,China
  • Received:2022-05-11 Revised:2022-12-08 Accepted:2023-01-17 Online:2023-06-15 Published:2023-02-06
  • Contact: Junpeng HUI E-mail:hjpbuaa@126.com
  • Supported by:
    National Level project

Abstract:

The rapid development of Artificial Intelligence (AI) provides a new technical approach for the research of aircraft guidance. Aiming at the problem of reentry aircraft for avoiding uncertain no-fly zone, we propose the research frame of “predictor-corrector guidance-pre-training of bank angle guidance model based on supervised learning-further training of bank angle guidance model based on reinforcement learning”. On the one hand, lots of flying trajectory for avoiding no-fly zone are produced by predictor-corrector guidance. The bank angle guidance model is pre-trained with supervised learning algorithm. On the other hand, the bank angle guidance model is further trained by the use of Proximal Policy Optimization (PPO) algorithm. A large number of exploration interactions are taken between aircraft and environment with uncertain no-fly-zone. At the same time, the powerful lateral maneuverability of high lift-drag ratio reentry aircraft is exploited with effective reward. Such method will get rid of restriction of bank angle solution space produced by predictor-corrector guidance, which is expected to produce better strategy for avoiding no-fly zone. By comparing with traditional predictor-corrector guidance and intelligent guidance based on supervised learning, it is verified that the no-fly zone intelligent guidance technology based on reinforcement learning can fully exploit the wide area flight advantages of aircraft, so as to meet the adaptability requirements of future intelligent decision system under uncertain scenarios.

Key words: intelligent guidance, no-fly zone avoidance, reinforcement learning, PPO algorithm, supervised learning

CLC Number: