导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2021, Vol. 42 ›› Issue (4): 523810-523810.doi: 10.7527/S1000-6893.2020.23810

• Fluid Mechanics and Flight Mechanics • Previous Articles     Next Articles

Reinforcement learning method for supercritical airfoil aerodynamic design

LI Runze, ZHANG Yufei, CHEN Haixin   

  1. School of Aerospace Engineering, Tsinghua University, Beijing 100084, China
  • Received:2020-01-08 Revised:2020-02-01 Published:2020-02-21
  • Supported by:
    National Natural Science Foundation of China (11872230, 91852108); Innovation Program of Tsinghua University (2015Z22003)

Abstract: Reinforcement learning as a machine learning method for learning policies learns in a way similar to human learning process, interacting with the environment and learning how to achieve more rewards. The elements and algorithms of reinforcement learning are defined and adjusted in this paper for the supercritical airfoil aerodynamic design process. The results of imitation learning are then studied, and the policies from the imitation learning are adopted in reinforcement learning. The influence of different pretraining processes is studied, and the final policies tested in other similar environments. The results show that pretraining can improve reinforcement learning efficiency and policy robustness. The final policies obtained in this study can also have satisfactory performance in other similar environments.

Key words: reinforcement learning, incremental modification, Proximal Policy Optimization (PPO), pretraining, imitation learning, application transferability

CLC Number: