导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2020, Vol. 41 ›› Issue (10): 324040-324040.doi: 10.7527/S1000-6893.2020.24040

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Strategy solution of non-cooperative target pursuit-evasion game based on branching deep reinforcement learning

LIU Bingyan1,2, YE Xiongbing1, GAO Yong2, WANG Xinbo2, NI Lei3   

  1. 1. Academy of Military Sciences, Beijing 100091, China;
    2. 32032 Troops, Beijing 100094, China;
    3. Space Engineering University, Beijing 101416, China
  • Received:2020-03-31 Revised:2020-10-25 Published:2020-11-04

Abstract: To solve the space rendezvous problem between spacecraft and non-cooperative targets and alleviate application limitations of deep reinforcement learning in continuous space, this paper proposes a pursuit-evasion game algorithm based on branching deep reinforcement learning to obtain the space rendezvous strategy. The differential game is used to solve the optimal control problem of space intersection for non-cooperative targets, which is described as a pursuit-evasion game problem under the action of continuous thrust. To avoid the dimension disaster of the traditional deep reinforcement learning in dealing with continuous space, this paper constructs a fuzzy inference model to represent the continuous space, and proposes a branching deep reinforcement learning architecture with multiple parallel neural networks and a shared decision module. The combination of optimal control and game theory is realized, effectively overcoming the difficulty in solving the highly nonlinear differential game model by the classical optimal control theory, and further improving the training ability of deep reinforcement learning on discrete behaviors. Finally, an example is given to verify the effectiveness of the algorithm.

Key words: non-cooperative targets, space rendezvous, pursuit-evasion problem of spacecraft, continuous space, differential game, deep reinforcement learning, branching architectures

CLC Number: