导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (8): 329136-329136.doi: 10.7527/S1000-6893.2023.29136

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Spacecraft game decision making for threat avoidance of space targets based on machine learning

Honglin ZHANG1,2, Jianjun LUO1,2(), Weihua MA1,2   

  1. 1.School of Astronautics,Northwestern Polytechnical University,Xi’an  710072,China
    2.Science and Technology on Aerospace Flight Dynamics Laboratory,Xi’an  710072,China
  • Received:2023-06-06 Revised:2023-08-22 Accepted:2023-11-02 Online:2024-04-25 Published:2023-11-16
  • Contact: Jianjun LUO E-mail:jjluo@mail.nwpu.edu.cn;jjluo@nwpu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(12072269);Foundation of Science and Technology on Aerospace Flight Dynamics Laboratory(6142210210302)

Abstract:

An intelligent decision-making framework and a deep reinforcement learning-based autonomous decision-making method are proposed for the spacecraft decision-making in avoiding the threat of space targets. Taking into account the maneuvering characteristics of space targets and the gameplay of threat avoidance, an intelligent game decision-making framework for spacecraft threat avoidance is proposed based on the Observation-Orientation-Decision-Action (OODA) loop decision-making idea and machine learning techniques. Based on this framework and inference on the motion intentions of space targets, a deep reinforcement learning-based spacecraft maneuver decision-making algorithm and training environment are designed to enable spacecraft decision-making control with game response capability, which realizes the avoidance response to the typical motion intentions of space targets. Furthermore, the generalization of spacecraft autonomous maneuvering decision-making algorithm and its adaptability to possible uncertain maneuvers of space targets are improved by using the self-play learning technique. Finally, the effectiveness of our proposed method is verified through simulations.

Key words: spacecraft maneuver, intelligent decision-making, threat avoidance, OODA loop, deep reinforcement learning

CLC Number: