导航

Acta Aeronautica et Astronautica Sinica ›› 2026, Vol. 47 ›› Issue (8): 332753.doi: 10.7527/S1000-6893.2025.32753

• Electronics and Electrical Engineering and Control • Previous Articles    

Trajectory prediction method of incoming missiles based on improved inverse reinforcement learning in aircraft active defense mode

Hao ZHANG, Jianing LIU, Zhi XU(), Yuanxin YANG   

  1. School of Astronautics,Northwestern Polytechnical University,Xi’an 710129,China
  • Received:2025-09-03 Revised:2025-09-18 Accepted:2025-11-08 Online:2025-12-16 Published:2025-11-20
  • Contact: Zhi XU E-mail:xuzhi@nwpu.edu.cn

Abstract:

With the advancement of aircraft fire control systems and situational awareness capabilities, defense strategies against air-to-air missiles are evolving from passive methods such as jamming and deception to active defense modes involving interceptor missiles countering incoming threats. However, the low average velocity, limited defense space, and insufficient overload ratio of interceptor missiles make it difficult for traditional proportional navigation guidance to meet the precise collision requirements, posing new challenges for trajectory prediction of incoming missiles. To achieve the high-probability prediction of guidance information for interceptor missiles in a three-body active defense scenario involving the carrier aircraft, incoming missile, and interceptor missile, this paper provides an incoming missile trajectory prediction method based on inverse reinforcement learning. First, a mathematical model is constructed to extract the temporal maneuvering characteristics of incoming missiles under the principle of maximum causal entropy, and a behavioral strategy library for the guidance law of incoming missiles is established within the inverse reinforcement learning framework. Then, a quadratic-based calculation formula for the inverse reinforcement learning strategy function is derived, reducing the computational complexity of the strategy function in high-dimensional states. Finally, the weighting coefficients of the strategy function are computed online using rolling window measurement data, enabling real-time optimization and adaptive weighted trajectory prediction distribution to form a real-time prediction model for incoming missile trajectories. Simulation results demonstrate that in the three-body active defense context, the proposed trajectory prediction network algorithm exhibits strong generalization capability in “out-of-model-set/sample-set” scenarios, good dynamic adaptability to complex target maneuvers, and high prediction accuracy. The method provides a high-probability trajectory prediction model suitable for guidance in defense, and thus has notable theoretical significance and engineering application value.

Key words: three-body active defense scenario, guided missiles, active defense, inverse reinforcement learning, trajectory prediction

CLC Number: