Research on Trajectory Prediction Method of Incoming Missiles Based on Improved Inverse Reinforcement Learning in Aircraft Active Defense Mode

  • ZHANG Hao,
  • LIU Jia-Ning,
  • XU Zhi,
  • YANG Yuan-Xin

Received date: 2025-09-03

Revised date: 2025-11-17

Published online: 2025-11-20

Abstract

With advances in aircraft fire control systems and situational awareness, defense against air-to-air missiles is shifting from passive methods such as jamming and deception to active defense, in which interceptor missiles engage incoming threats. However, the low average velocity, limited defense space, and insufficient overload ratio of interceptor missiles make it difficult to meet the precise collision requirements of traditional proportional navigation guidance, which poses new challenges for trajectory prediction of incoming missiles. This paper addresses the high-probability prediction of guidance information for interceptor missiles in a three-body active defense scenario involving the carrier aircraft, the incoming missile, and the interceptor missile, and proposes a trajectory prediction method for incoming missiles based on inverse reinforcement learning. First, a mathematical model is constructed to extract the temporal maneuvering characteristics of incoming missiles under the maximum causal entropy principle, and a behavioral strategy library for the guidance law of incoming missiles is established within the inverse reinforcement learning framework. Then, a quadratic-form calculation formula for the inverse reinforcement learning strategy function is derived, which reduces the computational complexity of the strategy function in high-dimensional state spaces. Finally, the weighting coefficients of the strategy functions are computed online from rolling-window measurement data, enabling real-time optimization and an adaptively weighted trajectory prediction distribution that together form a real-time prediction model for incoming missile trajectories. Simulation results show that, in the three-body active defense context, the proposed trajectory prediction algorithm generalizes well to "out-of-model-set/sample-set" scenarios, adapts dynamically to complex target maneuvers, and achieves high prediction accuracy, providing a high-probability trajectory prediction model that can be used for guidance when defending against incoming missiles. The work has theoretical significance and offers useful insights for engineering applications.
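
The final step of the abstract (online weighting of a strategy library from rolling-window measurements) can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything here is an assumption made for illustration: the weighting rule is a softmax over the windowed squared prediction error, used as a stand-in for the paper's weighting coefficients, and the names `RollingWeightedPredictor`, `straight`, `make_turn`, and the time step `DT` are hypothetical. The two toy planar strategies stand in for entries of a guidance-law behavior library.

```python
import math
from collections import deque

DT = 0.1  # assumed measurement interval in seconds (illustrative only)


def straight(state):
    """Hypothetical library entry: no maneuver, constant velocity."""
    x, y, vx, vy = state
    return (x + vx * DT, y + vy * DT, vx, vy)


def make_turn(a_lat):
    """Hypothetical library entry: constant acceleration a_lat along y."""
    def turn(state):
        x, y, vx, vy = state
        return (x + vx * DT, y + vy * DT + 0.5 * a_lat * DT ** 2,
                vx, vy + a_lat * DT)
    return turn


def softmax_weights(costs, temperature=1.0):
    """Map per-strategy costs to weights; a lower cost gives a higher weight."""
    lowest = min(costs)  # subtract the minimum for numerical stability
    exps = [math.exp(-(c - lowest) / temperature) for c in costs]
    total = sum(exps)
    return [e / total for e in exps]


class RollingWeightedPredictor:
    """Blend one-step predictions from a strategy library.

    Weights come from each strategy's squared one-step prediction error,
    summed over the most recent `window` measurements.
    """

    def __init__(self, strategies, window=5, temperature=1.0):
        self.strategies = strategies
        self.temperature = temperature
        # One bounded error history per strategy; old entries drop out.
        self.errors = [deque(maxlen=window) for _ in strategies]

    def update(self, prev_state, measured_state):
        """Score every strategy against the newest measurement."""
        for history, strategy in zip(self.errors, self.strategies):
            predicted = strategy(prev_state)
            history.append(sum((p - m) ** 2
                               for p, m in zip(predicted, measured_state)))

    def weights(self):
        """Current weights; uniform until measurements arrive."""
        costs = [sum(history) for history in self.errors]
        return softmax_weights(costs, self.temperature)

    def predict(self, state):
        """Weighted average of the library's one-step predictions."""
        w = self.weights()
        preds = [strategy(state) for strategy in self.strategies]
        return tuple(sum(wi * p[k] for wi, p in zip(w, preds))
                     for k in range(len(state)))
```

In use, `update` is called once per new measurement and `predict` is called whenever a forecast is needed; because each error history is bounded by `window`, a strategy that matched earlier behavior loses weight soon after the target changes its maneuver. The `temperature` parameter controls how sharply the weights concentrate on the best-fitting strategy and would need tuning against the measurement noise level.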

Cite this article

ZHANG Hao, LIU Jia-Ning, XU Zhi, YANG Yuan-Xin. Research on Trajectory Prediction Method of Incoming Missiles Based on Improved Inverse Reinforcement Learning in Aircraft Active Defense Mode[J]. ACTA AERONAUTICA ET ASTRONAUTICA SINICA, 0: 1-0. DOI: 10.7527/S1000-6893.2025.32753
