航空学报 > 2024, Vol. 45 Issue (S1): 730873-730873   doi: 10.7527/S1000-6893.2024.30873

逆轨拦截的目标命中点分支预测智能制导算法

万清橙, 余萌(), 李玉报, 王寅   

  1. 南京航空航天大学 航天学院,南京 210016
  • 收稿日期:2024-06-26 修回日期:2024-07-17 接受日期:2024-08-19 出版日期:2024-12-25 发布日期:2024-09-02
  • 通讯作者: 余萌 E-mail:yuxy21@nuaa.edu.cn
  • 基金资助:
    国家自然科学基金(U20B2001);青年科技创新基金(NT2024018)

Intelligent guidance algorithm for target hit point branch prediction for head-on interception

Qingcheng WAN, Meng YU(), Yubao LI, Yin WANG   

  1. College of Aerospace Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China
  • Received:2024-06-26 Revised:2024-07-17 Accepted:2024-08-19 Online:2024-12-25 Published:2024-09-02
  • Contact: Meng YU E-mail:yuxy21@nuaa.edu.cn
  • Supported by:
    National Natural Science Foundation of China(U20B2001);Youth Science and Technology Innovation Fund(NT2024018)

摘要:

为了实现逆轨拦截机动目标时达到最大拦截末速的目标,基于序列到序列方法构建目标机动弹道分支预测模型,基于深度Q学习算法和偏置制导律构建深度强化学习智能制导律。对于智能制导律训练过程中引发的稀疏奖励问题,采用预测-校正方法引入末制导的导引比以构建终端奖励,结合物理过程和性能指标构建过程奖励,结合过程奖励与终端奖励以提升训练效果。仿真表明目标机动弹道分支预测模型与弹道外推方法相比,在分方向上的机动弹道平均预测精度至少提高67%;在达到交班性能要求和低过载要求的前提下,智能制导律相比于基线制导律在相对拦截速度上提升67%。

关键词: 机动目标, 逆轨拦截, 弹道预测, 预测命中点, 智能制导律

Abstract:

To achieve the maximum interception terminal velocity when intercepting maneuvering targets in the contra-orbit, this paper constructs a target maneuvering ballistic branch prediction model based on the sequence-to-sequence method, and constructs a deep reinforcement learning intelligent guidance law based on the deep Q-learning algorithm and the bias guidance law. To address the sparse reward problem caused by the training process of the smart guidance law, the prediction-correction method is used to introduce the guidance ratio of the terminal guidance to construct the terminal reward, and the process reward is constructed by combining the physical process and performance indexes. The process reward and terminal reward are combined to improve the training effect. Simulation shows that the target maneuvering ballistic branch prediction model improves the average prediction accuracy of the maneuvering ballistic in the sub-direction by at least 67% compared with the ballistic extrapolation method, and the intelligent guidance law improves the relative interception speed by 67% compared with the baseline guidance law on the premise of meeting the requirements of shift performance and low overload.

Key words: maneuvering target, head-on interception, trajectory prediction, predicted hit point, intelligent guidance law

中图分类号: