航空学报 > 2025, Vol. 46 Issue (10): 331304-331304   doi: 10.7527/S1000-6893.2024.31304

考虑拦截器探测能力限制的飞行器智能机动突防制导策略

高树一1, 林德福1, 郑多1(), 徐骋2   

  1. 1.北京理工大学 宇航学院,北京 100081
    2.复杂系统控制与智能协同全国重点实验室,北京 100074
  • 收稿日期:2024-09-30 修回日期:2024-10-29 接受日期:2024-12-04 出版日期:2024-12-10 发布日期:2024-12-10
  • 通讯作者: 郑多 E-mail:zhengduohello@126.com
  • 基金资助:
    国家自然科学基金青年基金项目(61903350);教育部产学研创新项目(2021ZYA02002);北京理工大学青年教师学术启动计划(3010011182130)

Intelligent maneuvering penetration guidance strategies for aerial vehicles considering interceptor detection capability limitations

Shuyi GAO1, Defu LIN1, Duo ZHENG1(), Cheng XU2   

  1. 1.School of Aerospace Engineering,Beijing Institute of Technology,Beijing 100081,China
    2.Science and Technology on Complex System Control and Intelligent Agent Cooperative Laboratory,Beijing 100074,China
  • Received:2024-09-30 Revised:2024-10-29 Accepted:2024-12-04 Online:2024-12-10 Published:2024-12-10
  • Contact: Duo ZHENG E-mail:zhengduohello@126.com
  • Supported by:
    National Natural Science Foundation of China(61903350);Ministry of Education’s Industry-University-Research Innovation Project(2021ZYA02002);Beijing Institute of Technology Research Fund Program for Young Scholars(3010011182130)

摘要:

随着防空反导拦截技术和装备的发展,进攻飞行器面临被防御武器拦截导致的战场生存率低、效能差等问题。针对拦截器拦截场景下的飞行器智能攻防博弈对抗问题,提出了一种考虑拦截器探测能力限制的飞行器智能机动突防制导策略。首先定义了拦截器视线角和探测范围,并利用拦截器与进攻飞行器之间的相对运动关系来描述攻防双方对抗态势的演变,进而基于深度强化学习原理设计了近端策略优化制导方法,构建了引导飞行器主动摆脱拦截器探测的马尔可夫决策链,并进一步优化飞行器奖励函数设计方法实现对目标的精确打击。在此基础上,通过引入信任动作探索技术来解决智能算法收敛慢的问题。仿真结果表明,智能机动突防制导策略赋予了飞行器自主学习优化属性,可以通过主动规避机动增加拦截器的探测难度,最终突破拦截器的探测能力极限实现突防逃逸。相比于传统PN-sin突防制导方法,提出的突防制导策略能够在攻防双方非对称机动能力场景下保持更高的突防成功率。

关键词: 飞行器突防博弈对抗, 近端策略优化, 拦截器探测能力限制, 深度强化学习, 突防制导一体化

Abstract:

In the face of the development of air defense and missile defense interception technologies and equipment, attack aircraft are confronted with issues such as low battlefield survivability and poor effectiveness due to the interception by defensive weapons. To address the aircraft penetration game and countermeasures problem under the interception scenario, this research proposes an intelligent maneuvering penetration guidance strategy for aircraft that considers the limitations of the interceptor detection capabilities. Initially, the interceptor’s line-of-sight angle and detection range are defined, and the relative motion relationship between the interceptor and the attack aircraft is used to describe the evolution of the confrontation situation between the attack and the defense sides. Subsequently, a proximal policy optimization guidance method is designed based on the principles of deep reinforcement learning, constructing a Markov decision chain that guides the aircraft to actively evade the interceptor’s detection, and further optimizing the aircraft’s reward function design method to achieve precise targeting. On this basis, the convergence speed of the intelligent algorithm is addressed by introducing action exploration and generalized advantage functions. Simulation results show that the intelligent maneuvering penetration guidance strategy endows the aircraft with autonomous learning and optimization attributes, allowing it to increase the difficulty of detection for the interceptor through active evasive maneuvers, ultimately breaking through the detection capability limit of the interceptor to achieve penetration and escape. Compared with the traditional PN-sin penetration guidance method, the penetration guidance strategy proposed in this paper can maintain a higher penetration success rate in scenarios where the attack and defense sides have asymmetric maneuvering capabilities.

Key words: aircraft penetration game and countermeasures, proximal policy optimization, interceptor detection capability limitations, deep reinforcement learning, integrated penetration and guidance

中图分类号: