首页 >

基于DACM-PPO的机载末端红外复合干扰智能决策

韩滟泷1,张安1,毕文豪1,范秋岑2,侯天乐1   

  1. 1. 西北工业大学
    2. 西北工业大学航空学院
  • 收稿日期:2025-09-08 修回日期:2025-12-04 出版日期:2025-12-08 发布日期:2025-12-08
  • 通讯作者: 毕文豪
  • 基金资助:
    面向动态多任务的无人机多模集群敏捷协同控制研究

Intelligent decision-making of airborne terminal infrared composite jamming based on DACM-PPO

  • Received:2025-09-08 Revised:2025-12-04 Online:2025-12-08 Published:2025-12-08
  • Contact: Wen-Hao BI

摘要: 随着红外制导空空导弹制导精度和机动能力的不断提升,作战飞机通过机动规避或单一红外干扰难以有效规避红外导弹命中风险,红外复合干扰成为保障飞机生存的重要途径。针对机载末端红外复合干扰问题,提出了一种基于改进近端策略优化算法的机载末端红外复合干扰智能决策方法。从机载末端对抗场景出发,分析了作战飞机在红外制导导弹攻击下的决策约束,建立了红外诱饵弹与激光定向干扰模型,提出了一种动态非对称裁剪机制,克服裁剪参数固定僵化的局限,提升收敛效率与求解质量,在此基础上设计了融合干扰手段特性的奖励函数,并引入过量使用惩罚项和无效使用惩罚项,实现干扰效能与资源消耗之间的合理平衡。仿真结果表明,红外复合干扰智能决策方法够以合理的协同方式组织红外干扰手段,在多种典型机弹对抗态势下表现出良好性能,相较原始近端策略优化算法、柔性动作-评价算法及基于预设规则的方法,在飞机存活率、导弹脱靶量和资源利用效率等指标上均具有显著优势,具有良好应用价值。

关键词: 机载末端防御, 红外复合干扰, 深度强化学习, 红外诱饵弹, 激光定向干扰

Abstract: With the continuous improvement in the guidance accuracy and maneuverability of infrared-guided air-to-air missiles, combat aircraft find it increasingly difficult to effectively evade the risk of infrared missile hits through maneuvering avoidance or single infrared countermeasures alone. Composite infrared countermeasures have thus become a critical means to ensure aircraft survivability. Addressing the challenge of airborne terminal composite infrared countermeasures, this study proposes an intelligent decision-making method based on an improved proximity strategy optimization algorithm. Starting from the terminal airborne countermeasure scenario, this study analyzes the decision constraints faced by combat aircraft under infrared-guided missile attacks. It establishes models for infrared decoy missiles and laser directional jammers, proposing a dynamic asymmetric trimming mechanism to overcome the limitations of fixed trimming parameters, thereby enhancing convergence efficiency and solution quality. Building upon this foundation, a reward function integrating jamming means characteristics is designed, incorporating overuse penalties and ineffective use penalties to achieve a reasonable balance between jamming effectiveness and resource consumption. Simulation results demonstrate that the intelligent decision-making method for infrared composite jamming can organize infrared jamming measures in a reasonably coordinated manner, exhibiting excellent performance under various typical aircraft-missile confrontation scenarios. Compared with the original near-end strategy optimization algorithm, the flexible action-evaluation algorithm, and the preset rule-based method, it shows significant advantages in metrics such as aircraft survivability, missile miss distance, and resource utilization efficiency, demonstrating good application value.

Key words: airborne terminal defense, infrared composite jamming, deep reinforcement learning, infrared decoy bombs, laser directional jamming

中图分类号: