A Manned/Unmanned Aerial Vehicle Cooperative Interpretable Method for Intelligent Air Combat

XIONG Wei1, ZHANG Dong1, YANG Shuheng1, REN Zhi1,2, LIU Wenyi3

  1. Northwestern Polytechnical University
    2. Shaanxi Key Laboratory of Aerospace Vehicle Design, Northwestern Polytechnical University
    3. Northwest Institute of Mechanical & Electrical Engineering
  • Received: 2025-07-10 Revised: 2025-11-22 Online: 2025-11-25 Published: 2025-11-25
  • Corresponding author: ZHANG Dong
  • Supported by:
    National Natural Science Foundation of China



Abstract: Manned/unmanned aerial vehicle (UAV) cooperation represents a critical operational paradigm for future air combat, in which deep reinforcement learning serves as a key enabling technology. However, the "black-box" nature of deep reinforcement learning renders the learned policies difficult to interpret and trust, making explainable deep reinforcement learning essential for achieving intelligent cooperative air combat. This paper proposes an explainable deep reinforcement learning method based on a Bayesian-Shapley framework, which enables interpretable modeling and verification of the decision-making process and thereby helps pilots understand the rationale behind UAV decisions. The approach first constructs a decision-intent analysis framework for cooperative missions using dynamic Bayesian networks, capable of locating critical decision nodes in trajectory segments. It then employs a Shapley-value contribution assessment algorithm to provide a state-level quantitative analysis of the decision rationale at these key nodes. Finally, by reconstructing the state input space of the deep reinforcement learning model, the method significantly improves interpretability and trustworthiness while preserving the original policy performance, and the validity of the explanations is verified through state-space ablation simulations.
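The Shapley contribution step described in the abstract can be illustrated with a minimal sketch: exact Shapley values of each state feature with respect to a policy's action score, computed by enumerating feature coalitions and masking absent features with a baseline. The toy linear score, the example air-combat feature names, and the zero baseline are illustrative assumptions, not the paper's actual policy network.

```python
from itertools import combinations
from math import factorial

def shapley_values(score, state, baseline):
    """Exact Shapley contribution of each state feature to score(state).

    Features outside a coalition are replaced by their baseline values;
    each feature's value is its coalition-weighted marginal contribution.
    """
    n = len(state)
    phi = [0.0] * n
    players = list(range(n))
    for i in players:
        others = [j for j in players if j != i]
        for k in range(n):  # coalition sizes 0 .. n-1 among the other features
            for S in combinations(others, k):
                weight = factorial(len(S)) * factorial(n - len(S) - 1) / factorial(n)
                with_i = [state[j] if (j in S or j == i) else baseline[j] for j in players]
                without_i = [state[j] if j in S else baseline[j] for j in players]
                phi[i] += weight * (score(with_i) - score(without_i))
    return phi

# Toy "policy score" over normalized state features, e.g.
# [relative distance, closure rate, aspect angle] (hypothetical labels).
score = lambda s: 2.0 * s[0] - 1.0 * s[1] + 0.5 * s[2]
state = [0.8, 0.3, 0.6]
baseline = [0.0, 0.0, 0.0]

# For a linear score with a zero baseline, feature i's Shapley value
# reduces to w_i * s_i, and the values sum to score(state) - score(baseline).
print(shapley_values(score, state, baseline))
```

For the real method, `score` would be the critic's value (or an action logit) of the trained deep reinforcement learning policy, and exact enumeration would be replaced by a sampling approximation, since the exact computation is exponential in the number of state features.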

Key words: Human-machine collaboration, Deep reinforcement learning, Interpretability, Intelligent air combat, Intention identification

CLC number: