导航

Acta Aeronautica et Astronautica Sinica ›› 2026, Vol. 47 ›› Issue (7): 332547.doi: 10.7527/S1000-6893.2025.32547

• Electronics and Electrical Engineering and Control • Previous Articles    

Manned/unmanned aerial vehicle collaborative interpretable method for intelligent air combat

Wei XIONG1, Dong ZHANG1(), Shuheng YANG1, Zhi REN1, Wenyi LIU2   

  1. 1. School of Astronautics,Northwestern Polytechnical University,Xi’an 710072,China
    2. Northwest Institute of Mechanical & Electrical Engineering,Xianyang 712099,China
  • Received:2025-07-10 Revised:2025-08-11 Accepted:2025-11-10 Online:2025-11-26 Published:2025-11-25
  • Contact: Dong ZHANG
  • Supported by:
    National Natural Science Foundation of China(52472417)

Abstract:

Manned/Unmanned Aerial Vehicle (M/UAV) teaming represents a critical operational paradigm for future air combat, where deep reinforcement learning serves as a key enabling technology. However, the “black-box nature” of deep reinforcement learning renders the learned strategies difficult to interpret and trust, making interpretable deep reinforcement learning essential for achieving intelligent air combat collaboration. This paper proposes a deep reinforcement learning interpretation method based on the Bayesian Shapley framework, realizes the interpretability modeling and verification analysis of the decision-making process, and achieves the goal of explaining the decision-making basis of UAV. The proposed approach first constructs a decision intent analysis framework for cooperative missions using dynamic Bayesian networks, capable of identifying critical decision nodes in trajectory segments. Subsequently, the Shapley value-based contribution assessment algorithm is employed to achieve state-level quantitative analysis of decision rationale at key nodes. Finally, by reconstructing the state input space of the deep reinforcement learning model, the method significantly enhances model interpretability and trustworthiness while maintaining original policy performance, with the effectiveness of the explanatory results validated through state space ablation simulations.

Key words: human machine collaboration, deep reinforcement learning, interpretability, intelligent air combat, intention identification

CLC Number: