导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (24): 331872.doi: 10.7527/S1000-6893.2025.31872

• Electronics and Electrical Engineering and Control • Previous Articles    

An explainable decision-making method for resource allocation in IMA system based on PPO-SHAP

Jiachen LIU1,2, Lei DONG1,3(), Zijing SUN4, Ye NI5, Xi CHEN1,3, Peng WANG1,3   

  1. 1.Key Laboratory of Civil Aircraft Airworthiness Technology,Civil Aviation University of China,Tianjin 300300,China
    2.College of Safety Science and Engineering,Civil Aviation University of China,Tianjin 300300,China
    3.Science and Technology Innovation Research Institute,Civil Aviation University of China,Tianjin 300300,China
    4.AVIC Xi’an Aeronautics Computing Technology Research Institute,Xi’an 710065,China
    5.COMAC Shanghai Aircraft Flight Test Co. ,Ltd. ,Shanghai 200232,China
  • Received:2025-02-13 Revised:2025-04-01 Accepted:2024-06-26 Online:2025-07-16 Published:2025-07-15
  • Contact: Lei DONG E-mail:l-dong@cauc.edu.cn
  • Supported by:
    Ministerial-level Project

Abstract:

With the development of aviation Artificial Intelligence (AI) technology, the intelligent application software residing on the Integrated Modular Avionics (IMA) platform faces challenges such as resource scarcity and unreliable decision-making. Firstly, based on the consideration of avionics system architectural elements, the optimisation objective of IMA resource allocation decision-making under multiple constraints is designed. The proximal policy optimization algorithm is used to solve the near-optimal allocation scheme of IMA resources in the sequential decision-making process. Next, a decision attribution explanation framework for IMA resource allocation is established, and the expert policy of the reinforcement learning agent is extracted by aggregating and resampling the training dataset. Then, the SHapley Additive exPlanations (SHAP) method is used to achieve a combined global and local explanation of IMA resource allocation decisions. Simulation experiments and result analysis show that, compared with the greedy algorithm and other policy-based reinforcement learning algorithms, the proposed method exhibits good convergence speed and learning effect, and is remarkably superior in solving the IMA resource allocation problem. Additionally, this method generates quantitative and visual feature attribution explanations, which reveals the impact of input features on decision-making and clarify decision intent, thereby providing methodological guidance for airworthiness compliance validation of AI-based avionics system in terms of explainability.

Key words: explainability, integrated modular avionics, proximal policy optimization, resource allocation, single-pilot operations, SHapley additive exPlanations

CLC Number: