Electronics and Electrical Engineering and Control

An explainable decision-making method for resource allocation in IMA system based on PPO-SHAP

  • Jiachen LIU ,
  • Lei DONG ,
  • Zijing SUN ,
  • Ye NI ,
  • Xi CHEN ,
  • Peng WANG
Expand
  • 1.Key Laboratory of Civil Aircraft Airworthiness Technology,Civil Aviation University of China,Tianjin 300300,China
    2.College of Safety Science and Engineering,Civil Aviation University of China,Tianjin 300300,China
    3.Science and Technology Innovation Research Institute,Civil Aviation University of China,Tianjin 300300,China
    4.AVIC Xi’an Aeronautics Computing Technology Research Institute,Xi’an 710065,China
    5.COMAC Shanghai Aircraft Flight Test Co. ,Ltd. ,Shanghai 200232,China

Received date: 2025-02-13

  Revised date: 2025-04-01

  Accepted date: 2024-06-26

  Online published: 2025-07-15

Supported by

Ministerial-level Project

Abstract

With the development of aviation Artificial Intelligence (AI) technology, the intelligent application software residing on the Integrated Modular Avionics (IMA) platform faces challenges such as resource scarcity and unreliable decision-making. Firstly, based on the consideration of avionics system architectural elements, the optimisation objective of IMA resource allocation decision-making under multiple constraints is designed. The proximal policy optimization algorithm is used to solve the near-optimal allocation scheme of IMA resources in the sequential decision-making process. Next, a decision attribution explanation framework for IMA resource allocation is established, and the expert policy of the reinforcement learning agent is extracted by aggregating and resampling the training dataset. Then, the SHapley Additive exPlanations (SHAP) method is used to achieve a combined global and local explanation of IMA resource allocation decisions. Simulation experiments and result analysis show that, compared with the greedy algorithm and other policy-based reinforcement learning algorithms, the proposed method exhibits good convergence speed and learning effect, and is remarkably superior in solving the IMA resource allocation problem. Additionally, this method generates quantitative and visual feature attribution explanations, which reveals the impact of input features on decision-making and clarify decision intent, thereby providing methodological guidance for airworthiness compliance validation of AI-based avionics system in terms of explainability.

Cite this article

Jiachen LIU , Lei DONG , Zijing SUN , Ye NI , Xi CHEN , Peng WANG . An explainable decision-making method for resource allocation in IMA system based on PPO-SHAP[J]. ACTA AERONAUTICAET ASTRONAUTICA SINICA, 2025 , 46(24) : 331872 -331872 . DOI: 10.7527/S1000-6893.2025.31872

References

[1] LECUN Y, BENGIO Y, HINTON G. Deep learning[J]. Nature2015521(7553): 436-444.
[2] Systems Aviage. Cockpit automatic speech recognition[EB/OL]. (2022-04-13)[2025-01-20]. .
[3] Daedalean. Visual traffic detection: Noticing all fixed wing, rotorcraft and drones in time[EB/OL]. (2023-04-01)[2025-01-20]. .
[4] Acubed. Developing autonomous flight and machine learning solutions for the next generation of aircraft[EB/OL]. (2024-10-02)[2025-01-20]. .
[5] KAAKAI F, DMITRIEV K, ADIBHATLA S, et al. Toward a machine learning development lifecycle for product certification and approval in aviation[J]. SAE International Journal of Aerospace202215(2):127-143.
[6] EASA. Artificial intelligence roadmap 2.0[R]. Cologne: European Union Aviation Safety Agency, 2023.
[7] FAA. Roadmap for artificial intelligence safety assurance[R]. Washington, D.C.: Federal Aviation Administration, 2024.
[8] 工业和信息化部. 绿色航空制造业发展纲要[R]. 北京: 工业和信息化部, 2023.
  MIIT. Green aviation manufacturing development programme[R]. Beijing: Ministry of Industry and Information Technology, 2023 (in Chinese).
[9] 中国民用航空局. 民航中长期科学和技术发展规划纲要[R]. 北京: 中国民用航空局, 2023.
  CAAC. Outline of the medium and long-term science and technology development plan for civil aviation[R]. Beijing: Civil Aviation Administration of China, 2023 (in Chinese).
[10] GASKA T, WATKIN C, CHEN Y. Integrated modular avionics-past, present, and future[J]. IEEE Aerospace and Electronic Systems Magazine201530(9): 12-23.
[11] 周贵荣, 徐见源, 马少博, 等. 大型客机航电系统综合集成关键技术综述[J]. 航空学报202445(5): 529956.
  ZHOU G R, XU J Y, MA S B, et al. Review of key technologies for avionics systems integration on large passenger aircraft[J]. Acta Aeronautica et Astronautica Sinica202445(5): 529956 (in Chinese).
[12] LIM Y, BASSIEN-CAPSA V, RAMASAMY S, et al. Commercial airline single-pilot operations: System design and pathways to certification[J]. IEEE Aerospace and Electronic Systems Magazine201732(7): 4-21.
[13] YIN J, ZHU Z Q. Flight autonomy impact to the future avionics architecture[C]∥ 2018 IEEE/AIAA 37th Digital Avionics Systems Conference (DASC). Piscataway: IEEE Press, 2018: 1-7.
[14] ZOLGHADRI A. Augmented flight management system for future single pilot operations[C]∥ 2023 9th International Conference on Control, Decision and Information Technologies (CoDIT). Piscataway: IEEE Press, 2023: 29-34.
[15] 任宝平, 李创. 机载系统的智能架构及功能分析[J]. 航空工程进展202314(4): 149-157.
  REN B P, LI C. Analysis on intelligent architecture and function of airborne system[J]. Advances in Aeronautical Science and Engineering202314(4): 149-157 (in Chinese).
[16] IQBAL Z, LEHMANN M, LUETTIG B, et al. Introducing ML to IMA technology-system perspective[C]∥ 2024 AIAA DATC/IEEE 43rd Digital Avionics Systems Conference (DASC). Piscataway: IEEE Press, 2024: 1-10.
[17] EASA. Concepts of design assurance for neural networks (CoDANN)[R]. Cologne: European Union Aviation Safety Agency, 2020.
[18] DEY S, LEE S W. A multi-layered collaborative framework for evidence-driven data requirements engineering for machine learning-based safety-critical systems[C]∥ Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing. New York: ACM, 2023: 1404-1413.
[19] LHACHEMI H, DE AZUA ORTEGA J A R, SAUSSIé D, et al. Partition modeling and optimization of ARINC 653 operating systems in the context of IMA[C]∥ 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC). Piscataway: IEEE Press, 2016: 1-8.
[20] KHAMVILAI T, SUTTER L, BAUFRETON P, et al. Decentralized task reallocation on parallel computing architectures targeting an avionics application[J]. Journal of Optimization Theory and Applications2021191(2): 874-898.
[21] DEROCHE E, SCHARBARG J L, FRABOUL C. A greedy heuristic for distributing hard real-time applications on an IMA architecture[C]∥ 2017 12th IEEE International Symposium on Industrial Embedded Systems (SIES). Piscataway: IEEE Press, 2017: 1-8.
[22] CHU J Y, ZHAO T D, JIAO J, et al. Optimal design of configuration scheme for integrated modular avionics systems with functional redundancy requirements[J]. IEEE Systems Journal202115(2): 2665-2676.
[23] ZHANG T, CHEN J Y, LV D J, et al. Automatic generation of reconfiguration blueprints for IMA systems using reinforcement learning[J]. IEEE Embedded Systems Letters202113(4): 182-185.
[24] 张涛, 张文涛, 代凌, 等. 基于序贯博弈多智能体强化学习的综合模块化航空电子系统重构方法[J]. 电子学报202250(4): 954-966.
  ZHANG T, ZHANG W T, DAI L, et al. Integrated modular avionics system reconstruction method based on sequential game multi-agent reinforcement learning[J]. Acta Electronica Sinica202250(4): 954-966 (in Chinese).
[25] 赵长啸, 李道俊, 孙亦轩, 等. 基于深度强化学习的综合航电系统安全性优化方法[J]. 中国安全科学学报202434(7): 123-131.
  ZHAO C X, LI D J, SUN Y X, et al. Integrated avionics system safety optimization method based on deep reinforcement learning[J]. China Safety Science Journal202434(7): 123-131 (in Chinese).
[26] CHENG J, TAN W, LV G Z, et al. A method for solving reconfiguration blueprints based on multi-agent reinforcement learning[J]. Computer Science and Information Systems202421(4): 1335-1357.
[27] MILANI S, TOPIN N, VELOSO M, et al. Explainable reinforcement learning: A survey and comparative review[J]. ACM Computing Surveys202456(7): 1-36.
[28] 曹宏业, 刘潇, 董绍康, 等. 面向强化学习的可解释性研究综述[J]. 计算机学报202447(8): 1853-1882.
  CAO H Y, LIU X, DONG S K, et al. A survey of interpretability research methods for reinforcement learning[J]. Chinese Journal of Computers202447(8): 1853-1882 (in Chinese).
[29] EASA. Concept paper: Guidance for Level 1 & 2 machine learning applications[R]. Cologne: European Union Aviation Safety Agency, 2024.
[30] Committee G-34. Recommended practice for development and certification/approval of aeronautical safety-related products implementing ML: SAE ARP6983 Draft 6B[EB/OL]. (2023-06-26)[2025-01-24]. .
[31] DEEL. White paper machine learning in certified systems: Ref-S079L03T00-005[R]. Toulouse: DEpendable Explainable Learning, 2021.
[32] 谭龙华, 杜承烈. ARINC 653分区实时系统的主时间框架设计[J]. 北京航空航天大学学报201642(11): 2413-2422.
  TAN L H, DU C L. Design of major time frame for ARINC 653 partitioned real-time systems[J]. Journal of Beijing University of Aeronautics and Astronautics201642(11): 2413-2422 (in Chinese).
[33] 赵长啸, 何锋, 李浩, 等. 基于效能的先进战斗机航电系统动态重构方法[J]. 航空学报202041(6): 523416.
  ZHAO C X, HE F, LI H, et al. Dynamic reconfiguration method based on effectiveness for advanced fighter avionics system[J]. Acta Aeronautica et Astronautica Sinica202041(6): 523416 (in Chinese).
[34] 徐文, 熊智勇, 张国全. 基于Harmony系统工程的IMA应用开发[J]. 北京航空航天大学学报201541(11): 2067-2077.
  XU W, XIONG Z Y, ZHANG G Q. IMA application development based on Harmony system engineering[J]. Journal of Beijing University of Aeronautics and Astronautics201541(11): 2067-2077 (in Chinese).
[35] 何锋. 航空电子系统综合调度理论与方法[M]. 北京: 清华大学出版社, 2017: 13-20.
  HE F. Theory and approach to avionics system integrated scheduling[M]. Beijing: Tsinghua University Press, 2017: 13-20 (in Chinese).
[36] ZHOU T R, XIONQ H, ZHANG Z. Hierarchical resource allocation for integrated modular avionics systems[J]. Journal of Systems Engineering and Electronics201122(5): 780-787.
[37] ROY A, BORKAR V, KARANDIKAR A, et al. Online reinforcement learning of optimal threshold policies for Markov decision processes[J]. IEEE Transactions on Automatic Control202267(7): 3722-3729.
[38] SCHULMAN J, WOLSKI F, DHARIWAL P, et al. Proximal policy optimization algorithms[DB/OL]. arXiv preprint :1707.06347, 2017.
[39] CHENG Y H, HUANG L Y, WANG X S. Authentic boundary proximal policy optimization[J]. IEEE Transactions on Cybernetics202252(9): 9428-9438.
[40] ROSS S, GORDON G J, BAGNELL J A. A reduction of imitation learning and structured prediction to no-regret online learning[C]∥Proceedings of the 14th international conference on artificial intelligence and statistics (AISTATS), 2011: 627-635.
[41] AHN J, SHIN S, KOO J, et al. Data aggregation (DAgger) algorithm using adversarial agent policy for dynamic situations[C]∥Intelligent Autonomous Systems 18. Cham: Springer, 2024: 91-104.
[42] 王诗彬, 王世傲, 陈雪峰, 等. 可解释性智能监测诊断网络构造及航空发动机整机试车与中介轴承诊断应用[J]. 机械工程学报202460(12): 90-106.
  WANG S B, WANG S A, CHEN X F, et al. Interpretable network construction for intelligent monitoring and diagnosis, and application in inter-shaft bearing diagnosis while aero-engine test[J]. Journal of Mechanical Engineering202460(12): 90-106 (in Chinese).
[43] LUNDBERG S. A unified approach to interpreting model predictions[DB/OL]. arXiv preprint: 1705.07874, 2017.
[44] LUNDBERG S M, ERION G, CHEN H, et al. From local explanations to global understanding with explainable AI for trees[J]. Nature Machine Intelligence20202(1): 56-67.
[45] CAPPI C, COHEN N, DUCOFFE M, et al. How to design a dataset compliant with an ML-based system ODD?[DB/OL]. arXiv preprint: 2406.14027, 2024.
[46] GIRIJA A A, CHRISTENSEN J M, STEFANI T, et al. Towards the monitoring of operational design domains using temporal scene analysis in the realm of artificial intelligence in aviation[C]∥ 2024 AIAA DATC/IEEE 43rd Digital Avionics Systems Conference (DASC). Piscataway: IEEE Press, 2024: 1-8.
[47] SAE. Guidelines for conducting the safety assessment process on civil aircraft, systems, and equipment: SAE ARP 4761A-2023 [S]. Warrendale: Society of Automotive Engineers, 2023.
[48] DION B, NAJORK M, DALMASSO N, et al. Programming neural networks inference in a safety-critical simulation-based framework[C]∥Embedded Real Time Systems (ERTS) Congress, 2022: 1-12.
[49] YUAN J, PEI Y, XU Y, et al. Autonomous interval management of multi-aircraft based on multi-agent reinforcement learning considering fuel consumption[J]. Transportation Research Part C: Emerging Technologies2024165: 104729.
[50] WILLIAMS K R, SCHLOSSMAN R, WHITTEN D, et al. Trajectory planning with deep reinforcement learning in high-level action spaces[J]. IEEE Transactions on Aerospace and Electronic Systems202359(3): 2513-2529.
Outlines

/