导航

Acta Aeronautica et Astronautica Sinica ›› 2023, Vol. 44 ›› Issue (13): 327895-327895.doi: 10.7527/S1000-6893.2023.27895

• Electronics and Electrical Engineering and Control • Previous Articles    

Distributed multi-agent coalition task allocation strategy for single pilot operation mode based on DQN

Lei DONG1,2,3, Hongbing CHEN2,3, Xi CHEN1,2,3, Changxiao ZHAO1,2,3()   

  1. 1.Key Laboratory of Civil Aircraft Airworthiness Technology,Civil Aviation University of China,Tianjin 300300,China
    2.Civil Aircraft Airworthiness and Repair Key Laboratory of Tianjin,Civil Aviation University of China,Tianjin 300300,China
    3.College of Safety Science and Engineering,Civil Aviation University of China,Tianjin 300300,China
  • Received:2022-08-03 Revised:2022-11-30 Accepted:2023-02-23 Online:2023-03-13 Published:2023-03-10
  • Contact: Changxiao ZHAO E-mail:zhaochangxiao@yeah.net
  • Supported by:
    National Key Research and Development Program(2021YFB1600600);Tianjin Education Commission Scientific Research Project(2022KJ058);Fundamental Research Funds for the Central Universities(3122022044);Graduate Research Innovation Funding Project of Civil Aviation University of China(2021YJS011)

Abstract:

Distributed decision-making is essential for increasing the autonomy of multi-agent system in the distributed coordinated flight organization structure of Single Pilot Operation (SPO) mode. A coalition task assignment decision model of distributed multi-agent for SPO mode is built on the background of multi-agent collaboration for the execution of complicated tasks, taking into account several constraints such as task load resource requirements, agent resource space, and time windows. Then, we design a function approximation of a Q-valued network, and propose a coalition task allocation algorithm based on Deep Q-Network (DQN) that generates the best execution path of the optimal coalition task allocation results, allowing each agent in the coalition to achieve scheduling optimization in a more adaptive manner. The efficiency and speed of the DQN algorithm in addressing multi-agent coalition task allocation for the SPO mode under complex constraints are confirmed through numerical simulation.

Key words: single pilot operation, multi-agent system, task allocation, coalition formation, deep reinforcement learning, neural network

CLC Number: