导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (18): 329862.doi: 10.7527/S1000-6893.2024.29862

• Electronics and Electrical Engineering and Control • Previous Articles    

Multi-agent communication cooperation based on deep reinforcement learning and information theory

Bing GAO1, Zhejie ZHANG1(), Qijie ZOU1, Zhiguo LIU1,2, Xiling ZHAO1   

  1. 1.School of Information Engineering Faculty,Dalian University,Dalian  116622,China
    2.Key Laboratory of Communication & Network,Dalian University,Dalian  116622,China
  • Received:2023-11-10 Revised:2023-12-06 Accepted:2024-02-28 Online:2024-03-18 Published:2024-03-14
  • Contact: Zhejie ZHANG E-mail:zhangzhejie@s.dlu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(61673084);2021 Liaoning Provincial Department of Education Project(LJKZ1180)

Abstract:

Effective explicit communication among agents in a multi-agent system can increase their capacity for cooperation. However, existing communication strategies typically use the agents’ local observations as the communication content directly, and the communication objects are usually fixed with a certain topology structure. On the one hand, these strategies are difficult to adapt to changes in tasks and environments, which causes uncertainty in the communication process. On the other hand, the communication objects and contents lack focus, resulting in some resource waste and lower communication effectiveness. To address the issues above, this paper proposes an approach that integrates deep reinforcement learning and information theory to realize multi-agent adaptive communication mechanism. The approach uses a prior network to allow the agent to dynamically choose the object, then utilizes the constraints of mutual information and the information bottleneck theory to effectively filter redundant information. Finally, the agent summarizes its own and received information to extract more effective information. The method proposed is demonstrated to improve the stability and interaction efficiency of multi-agent systems compared to other methods through cooperative navigation and traffic junction environments.

Key words: multi-agent deep reinforcement learning, mutual information, explicit communication, information bottleneck, cooperation environment

CLC Number: