导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (18): 331837.doi: 10.7527/S1000-6893.2025.31837

• Electronics and Electrical Engineering and Control • Previous Articles     Next Articles

Collaborative planning of multi-UAV trajectories and communication strategies considering channel resource constraints

Chen WANG1, Caisheng WEI1(), Zeyang YIN1, Kai JIN2, Xingchen LI3   

  1. 1.School of Automation,Central South University,Changsha 410083,China
    2.The 54th Research Institute of CETC,Shijiazhuang 050081,China
    3.National Innovation Institute of Defense Technology,Academy of Military Science,Beijing 100071,China
  • Received:2025-01-22 Revised:2025-03-26 Accepted:2025-04-17 Online:2025-09-25 Published:2025-04-25
  • Contact: Caisheng WEI E-mail:caisheng_wei@csu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62373379);Hunan Provincial Natural Science Foundation(2024JJ6482);Central South University Innovation-Driven Research Program(2023CXQD066)

Abstract:

To address the optimization problem of flight trajectories and communication strategies in multi-UAV collaborative reconnaissance missions, this study proposes a collaborative planning approach based on deep reinforcement learning, considering diverse costs such as flight distance, communication energy consumption, and channel capacity, along with multiple constraints including base station channel resource constraints, UAV performance constraints, and collision avoidance constraints. Firstly, a collaborative planning model for multi-UAV trajectories and communication strategies is established for randomly unknown reconnaissance environments. Secondly, an end-to-end deep reinforcement learning framework based on the multi-agent proximal policy optimization algorithm is introduced to jointly optimize coupled variables such as UAV trajectories, communication connection strategies, and communication transmit power, with flight distance, communication energy consumption, and channel capacity serving as multiple optimization objectives. To reduce the complexity of learning and solving multi-objective tasks, a trajectory planning sub-model incorporating virtual gravity from base stations is designed, which decreases the decision space. A trajectory planning sub-model that incorporates the virtual gravitational force of base stations is designed based on the artificial potential field method. Through the approach of automatically optimizing parameters via reinforcement learning, the size of the decision space is reduced, and the convergence speed of the model is accelerated. Finally, simulation experiments demonstrate the advantages of the proposed method in optimizing the overall mission cost index.

Key words: UVA, trajectory planning, base station channel resource constraint, deep reinforcement learning, collaborative optimization

CLC Number: