导航

Acta Aeronautica et Astronautica Sinica ›› 2026, Vol. 47 ›› Issue (6): 332451.doi: 10.7527/S1000-6893.2025.32451

• Electronics and Electrical Engineering and Control • Previous Articles    

UAV complete data collection trajectory planning algorithm based on time window constraints

Sihua GAO, Bingyang ZHAO, Jianfu LI()   

  1. College of Computer Science and Technology,Civil Aviation University of China,Tianjin 300300,China
  • Received:2025-06-20 Revised:2025-07-04 Accepted:2025-08-25 Online:2025-09-09 Published:2025-09-05
  • Contact: Jianfu LI E-mail:jfli@cauc.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62173332)

Abstract:

Unmanned Aerial Vehicle (UAV) has been widely adopted to assist Wireless Sensor Networks (WSNs) in performing data collection tasks. However, time window constraints at the sensor nodes pose new challenges. The UAV must not only arrive in the vicinity of each data-transmitting node within its designated time window, but also complete the data collection task before the window closes. Inefficient trajectory planning increases the UAV’s flight distance, which may compromise the completeness of data collection. Although increasing flight speed can shorten travel time, it also accelerates energy consumption, potentially leading to task failure. To address these problems, We formulate a mathematical model for the UAV trajectory planning problem in a time-window-constrained complete data collection scenario, and then propose a reinforcement learning framework based on a Hierarchical Hybrid Action Representation (H-HyAR) to jointly optimize the UAV’s visiting order of target nodes, hovering offset, and flight speed, while capturing the hierarchical dependencies among these factors to minimize the UAV’s flight distance during the data collection task. Experiment results demonstrate that the H-HyAR algorithm outperforms three comparative hybrid action reinforcement learning algorithms and the Proximal Policy Optimization (PPO) algorithm in terms of flight distance and the influencing factors of this metric, while also exhibiting strong robustness and generalization capabilities.

Key words: unmanned aerial vehicle trajectory planning, hierarchical hybrid action representation, deep reinforcement learning, time window, complete data collection, wireless sensor networks

CLC Number: