导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (8): 331024.doi: 10.7527/S1000-6893.2024.31024

• Electronics and Electrical Engineering and Control • Previous Articles    

Cooperative location of multiple UAVs with deep reinforcement learning in GPS-denied environment

Kaifang WAN(), Zhilin WU, Yunhui WU, Haozhi QIANG, Yibo WU, Bo LI   

  1. School of Electronics and Information,Northwestern Polytechnical University,Xi’an 710072,China
  • Received:2024-08-01 Revised:2024-09-27 Accepted:2024-11-21 Online:2024-12-11 Published:2024-12-05
  • Contact: Kaifang WAN E-mail:wankaifang@nwpu.edu.cn
  • Supported by:
    National Nature Science Foundation of China(62003267);the Key Research and Development Program of ShaanXi Province(2023-GHZD-33);the Fundamental Research Funds for the Central Universities(G2022KYO602);the Key Laboratory for Electromagnetic Space Operations and Applications(2022ZX0090);the National Key Laboratory of Air-based Information Perception and Fusion(202471)

Abstract:

In strong adversarial scenarios, Unmanned Aerial Vehicles (UAVs) often experience GPS malfunction due to interference, making it difficult to obtain their accurate position. Since UAVs often operate in formations or clusters, this paper proposes a strategy that relies on UAVs within the formation to measure relative spatial positions and locate each other, allowing UAVs to update their position information in real time even after GPS signal loss. Firstly, in response to the GPS-denied environment, the theory of the Partially Observable Markov Decision Process (POMDP) is introduced and the elements of POMDP are analyzed to establish a POMDP decision model based on collaborative positioning and scheduling is established. A belief state update method based on the Extended Kalman Filter (EKF), as well as a Q-value estimation method based on Deep Q-Network (DQN) in deep reinforcement learning, is proposed to achieve accurate collaborative real-time positioning. Application tests in different scenarios show that the proposed model can achieve efficient management and scheduling of UAVs in formation, and can control GPS normal UAVs to effectively coordinate and locate GPS failed UAVs, which verifies the effectiveness of the model.

Key words: multiple UAVs, GPS-denied, collaborative positioning, deep reinforcement learning, Markov decision

CLC Number: