Acta Aeronautica et Astronautica Sinica, 2025, Vol. 46, Issue (8): 331024. doi: 10.7527/S1000-6893.2024.31024
• Electronics and Electrical Engineering and Control •
Kaifang WAN, Zhilin WU, Yunhui WU, Haozhi QIANG, Yibo WU, Bo LI
Received: 2024-08-01
Revised: 2024-09-27
Accepted: 2024-11-21
Online: 2024-12-11
Published: 2024-12-05
Contact: Kaifang WAN, E-mail: wankaifang@nwpu.edu.cn
Kaifang WAN, Zhilin WU, Yunhui WU, Haozhi QIANG, Yibo WU, Bo LI. Cooperative location of multiple UAVs with deep reinforcement learning in GPS-denied environment[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(8): 331024.
Address: No.238, Baiyan Building, Beisihuan Zhonglu Road, Haidian District, Beijing, China
Postal code: 100083
E-mail: hkxb@buaa.edu.cn
All copyright © editorial office of Chinese Journal of Aeronautics

