复杂背景下反无人机红外目标鲁棒跟踪算法
收稿日期: 2025-05-20
修回日期: 2025-07-14
录用日期: 2025-08-19
网络出版日期: 2025-09-05
基金资助
国家自然科学基金(12202485)
Robust infrared target tracking algorithm for anti-UAV in complexbackgrounds
Received date: 2025-05-20
Revised date: 2025-07-14
Accepted date: 2025-08-19
Online published: 2025-09-05
Supported by
National Natural Science Foundation of China(12202485)
面向复杂背景下反无人机任务,红外目标跟踪面临着目标多尺度、遮挡、移出视野、背景干扰等诸多挑战。针对这些问题,提出了一种红外无人机鲁棒跟踪算法。首先,提出了一个全局跟踪模型,采用高效的一阶段无锚框框架,通过全局搜索以应对因遮挡和移出视野导致的无人机目标消失问题,并使用多层级结构跟踪不同尺寸的目标。其次,提出一个基于记忆网络的时空特征融合模块,利用视频中的时空域特征来提高目标的可判别性。然后,提出了一个目标增强与干扰抑制模块,通过记录和匹配帧间的目标和干扰特征以生成权重,并与得分图加权以提升算法的抗背景干扰能力。最后,提出了一个动态层级加速方法,通过删除冗余层级以提高运行效率。实验结果表明:该算法在2nd和3rd Anti-UAV数据集上分别取得92.4%和78.7%的精度以及69.4%和56.5%的成功率,并能够以26.9帧/s的速度运行,其性能均明显超过了现有算法,实现了在复杂场景中对无人机的实时和鲁棒跟踪。
冯子成 , 张文龙 , 刘冬辉 , 于起峰 . 复杂背景下反无人机红外目标鲁棒跟踪算法[J]. 航空学报, 2026 , 47(4) : 332264 -332264 . DOI: 10.7527/S1000-6893.2025.32264
In the anti-UAV task under complex backgrounds, infrared target tracking faces numerous challenges,such as multi-scale targets, occlusion, moving out of view, and background interference. To address these problems,a robust infrared UAV tracking algorithm is proposed. First, a global tracking model is designed, which adopts an effi⁃cient one-stage anchor-free framework to perform global search for handling target disappearance caused by occlusionor moving out of view, while employing a multi-level structure to track targets of varying sizes. Second, a spatiotemporalfeature fusion module based on the memory network is proposed, which leverages spatio-temporal featuresfrom video sequences to enhance target discriminability. Third, a target enhancement and interference suppressionmodule is introduced, which records and matches target and interference features between frames to generate weight⁃ing maps. These maps are applied to the score maps to improve the algorithm’s anti-interference capability. Finally, adynamic hierarchical acceleration method is proposed to improve the running efficiency by removing redundant hierar⁃chical computations. Experimental results demonstrate that the proposed algorithm achieves 92. 4% and 78. 7% preci⁃sion and 69. 4% and 56. 5% success rates on the 2nd and 3rd Anti-UAV datasets, respectively, with a running speedof 26. 9 framen per second, which significantly outperforms existing methods and achieves real-time and robust UAVtracking in complex scenarios
| [1] | 王传云, 苏阳, 王琳霖, 等. 面向反制无人机集群的多目标连续鲁棒跟踪算法[J]. 航空学报, 2024, 45(7): 329017. |
| WANG C Y, SU Y, WANG L L, et al. Multi-object continuous robust tracking algorithm for anti-UAV swarm[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(7): 329017 (in Chinese). | |
| [2] | 王红涛, 邓淼磊, 赵文君, 等. 基于深度学习的单目标跟踪算法综述[J]. 计算机系统应用, 2022, 31(5): 40-51. |
| WANG H T, DENG M L, ZHAO W J, et al. Survey on single object tracking algorithms based on deep learning[J]. Computer Systems and Applications, 2022, 31(5): 40-51 (in Chinese). | |
| [3] | YAN B, PENG H W, FU J L, et al. Learning spatio-temporal transformer for visual tracking[C]∥2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2021: 10428-10437. |
| [4] | YE B T, CHANG H, MA B P, et al. Joint feature learning and relation modeling for tracking: A one-stream framework[C]∥Computer Vision-ECCV 2022. Cham: Springer, 2022: 341-357. |
| [5] | CHEN X, PENG H W, WANG D, et al. SeqTrack: Sequence to sequence learning for visual object tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 14572-14581. |
| [6] | DAI K N, ZHANG Y H, WANG D, et al. High-performance long-term tracking with meta-updater[C]∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2020: 6297-6306. |
| [7] | YU Q J, MA Y C, HE J F, et al. A unified transformer-based tracker for anti-UAV tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2023: 3036-3046. |
| [8] | HUANG L H, ZHAO X, HUANG K Q. GlobalTrack: A simple and strong baseline for long-term tracking[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 11037-11044. |
| [9] | HUANG B, LI J N, CHEN J J, et al. Anti-UAV410: A thermal infrared benchmark and customized scheme for tracking drones in the wild[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(5): 2852-2865. |
| [10] | REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. |
| [11] | FANG H Z, WANG X L, LIAO Z K, et al. A real-time anti-distractor infrared UAV tracker with channel feature refinement module[C]∥2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Piscataway: IEEE Press, 2021: 1240. |
| [12] | JIANG N, WANG K R, PENG X K, et al. Anti-UAV: A large-scale benchmark for vision-based UAV tracking[J]. IEEE Transactions on Multimedia, 2021, 25: 486-500. |
| [13] | LI S J, ZHAO S, CHENG B, et al. Robust visual tracking via hierarchical particle filter and ensemble deep features[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30(1): 179-191. |
| [14] | HARE S, GOLODETZ S, SAFFARI A, et al. Struck: Structured output tracking with kernels[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10): 2096-2109. |
| [15] | HENRIQUES J F, CASEIRO R, MARTINS P, et al. Exploiting the circulant structure of tracking-by-detection with kernels[C]∥Computer Vision-ECCV 2012. Cham: Springer, 2012: 702-715. |
| [16] | LI B, WU W, WANG Q, et al. SiamRPN++: Evolution of Siamese visual tracking with very deep networks[C]∥2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2019: 4277-4286. |
| [17] | 丁奇帅, 雷帮军, 吴正平. 基于孪生网络的轻量型无人机单目标跟踪算法 [J]. 航空学报, 2025, 46(4): 330925. |
| DING Q S, LEI B J, WU Z P. A lightweight single object tracking algorithm for UAVs based on Siamese network [J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(4): 330925 (in Chinese). | |
| [18] | MAYER C, DANELLJAN M, BHAT G, et al. Transforming model prediction for tracking[C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 8721-8730. |
| [19] | ZHU Z, WANG Q, LI B, et al. Distractor-aware Siamese networks for visual object tracking[C]∥Computer Vision-ECCV 2018. Cham: Springer, 2018: 103-119. |
| [20] | HUANG B, CHEN J J, XU T F, et al. SiamSTA: Spatio-temporal attention based Siamese tracker for tracking UAVs[C]∥2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Piscataway: IEEE Press, 2021: 1204-1212. |
| [21] | CHEN X, YAN B, ZHU J W, et al. Transformer tracking [C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 8122-8131. |
| [22] | WEI X, BAI Y F, ZHENG Y C, et al. Autoregressive visual tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 9697-9706. |
| [23] | XIE J X, ZHONG B N, MO Z Y, et al. Autoregressive queries for adaptive tracking with spatio-temporal transformers[C]∥2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2024: 19300-19309. |
| [24] | WU H, LI W Q, LI W Q, et al. A real-time robust approach for tracking UAVs in infrared videos[C]∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2020: 4448-4455. |
| [25] | DANELLJAN M, BHAT G, KHAN F S, et al. ATOM: Accurate tracking by overlap maximization[C]∥2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2019: 4655-4664. |
| [26] | ZHAO J J, ZHANG X H, ZHANG P Y. A unified approach for tracking UAVs in infrared[C]∥2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Piscataway: IEEE Press, 2021: 1213-1222. |
| [27] | DANELLJAN M, VAN GOOL L, TIMOFTE R. Probabilistic regression for visual tracking[C]∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2020: 7181-7190. |
| [28] | SHI X J, CHEN Z R, WANG H, et al. Convolutional LSTM network: A machine learning approach for precipitation nowcasting[C]∥Proceedings of the 29th International Conference on Neural Information Processing Systems. New York: ACM, 2015: 802-810.. |
| [29] | TIAN Z, SHEN C H, CHEN H, et al. FCOS: Fully convolutional one-stage object detection[C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 9626-9635. |
| [30] | BHAT G, DANELLJAN M, VAN GOOL L, et al. Learning discriminative model prediction for tracking[C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 6181-6190. |
| [31] | BHAT G, DANELLJAN M, VAN GOOL L, et al. Know your surroundings: Exploiting scene information for object tracking[C]∥Computer Vision-ECCV 2020. Cham: Springer, 2020: 205-221. |
| [32] | LIU Z, NING J, CAO Y, et al. Video swin transformer [C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 3192-3201. |
/
| 〈 |
|
〉 |