ACTA AERONAUTICAET ASTRONAUTICA SINICA >
Multi-object feature association in UAV videos: Recent progress and perspectives
Received date: 2025-03-12
Revised date: 2025-03-29
Accepted date: 2025-05-28
Online published: 2025-06-06
Supported by
National Natural Science Foundation of China(61971426)
Unmanned Aerial Vehicle (UAV) videos have become essential sources of information in both civilian and military domains, including intelligent surveillance, smart cities, situational awareness, low-altitude economy and military reconnaissance. Multi-object feature association in UAV videos aims to continuously predict target positions and maintain the identity of each target, serving as the foundation for tasks such as multi-object tracking. However, existing reviews predominantly focus on UAV object detection and tracking, lacking a systematic review for multi-object feature association in UAV videos. This paper provides the first systematic review of the research progress on multi-object feature association in UAV videos. First, existing methods are summarized and categorized based on application scenarios and data source characteristics, which covers multi-view and multi-spectral feature association approaches for the first time. Then, the representative algorithms are analyzed in depth, including their strengths, limitations, and applicable scenarios. In addition, mainstream public datasets used in this research field are summarized, including single-view, multi-view, and multi-spectral UAV video datasets. Representative datasets such as VisDrone, MDMT, and VT-Tiny-MOT are selected to evaluate and compare existing methods, with the purpose of analyzing the root causes of the performance differences among existing methods and laying the foundation for subsequent studies. Finally, the paper highlights the key challenges that remain in UAV multi-object feature association and discusses future research directions, particularly in the areas of foundation model development and multi-modal deep fusion. This review aims to provide valuable insights for advancing research in this field.
Han WU , Hao SUN , Kui LIU , Kefeng JI , Gangyao KUANG . Multi-object feature association in UAV videos: Recent progress and perspectives[J]. ACTA AERONAUTICAET ASTRONAUTICA SINICA, 2026 , 47(4) : 331967 -331967 . DOI: 10.7527/S1000-6893.2025.31967
| [1] | ZHOU M L, XING R, HAN D L, et al. PDT: UAV target detection dataset for Pests and Diseases tree[C]∥Com puter Vision-ECCV 2024. Cham: Springer, 2025: 56-72. |
| [2] | KAUFMANN E, BAUERSFELD L, LOQUERCIO A, et al. Champion-level drone racing using deep reinforcement learning[J]. Nature, 2023, 620(7976): 982-987. |
| [3] | 吴一全, 童康. 基于深度学习的无人机航拍图像小目 标检测研究进展[J]. 航空学报, 2025, 46(3): 030848. |
| WU Y Q, TONG K. Research advances on deep learning-based small object detection in UAV aerial images[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(3): 030848 (in Chinese). | |
| [4] | LIN J L, LUO Z M, LIN D Z, et al. A self-adaptive feature extraction method for aerial-view geo-localization[J]. IEEE Transactions on Image Processing, 2025, 34: 126-139. |
| [5] | 王海峰. 高性能协同作战无人机的发展与思考[J]. 航 空学报, 2024, 45(17): 530304. |
| WANG H F. Development of high performance collaborative combat UAVs[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(17): 530304 (in Chinese). | |
| [6] | CAO X Y, ZHENG Y Y, YAO Y, et al. TOPIC: A parallel association paradigm for multi-object tracking under complex motions and diverse scenes[J]. IEEE Transactions on Image Processing, 2025, 34: 743-758. |
| [7] | DING J G, LI W, YANG M, et al. SeaTrack: Rethinking observation-centric SORT for robust nearshore multiple object tracking[J]. Pattern Recognition, 2025, 159: 111091. |
| [8] | LI Z P, ZHANG D X, WU S, et al. Sampling-resilient multi-object tracking[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(4): 3297-3305. |
| [9] | FENG M Z, SU J B. RGBT tracking: A comprehensive review[J]. Information Fusion, 2024, 110: 102492. |
| [10] | 何友, 刘瑜, 李耀文, 等. 多源信息融合发展及展望 [J]. 航空学报, 2025, 46(6): 531672. |
| HE Y, LIU Y, LI Y W, et al. Development and prospects of multisource information fusion[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(6): 531672 (in Chinese). | |
| [11] | WU Z W, ZHENG J L, REN X X, et al. Single-model and any-modality for video object tracking[C]∥2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2024: 19156-19166. |
| [12] | 王传云, 苏阳, 王琳霖, 等. 面向反制无人机集群的多目标连续鲁棒跟踪算法[J]. 航空学报, 2024, 45(7): 329017. |
| WANG C Y, SU Y, WANG L L, et al. Multi-object continuous robust tracking algorithm for anti-UAV swarm[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(7): 329017 (in Chinese). | |
| [13] | LUO W H, XING J L, MILAN A, et al. Multiple object tracking: A literature review[J]. Artificial Intelligence, 2021, 293: 103448. |
| [14] | PAL S K, PRAMANIK A, MAITI J, et al. Deep learning in multi-object detection and tracking: State of the art[J]. Applied Intelligence, 2021, 51(9): 6400-6429. |
| [15] | NGUYEN P, QUACH K G, DUONG C N, et al. Multi camera multi-object tracking on the move via single stage global association approach[J]. Pattern Recognition, 2024, 152: 110457. |
| [16] | LUO R, SONG Z K, MA L T, et al. DiffusionTrack: Diffusion model for multi-object tracking[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(5): 3991-3999. |
| [17] | ZHANG Y J, LIANG Y Q, LENG J X, et al. SCGTracker: Spatio-temporal correlation and graph neural networks for multiple object tracking[J]. Pattern Recognition, 2024, 149: 110249. |
| [18] | YUAN X Y, XU T F, LIU X C, et al. Multi-step temporal modeling for UAV tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(8): 7216-7230. |
| [19] | 薛远亮, 金国栋, 谭力宁, 等. 基于多尺度融合的自 适应无人机目标跟踪算法[J]. 航空学报, 2023, 44(1): 326107. |
| XUE Y L, JIN G D, TAN L N, et al. Adaptive UAV target tracking algorithm based on multi-scale fusion[J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(1): 326107 (in Chinese). | |
| [20] | 杨永刚, 姜文韬, 高志云. 低空无人机实时目标检测 算法[J]. 航空学报, 2025, 46(16): 331619. |
| YANG Y G, JIANG W T, GAO Z Y. Real-time target detection algorithm for low altitude UAVs[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(16): 331619 (in Chinese). | |
| [21] | YIN N Z, LIU C X, TIAN R H, et al. SDPDet: Learning scale-separated dynamic proposals for end-to-end drone-view detection[J]. IEEE Transactions on Multimedia, 2024, 26: 7812-7822. |
| [22] | HUANG B, LI J N, CHEN J J, et al. Anti-UAV410: A thermal infrared benchmark and customized scheme for tracking drones in the wild[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(5): 2852-2865. |
| [23] | YE N Y, ZENG Z Y, ZHOU J D, et al. OoD-control: Generalizing control in unseen environments[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(11): 7421-7433. |
| [24] | DAI M, ZHENG E H, FENG Z H, et al. Vision-based UAV self-positioning in low-altitude urban environments[J]. IEEE Transactions on Image Processing, 2023, 33: 493-508. |
| [25] | JIMéNEZ-BRAVO D M, LOZANO MURCIEGO á, SALES MENDES A, et al. Multi-object tracking in traffic environments: A systematic literature review[J]. Neurocomputing, 2022, 494: 43-55. |
| [26] | TANG G Y, NI J J, ZHAO Y H, et al. A survey of object detection for UAVs based on deep learning[J]. Remote Sensing, 2024, 16(1): 149. |
| [27] | 苑玉彬, 吴一全, 赵朗月, 等. 基于深度学习的无人机航拍视频多目标检测与跟踪研究进展[J]. 航空学报, 2023, 44(18): 028334. |
| YUAN Y B, WU Y Q, ZHAO L Y, et al. Research progress of UAV aerial video multi-object detection and tracking based on deep learning[J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(18): 028334 (in Chinese). | |
| [28] | FU C H, LU K H, ZHENG G Z, et al. Siamese object tracking for unmanned aerial vehicle: A review and comprehensive analysis[J]. Artificial Intelligence Review, 2023, 56(1): 1417-1477. |
| [29] | SUN N Y, ZHAO J, SHI Q, et al. Moving target tracking by unmanned aerial vehicle: A survey and taxonomy[J]. IEEE Transactions on Industrial Informatics, 2024, 20(5): 7056-7068. |
| [30] | WANG J K, WU Z X, CHEN D D, et al. OmniTracker: Unifying visual object tracking by tracking-with-detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025, 47(4): 3159-3174. |
| [31] | FUNG A, BENHABIB B, NEJAT G. LDTrack: Dynamic people tracking by service robots using diffusion models[J]. International Journal of Computer Vision, 2025, 133(6): 3392-3412. |
| [32] | GAO Y, XU H J, LI J, et al. BPMTrack: Multi-object tracking with detection box application pattern mining[J]. IEEE Transactions on Image Processing, 2024, 33: 1508-1521. |
| [33] | ZHAO X, HU S Y, WANG Y P, et al. BioDrone: A bionic drone-based single object tracking benchmark for robust vision[J]. International Journal of Computer Vision, 2024, 132(5): 1659-1684. |
| [34] | WANG Y, HUANG Z R, LAGANIèRE R, et al. A UAV to UAV tracking benchmark[J]. Knowledge-Based Systems, 2023, 261: 110197. |
| [35] | TRAN T M, BUI D C, NGUYEN T V, et al. Transformer based spatio-temporal unsupervised traffic anomaly detection in aerial videos[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(9): 8292-8309. |
| [36] | WANG J, LI X Q, ZHOU L H, et al. Adaptive receptive field enhancement network based on attention mechanism for detecting the small target in the aerial image[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 62: 5600118. |
| [37] | KOUZEGHAR M, SONG Y, MEGHJANI M, et al. Multi-target pursuit by a decentralized heterogeneous UAV swarm using deep multi-agent reinforcement learning[C]∥2023 IEEE International Conference on Robot ics and Automation (ICRA). Piscataway: IEEE Press, 2023: 3289-3295. |
| [38] | KHAN M U, DIL M, ALAM M Z, et al. SafeSpace MFNet: Precise and efficient multifeature drone detection network[J]. IEEE Transactions on Vehicular Technology, 2024, 73(3): 3106-3118. |
| [39] | BEWLEY A, GE Z Y, OTT L, et al. Simple online and realtime tracking[C]∥2016 IEEE International Conference on Image Processing (ICIP). Piscataway: IEEE Press, 2016: 3464-3468. |
| [40] | WANG P, WANG Y C, LI D Y. DroneMOT: Drone-based multi-object tracking considering detection difficulties and simultaneous moving of drones and objects[C]∥ 2024 IEEE International Conference on Robotics and Automation (ICRA). Piscataway: IEEE Press, 2024: 7397-7404. |
| [41] | WU H, NIE J, HE Z, et al.One-shot multiple object tracking in UAV video using task-specific fine-grained features[J]. Remote Sensing, 2022, 14(16): 3853. |
| [42] | LV W Y, ZHANG N, ZHANG J J, et al. One-shot multiple object tracking with robust ID preservation[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(6): 4473-4488. |
| [43] | CHU Q, OUYANG W L, LI H S, et al. Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism[C]∥2017 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2017: 4846-4855. |
| [44] | DANG Z Y, SUN X Y, SUN B, et al. OMCTrack: Integrating occlusion perception and motion compensation for UAV multi-object tracking[J]. Drones, 2024, 8(9): 480. |
| [45] | DENG C W, WU J P, HAN Y Q, et al. Learning a robust topological relationship for online multiobject tracking in UAV scenarios[J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62: 5628615. |
| [46] | ZENG F G, DONG B, ZHANG Y A, et al. MOTR: End-to-end multiple-object tracking with transformer[C]∥Computer Vision-ECCV 2022. Cham: Springer, 2022: 659-675. |
| [47] | ZHU P F, ZHENG J Y, DU D W, et al. Multi-drone-based single object tracking with agent sharing network[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 31(10): 4058-4070. |
| [48] | XUE Y L, JIN G D, SHEN T, et al. Consistent representation mining for multi-drone single object tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(11): 10845-10859. |
| [49] | WU H, SUN H, JI K F, et al. Temporal-spatial feature interaction network for multi-drone multi-object tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2025, 35(2): 1165-1179. |
| [50] | LIU Z H, SHANG Y Y, LI T M, et al. Robust multi-drone multi-target tracking to resolve target occlusion: A benchmark[J]. IEEE Transactions on Multimedia, 2023, 25: 1462-1476. |
| [51] | GUAN D Y, CAO Y P, YANG J X, et al. Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection[J]. Information Fusion, 2019, 50: 148-157. |
| [52] | TANG L F, YUAN J T, ZHANG H, et al. PIAFusion: A progressive infrared and visible image fusion network based on illumination aware[J]. Information Fusion, 2022, 83-84: 79-92. |
| [53] | LIANG T F, JIN Y, LIU W, et al. Cross-modality transformer with modality mining for visible-infrared person re-identification[J]. IEEE Transactions on Multimedia, 2023, 25: 8432-8444. |
| [54] | ZHU Y B, WANG Q W, LI C L, et al. Visible-thermal multiple object tracking: Large-scale video dataset and progressive fusion approach[J]. Pattern Recognition, 2025, 161: 111330. |
| [55] | ZHU P F, WEN L Y, DU D W, et al. Detection and tracking meet drones challenge[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(11): 7380-7399. |
| [56] | DU D W, QI Y K, YU H Y, et al. The unmanned aerial vehicle benchmark: Object detection and tracking[C]∥Computer Vision-ECCV 2018. Cham: Springer, 2018: 375-391. |
| [57] | MANDAL M, KUMAR L K, VIPPARTHI S K. MOR UAV: A benchmark dataset and baselines for moving ob ject recognition in UAV videos[C]∥Proceedings of the 28th ACM International Conference on Multimedia. New York: ACM, 2020: 2626-2635. |
| [58] | YE H, SUNDERRAMAN R, JI S H. UAV3D: A large scale 3D perception benchmark for unmanned aerial ve hicles[C]∥Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024: 55425-55442. |
| [59] | YING X Y, XIAO C, AN W, et al. Visible-thermal tiny object detection: A benchmark dataset and baselines[J]. IEEE Transactions on Pattern Analysis and Machine In telligence, 2025, 47(7): 6088-6096. |
| [60] | YANG M Z, HAN G X, YAN B, et al. Hybrid-SORT: Weak cues matter for online multi-object tracking[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2024, 38(7): 6504-6512. |
| [61] | CAO J K, PANG J M, WENG X S, et al. Observationcentric SORT: Rethinking SORT for robust multi-object tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 9686-9696. |
| [62] | LI J, YE D H, CHUNG T, et al. Multi-target detection and tracking from a single camera in Unmanned Aerial Vehicles (UAVs)[C]∥2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Piscataway: IEEE Press, 2016: 4992-4997. |
| [63] | PAN S Y, TONG Z H, ZHAO Y Y, et al. Multi-object tracking hierarchically in visual data taken from drones[C]∥2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Piscataway: IEEE Press, 2019: 135-143. |
| [64] | DUAN K W, BAI S, XIE L X, et al. CenterNet: Keypoint triplets for object detection[C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 6568-6577. |
| [65] | SHI L K, ZHANG Q R, PAN B, et al. Global-local and occlusion awareness network for object tracking in UAVs[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023, 16: 8834-8844. |
| [66] | BARBARY M, ELAZEEM M H A. Drones tracking based on robust cubature Kalman-TBD-Multi-Bernoulli filter[J]. ISA Transactions, 2021, 114: 277-290. |
| [67] | LIU S, LI X, LU H C, et al. Multi-object tracking meets moving UAV[C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 8866-8875. |
| [68] | CHENG S, YAO M B, XIAO X M. DC-MOT: Motion deblurring and compensation for multi-object tracking in UAV videos[C]∥2023 IEEE International Conference on Robotics and Automation (ICRA). Piscataway: IEEE Press, 2023: 789-795. |
| [69] | QIU B Y, GUO Y F, XUE A K, et al. Improved Gaussian processes linear JPDA filter for multiple extended targets tracking in dense clutter[J]. Digital Signal Processing, 2024, 153: 104600. |
| [70] | XU S Y, SAVVARIS A, HE S M, et al. Real-time implementation of YOLO+JPDA for small scale UAV multiple object tracking[C]∥2018 International Conference on Unmanned Aircraft Systems (ICUAS). Piscataway: IEEE Press, 2018: 1336-1341. |
| [71] | REDMON J, FARHADI A. YOLO9000: Better, faster, stronger[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2017: 6517-6525. |
| [72] | MEMON S A, ULLAH I. Detection and tracking of the trajectories of dynamic UAVs in restricted and cluttered environment[J]. Expert Systems with Applications, 2021, 183: 115309. |
| [73] | WANG D J, LIAN B W, LIU Y Y, et al. A cooperative UAV swarm localization algorithm based on probabilistic data association for visual measurement[J]. IEEE Sensors Journal, 2022, 22(20): 19635-19644. |
| [74] | CHAI J D, HE S M, SHIN H S, et al. Domain-knowledge-aided airborne ground moving targets tracking[J]. Aerospace Science and Technology, 2024, 144: 108807. |
| [75] | MILAN A, REZATOFIGHI S H, DICK A, et al. Online multi-target tracking using recurrent neural networks[C]∥Proceedings of the AAAI Conference on Artificial Intel ligence. Reston: AIAA, 2017: 4255-4232. |
| [76] | SADEGHIAN A, ALAHI A, SAVARESE S. Tracking the untrackable: Learning to track multiple cues with long-term dependencies[C]∥2017 IEEE International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2017: 300-311. |
| [77] | XIAO F Y, LEE Y J. Video object detection with an aligned spatial-temporal memory[C]∥Computer Vision-ECCV 2018. Cham: Springer, 2018: 494-510. |
| [78] | YAO M F, WANG J Q, PENG J L, et al. FOLT: Fast multiple object tracking from UAV-captured videos based on optical flow[C]∥Proceedings of the 31st ACM International Conference on Multimedia. New York: ACM, 2023: 3375-3383. |
| [79] | YU H Y, LI G R, SU L, et al. Conditional GAN based individual and global motion fusion for multiple object tracking in UAV videos[J]. Pattern Recognition Letters, 2020, 131: 219-226. |
| [80] | LIU Z, LIN Y T, CAO Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]∥2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2021: 9992-10002. |
| [81] | YAO T, LI Y H, PAN Y W, et al. HIRI-ViT: Scaling vision transformer with high resolution inputs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(9): 6431-6442. |
| [82] | HU M J, ZHU X T, WANG H T, et al. STDFormer: Spatial-temporal motion transformer for multiple object tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(11): 6571-6594. |
| [83] | SONG I, LEE J. SFTrack: A robust scale and motion adaptive algorithm for tracking small and fast moving objects[C]∥2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Piscataway: IEEE Press, 2024: 10870-10877. |
| [84] | KAPANIA S, SAINI D, GOYAL S, et al. Multi object tracking with UAVs using deep SORT and YOLOv3 RetinaNet detection framework[C]∥Proceedings of the 1st ACM Workshop on Autonomous and Intelligent Mobile Systems. New York: ACM, 2020: 1-6. |
| [85] | WOJKE N, BEWLEY A, PAULUS D. Simple online and realtime tracking with a deep association metric[C]∥2017 IEEE International Conference on Image Processing (ICIP). Piscataway: IEEE Press, 2017: 3645-3649. |
| [86] | ZHANG Y F, SUN P Z, JIANG Y, et al. ByteTrack: Multi-object tracking by Associating every detection box[C]∥Computer Vision-ECCV 2022. Cham: Springer, 2022: 1-21. |
| [87] | ZHANG W, LI J M, XIA M, et al. OffsetNet: Towards efficient multiple object tracking, detection, and segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025, 47(2): 949-960. |
| [88] | MA J B, LIU D X, QIN S L, et al. An asymmetric feature enhancement network for multiple object tracking of unmanned aerial vehicle[J]. Remote Sensing, 2024, 16(1): 70. |
| [89] | BERGMANN P, MEINHARDT T, LEAL-TAIXE L. Tracking without bells and whistles[C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 941-951. |
| [90] | WU H, HE Z W, GAO M Y. GCEVT: Learning global context embedding for vehicle tracking in unmanned aerial vehicle videos[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 20: 6000705. |
| [91] | XU L B, HUANG Y P. Rethinking joint detection and embedding for multiobject tracking in multiscenario[J]. IEEE Transactions on Industrial Informatics, 2024, 20(6): 8079-8088. |
| [92] | LI W Q, MU J T, LIU G Z. Multiple object tracking with motion and appearance cues[C]∥2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). Piscataway: IEEE Press, 2019: 161-169. |
| [93] | ZHANG Y F, WANG C Y, WANG X G, et al. FairMOT: On the fairness of detection and re-identification in multiple object tracking[J]. International Journal of Computer Vision, 2021, 129(11): 3069-3087. |
| [94] | SHEN Z Q, CAI K Q, ZHAO P, et al. An interactively motion-assisted network for multiple object tracking in complex traffic scenes[J]. IEEE Transactions on Intelligent Transportation Systems, 2024, 25(2): 1992-2004. |
| [95] | KIM C, LI F X, REHG J M. Multi-object tracking with neural gating using bilinear LSTM[C]∥Computer Vision-ECCV 2018. Cham: Springer, 2018: 208-224. |
| [96] | YU Q J, MA Y C, HE J F, et al. A unified transformerbased tracker for anti-UAV tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2023: 3036-3046. |
| [97] | NIE J H, WU H, HE Z W, et al. Spreading fine-grained prior knowledge for accurate tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(9): 6186-6199. |
| [98] | ZHENG L Y, TANG M, CHEN Y Y, et al. Improving multiple object tracking with single object tracking[C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 2453-2462. |
| [99] | FENG W T, LI B P, OUYANG W L. Multi-object tracking with multiple cues and switcher-aware classification[C]∥2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA). Piscataway: IEEE Press, 2022: 1-10. |
| [100] | LI B, WU W, WANG Q, et al. SiamRPN++: Evolution of Siamese visual tracking with very deep networks[C]∥2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2019: 4277-4286. |
| [101] | ZHU J, YANG H, LIU N, et al. Online multi-object tracking with dual matching attention networks[C]∥Computer Vision-ECCV 2018. Cham: Springer, 2018: 379-396. |
| [102] | DANELLJAN M, BHAT G, KHAN F S, et al. ECO: Efficient convolution operators for tracking[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2017: 6931-6939. |
| [103] | GHOSH S, PATRIKAR J, MOON B, et al. AirTrack: Onboard deep learning framework for long-range air craft detection and tracking[C]∥2023 IEEE International Conference on Robotics and Automation (ICRA). Pisca taway: IEEE Press, 2023: 1277-1283. |
| [104] | SHUAI B, BERNESHAWI A, LI X Y, et al. SiamMOT: Siamese multi-object tracking[C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 12367-12377. |
| [105] | YANG L, WANG H Q, SUN H J, et al. MOFTrack: Multi object formation tracking in Remote sensing videos[C]∥Pattern Recognition and Computer Vision. Singapore: Springer, 2025: 551-565. |
| [106] | REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. |
| [107] | WANG Y X, KITANI K, WENG X S. Joint object detection and multi-object tracking with graph neural networks[C]∥2021 IEEE International Conference on Robotics and Automation (ICRA). New York: ACM, 2021: 13708-13715. |
| [108] | HE X J, JIN J, CHEN D, et al. RoMATer: An end-to-end robust multiaircraft tracker with transformer[C]∥ 2024 International Joint Conference on Neural Networks (IJCNN). Piscataway: IEEE Press, 2024: 1-8. |
| [109] | XU Y H, BAN Y T, DELORME G, et al. TransCenter: Transformers with dense representations for multiple object tracking[J]. IEEE Transactions on Pattern Analy sis and Machine Intelligence, 2023, 45(6): 7820-7835. |
| [110] | MEINHARDT T, KIRILLOV A, LEAL-TAIXé L, et al. TrackFormer: Multi-object tracking with transform ers[C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 8834-8844. |
| [111] | LEI D, XU M, WANG S A. A deep multimodal network for multi-task trajectory prediction[J]. Information Fusion, 2025, 113: 102597. |
| [112] | WANG Z C, CHENG P R, CHEN M X, et al. Drones help drones: A collaborative framework for multi-drone object trajectory prediction and beyond[C]∥Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024: 64604-64628. |
| [113] | CHEN G L, ZHU P F, CAO B, et al. Cross-drone transformer network for robust single object tracking[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(9): 4552-4563. |
| [114] | FU Z H, FU Z H, LIU Q J, et al. SparseTT: Visual tracking with sparse transformers[DB/OL]. arXiv preprint:2205.03776, 2022. |
| [115] | 伍瀚, 孙浩, 计科峰, 等. 时序信息引导跨视角特征 融合的多无人机多目标跟踪方法[J]. 电子学报, 2025, 53(3): 728-743. |
| WU H, SUN H, JI K F, et al. Temporal-guided crossview feature fusion network for multi-drone multi-object tracking[J]. Acta Electronica Sinica, 2025, 53(3): 728 743 (in Chinese). | |
| [116] | JAVED S, HASSAN A, AHMAD R, et al. State-of-the-art and future research challenges in UAV swarms[J]. IEEE Internet of Things Journal, 2024, 11(11): 19023-19045. |
| [117] | SUN J M, SHEN Z H, WANG Y A, et al. LoFTR: Detector-free local feature matching with transformers[C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 8918-8927. |
| [118] | LINDENBERGER P, SARLIN P E, POLLEFEYS M. LightGlue: Local feature matching at light speed[C]∥ 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2023: 17581-17592. |
| [119] | AMOSA T I, SEBASTIAN P, IZHAR L I, et al. Multicamera multi-object tracking: A review of current trends and future advances[J]. Neurocomputing, 2023, 552: 126558. |
| [120] | QIAN Y J, YU L J, LIU W H, et al. ELECTRICITY: An efficient multi-camera vehicle tracking system for intelligent city[C]∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2020: 2511-2519. |
| [121] | HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2016: 770-778. |
| [122] | ZUO G B, ZHOU K, WANG Q, UAV-to-UAV small target detection method based on deep learning in Complex scenes[J]. IEEE Sensors Journal, 2025, 25(2): 3806-3820. |
| [123] | GUO Y D, LIU Z Y, LUO H, et al. Multi-person multicamera tracking for live stream videos based on improved motion model and matching cascade[J]. Neurocomputing, 2022, 492: 561-571. |
| [124] | 周翰祺, 方东旭, 张宁波, 等. 基于深度学习的多无 人机多目标跟踪[J]. 计算机工程, 2025, 51(4): 57-65. |
| ZHOU H Q, FANG D X, ZHANG N B, et al. Multi-UAV multi-object tracking based on deep learning[J]. Computer Engineering, 2025, 51(4): 57-65 (in Chinese). | |
| [125] | BELLAVIA F. SIFT matching by context exposed[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(2): 2445-2457. |
| [126] | POURFARD M, HOSSEINIAN T, SAEIDI R, et al. KAZE-SAR: SAR image registration using KAZE detector and modified SURF descriptor for tackling speckle noise[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 60: 5207612. |
| [127] | QIN Z, ZHOU S P, WANG L, et al. MotionTrack: Learning robust short-term and long-term motions for multi-object tracking[C]∥2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 17939-17948. |
| [128] | ZHOU K L, CHEN L S, CAO X. Improving multispectral pedestrian detection by addressing modality imbalance problems[C]∥Computer Vision-ECCV 2020. Cham: Springer, 2020: 787-803. |
| [129] | XU H, MA J Y, JIANG J J, et al. U2Fusion: A unified unsupervised image fusion network[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(1): 502-518. |
| [130] | 张帆, 丛玮, 田润操, 等. 基于双层变权的异构数据融合及可靠性分析[J]. 航空学报, 2024, 45(22): 230297. |
| ZHANG F, CONG W, TIAN R C, et al. Heterogeneous data fusion and reliability analysis based on two-layer variable weights[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(22): 230297 (in Chinese). | |
| [131] | XU H, MA J Y, YUAN J T, et al. RFNet: Unsupervised network for mutually reinforcing multi-modal image registration and fusion[C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 19647-19656. |
| [132] | HOU R C, ZHOU D M, NIE R C, et al. VIF-net: An unsupervised framework for infrared and visible image fusion[J]. IEEE Transactions on Computational Imaging, 2020, 6: 640-651. |
| [133] | SUN Y M, CAO B, ZHU P F, et al. Drone-based RGBinfrared cross-modality vehicle detection via uncertainty-aware learning[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(10): 6700-6713. |
| [134] | DU X X, ZARE A. Multiresolution multimodal sensor fusion for remote sensing data with label uncertainty[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(4): 2755-2769. |
| [135] | YE M, SHEN J B, LIN G J, et al. Deep learning for person re-identification: A survey and outlook[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(6): 2872-2893. |
| [136] | LI H C, LI C L, ZHU X P, et al. Multi-spectral vehicle re-identification: A challenge[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 11345-11353. |
| [137] | CHEN S G, XU L Z, LI X Y, et al. Frequency-space enhanced and temporal adaptative RGBT object tracking[J]. Neurocomputing, 2025, 640: 130240. |
| [138] | ZHANG S Z, YANG Y F, WANG P, et al. Attend to the difference: Cross-modality person re-identification via contrastive correlation[J]. IEEE Transactions on Image Processing, 2021, 30: 8861-8872. |
| [139] | YE M, WANG Z, LAN X Y, et al. Visible thermal person re-identification via dual-constrained top-ranking[C]∥Proceedings of the 27th International Joint Conference on Artificial Intelligence. New York: ACM, 2018: 1092-1099. |
| [140] | ZHANG Y Y, ZHAO S Y, KANG Y H, et al. Modality synergy complement learning with cascaded aggregation for visible-infrared person re-identification[C]∥Computer Vision-ECCV 2022. Cham: Springer, 2022: 462-479. |
| [141] | ZHU P F, PENG T, DU D W, et al. Graph regularized flow attention network for video animal counting from drones[J]. IEEE Transactions on Image Processing, 2021, 30: 5339-5351. |
| [142] | VARGA L A, KIEFER B, MESSMER M, et al. SeaDronesSee: A maritime benchmark for detecting humans in open water[C]∥2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). Piscataway: IEEE Press, 2022: 3686-3696. |
| [143] | DOSOVITSKIY A, ROS G, CODEVILLA F, et al. CARLA: An open urban driving simulator[C]∥Proceedings of the 1st Annual Conference on Robot Learning. New York: PMLR Press, 2017: 1-16. |
| [144] | XU Q Y, WANG L G, SHENG W D, et al. Heterogeneous graph transformer for multiple tiny object tracking in RGB-T videos[J]. IEEE Transactions on Multimedia, 2024, 26: 9383-9397. |
| [145] | DENDORFER P, O?EP A, MILAN A, et al. MOTChallenge: A benchmark for single-camera multiple target tracking[J]. International Journal of Computer Vision, 2021, 129(4): 845-881. |
| [146] | LUITEN J, O?EP A, DENDORFER P, et al. HOTA: A higher order metric for evaluating multi-object tracking[J]. International Journal of Computer Vision, 2021, 129(2): 548-578. |
| [147] | ZHU B, WANG J, JIANG Z, et al. Autoassign: Differentiable label assignment for dense object detection[DB/OL]. arXiv preprint: 2007.03496, 2020. |
| [148] | GE Z, LIU S T, WANG F, et al. YOLOX: Exceeding YOLO series in 2021[DB/OL]. arXiv preprint: 2107.08430, 2021. |
| [149] | FENG C J, ZHONG Y J, GAO Y, et al. TOOD: Taskaligned one-stage object detection[C]∥2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2021: 3490-3499. |
| [150] | WANG J Q, CHEN K, XU R, et al. CARAFE: Content aware Reassembly of Features[C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 3007-3016. |
| [151] | PANG J M, QIU L L, LI X, et al. Quasi-dense similarity learning for multiple object tracking[C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 164-173. |
| [152] | VU T, JANG H, PHAM T X, et al. Cascade RPN: Delving into high-quality region proposal network with adaptive convolution[DB/OL]. arXiv prepint: 1909.06720, 2019. |
| [153] | HE L X, LIAO X Y, LIU W, et al. FastReID: A pytorch toolbox for general instance re-identification[C]∥Proceedings of the 31st ACM International Conference on Multimedia. New York: ACM, 2023: 9664-9667. |
| [154] | CHEN Y T, SHI J H, YE Z L, et al. Multimodal object detection via Probabilistic ensembling[C]∥Computer Vision-ECCV 2022. Cham: Springer, 2022: 139-158. |
| [155] | ZHOU X Y, KOLTUN V, KR?HENBüHL P. Tracking objects as points[C]∥Computer Vision-ECCV 2020. Cham: Springer International Publishing, 2020: 474-490. |
| [156] | WU J L, CAO J L, SONG L C, et al. Track to detect and segment: An online multi-object tracker[C]∥2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2021: 12347-12356. |
| [157] | SUN Y M, CAO B, ZHU P F, et al. Drone-based RGBinfrared cross-modality vehicle detection via uncertainty-aware learning[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(10): 6700-6713. |
| [158] | 刘延芳, 佘佳宇, 袁秋帆, 等. 无人机遥感图像实时 小目标检测方法[J]. 航空学报, 2024, 45(14): 630119. |
| LIU Y F, SHE J Y, YUAN Q F, et al. Real-time small target detection networks for UAV remote sensing[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(14): 630119 (in Chinese). | |
| [159] | 於志文, 孙卓, 程岳, 等. 智能无人机集群协同感知 计算研究综述[J]. 航空学报, 2024, 45(20): 630912. |
| YU Z W, SUN Z, CHENG Y, et al. A review of intelligent UAV swarm collaborative perception and computation[J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(20): 630912 (in Chinese). | |
| [160] | LI S Y, CHEN S L, LI X X, et al. Accurate and automatic spatiotemporal calibration for multi-modal sensor system based on continuous-time optimization[J]. Information Fusion, 2025, 120: 103071. |
/
| 〈 |
|
〉 |