电子电气工程与控制

基于多阶段蒸馏的无人机图像时敏目标增量检测算法

  • 成桢灏 ,
  • 杨小冈 ,
  • 卢瑞涛 ,
  • 张涛 ,
  • 王思宇
展开
  • 火箭军工程大学 导弹工程学院,西安 710025
.E-mail: doctoryxg@163.com

收稿日期: 2025-03-10

  修回日期: 2025-05-23

  录用日期: 2025-06-23

  网络出版日期: 2025-07-03

基金资助

陕西省重点研发计划(2024CY2-GJHX-42);国家自然科学基金(62276274);陕西省三秦英才特殊支持计划(2024-SQ-001)

Multi-stage distillation for incremental detection of time-sensitive targets in UAV images

  • Zhenhao CHENG ,
  • Xiaogang YANG ,
  • Ruitao LU ,
  • Tao ZHANG ,
  • Siyu WANG
Expand
  • College of Missile Engineering,Rocket Force University of Engineering,Xi’an 710025,China
E-mail: doctoryxg@163.com

Received date: 2025-03-10

  Revised date: 2025-05-23

  Accepted date: 2025-06-23

  Online published: 2025-07-03

Supported by

Key Research and Development Program of Shaanxi Province(2024CY2-GJHX-42);National Natural Science Foundation of China(62276274);Shaanxi SanqinElite Special Support Program(2024-SQ-001)

摘要

针对当前无人机图像时敏目标类增量检测面临的灾难性遗忘、过拟合以及难以适配密集检测器特性导致检测精度受限等问题,提出了一种基于多阶段蒸馏的时敏目标增量检测算法,算法主要包含基于连续Wasserstein距离的类间蒸馏(WICD)模块、基于原型引导的类内一致性蒸馏(PGICD)模块以及交叉预测自适应蒸馏(CAD)模块。WICD模块从特征图和语义查询向量中捕捉类间特征差异,利用高斯分布与连续Wasserstein距离,增强类间区分性。PGICD模块通过最小化教师网络和学生网络中实例的高层语义查询和低层特征图的原型差异,实现类内特征有效传递,增强类内一致性。CAD模块通过动态调整分类和回归分支的蒸馏权重,优化交叉预测蒸馏过程,缓解了增量学习中灾难性遗忘问题,提升了模型在复杂场景下的检测精度。在SIMD和MAR20数据集上的实验结果显示,所提方法在各类型的单步和多步增量场景下均表现优异,平均精度(AP)相比传统方法有显著提升。在SIMD数据集8类+7类的增量场景下,AP高达70.8%,与上限绝对差距为1.7%,相对差距为2.3%。在MAR20数据集10类+10类的增量场景下,AP高达60.2%,与上限的绝对差距为2.3%,相对差距为3.6%。此外,通过消融实验验证了各模块有效性,所得结果表明各模块有效地提升了无人机图像时敏目标增量检测性能。

本文引用格式

成桢灏 , 杨小冈 , 卢瑞涛 , 张涛 , 王思宇 . 基于多阶段蒸馏的无人机图像时敏目标增量检测算法[J]. 航空学报, 2025 , 46(24) : 331959 -331959 . DOI: 10.7527/S1000-6893.2025.31959

Abstract

To address the problems of catastrophic forgetting, overfitting, and difficulty in adapting dense detector characteristics leading to limited detection accuracy faced by the current time-sensitive target class incremental object detection of UAV images, this paper proposes a time-sensitive target incremental detection algorithm based on multi-stage distillation. The proposed method includes three key modules: the continuous Wasserstein distance-based Inter-Class Distillation (WICD) module, the Prototype-Guided Intra-class Consistency Distillation (PGICD) module, and the Cross-prediction Adaptive Distillation (CAD) module. The WICD module captures the inter-class feature differences from the feature graphs and semantic query vectors using Gaussian distributions with continuous Wasserstein distances to enhance the inter-class discriminability. The PGICD module enhances the inter-class discriminative properties by minimizing the high-level semantic query of the instances in both the teacher’s network and student’s network, and the low-level feature graphs with the prototype differences to achieve effective intra-class feature transfer and enhance intra-class consistency. The CAD module optimizes the cross-prediction distillation process by dynamically adjusting the distillation weights of the classification and regression branches, mitigating the problem of catastrophic forgetting in incremental learning, and improving the model’s detection accuracy in complex scenarios. Experimental results on SIMD and MAR20 datasets show that the proposed method performs well in all types of one-step and multi-step incremental scenarios, and the Average Precision (AP) is significantly improved compared with traditional methods. The AP is as high as 70.8% in the incremental scenario of SIMD dataset with 8+7 classes, with an absolute gap of 1.7% and a relative gap of 2.3% from the upper limit. The results also show that the AP is as high as 60.2% in the incremental scenario of MAR20 dataset with 10+10 classes, with an absolute gap of 2.3% and a relative gap of 3.6% from the upper limit. In addition, the effectiveness of each module is verified by ablation experiments, which effectively improves the incremental detection performance of time-sensitive targets in UAV images.

参考文献

[1] ZHANG X R, ZHANG T Y, WANG G C, et al. Remote sensing object detection meets deep learning: A metareview of challenges and advances[J]. IEEE Geoscience and Remote Sensing Magazine202311(4): 8-44.
[2] 赵其昌, 吴一全, 苑玉彬. 光学遥感图像舰船目标检测与识别方法研究进展[J]. 航空学报202445(8): 029025.
  ZHAO Q C, WU Y Q, YUAN Y B. Progress of ship detection and recognition methods in optical remote sensing images[J]. Acta Aeronautica et Astronautica Sinica202445(8): 029025 (in Chinese).
[3] 刘延芳, 佘佳宇, 袁秋帆, 等. 无人机遥感图像实时小目标检测方法[J]. 航空学报202445(14): 630119.
  LIU Y F, SHE J Y, YUAN Q F, et al. Real-time small target detection networks for UAV remote sensing[J]. Acta Aeronautica et Astronautica Sinica202445(14): 630119 (in Chinese).
[4] 肖欣林, 施伟超, 郑向涛, 等. 基于多模型协同的舰船目标检测[J]. 航空学报202445(14): 630241.
  XIAO X L, SHI W C, ZHENG X T, et al. Multiple models collaboration for ship detection[J]. Acta Aeronautica et Astronautica Sinica202445(14): 630241 (in Chinese).
[5] LI Z, WANG Y C, ZHANG N, et al. Deep learning-based object detection techniques for remote sensing images: A survey[J]. Remote Sensing202214(10): 2385.
[6] ZHOU D W, WANG Q W, QI Z H, et al. Class-incremental learning: A survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence202446(12): 9851-9873.
[7] MASANA M, LIU X L, TWARDOWSKI B, et al. Class-incremental learning: Survey and performance evaluation on image classification[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence202345(5): 5513-5533.
[8] MA B T, CONG Y, REN Y. IOSL: Incremental open set learning[J]. IEEE Transactions on Circuits and Systems for Video Technology202434(4): 2235-2248.
[9] KANG M X, ZHANG J P, ZHANG J M, et al. Alleviating catastrophic forgetting of incremental object detection via within-class and between-class knowledge distillation[C]∥ 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2023: 18848-18858.
[10] 方维维, 陈爱方, 孟娜, 等. 基于知识蒸馏的目标检测模型增量深度学习方法[J]. 工程科学与技术202254(6): 59-66.
  FANG W W, CHEN A F, MENG N, et al. Incremental deep learning method for object detection model based on knowledge distillation[J]. Advanced Engineering Sciences202254(6): 59-66 (in Chinese).
[11] KIM J, CHO H, KIM J, et al. SDDGR: Stable diffusion-based deep generative replay for class incremental object detection[C]∥ 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2024: 28772-28781.
[12] KIM J, KU Y, KIM J, et al. VLM-PL: Advanced pseudo labeling approach for class incremental object detection via vision-language model[C]∥ 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2024: 4170-4181.
[13] PENG C, ZHAO K, LOVELL B C. Faster ILOD: Incremental learning for object detectors based on faster RCNN[J]. Pattern Recognition Letters2020140: 109-115.
[14] FENG T, WANG M, YUAN H J. Overcoming catastrophic forgetting in incremental object detection via elastic response distillation[C]∥ 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 9417-9426.
[15] LIU Y Y, CONG Y, GOSWAMI D, et al. Augmented box replay: Overcoming foreground shift for incremental object detection[C]∥ 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2023: 11333-11343.
[16] LIU Y Y, SCHIELE B, VEDALDI A, et al. Continual detection transformer for incremental object detection[C]∥ 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 23799-23808.
[17] LI W Z, ZHOU J W, LI X, et al. InfRS: Incremental few-shot object detection in remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing202462: 5644314.
[18] 张涛, 杨小冈, 卢孝强, 等. Dense RFB和LSTM遥感图像舰船目标检测[J]. 遥感学报202226(9): 1859-1871.
  ZHANG T, YANG X G, LU X Q, et al. Ship detection in remote sensing image based on dense RFB and LSTM[J]. National Remote Sensing Bulletin202226(9): 1859-1871 (in Chinese).
[19] JOSEPH K J, RAJASEGARAN J, KHAN S, et al. Incremental object detection via meta-learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence202244(12): 9209-9216.
[20] BAI L, SONG H, FENG T, et al. Revisiting class-incremental object detection: An efficient approach via intrinsic characteristics alignment and task decoupling[J]. Expert Systems with Applications2024257: 125057.
[21] MO Q J, GAO Y P, FU S H, et al. Bridge past and future: Overcoming information asymmetry in incremental object detection[C]∥ Computer Vision-ECCV 2024. Cham: Springer Nature Switzerland, 2024: 463-480.
[22] LU X N, DIAO W H, LI J X, et al. Few-shot incremental object detection in aerial imagery via dual-frequency prompt[J]. IEEE Transactions on Geoscience and Remote Sensing202462: 5624017.
[23] YU Q Z, ZHU K, WANG W, et al. Incremental object detection with image-level labels[J]. IEEE Transactions on Artificial Intelligence20245(5): 2331-2341.
[24] DONG N, ZHANG Y Q, DING M L, et al. Incremental-DETR: Incremental few-shot object detection via self-supervised learning[J].Proceedings of the AAAI Conference on Artificial Intelligence202337(1): 543-551.
[25] LI J Y, CAO Z J, GAN Q, et al. Class-incremental SAR obeject detection via adaptive distributed response distillation[C]∥ IGARSS 2024-2024 IEEE International Geoscience and Remote Sensing Symposium. Piscataway: IEEE Press, 2024: 9922-9925.
[26] DONG N, ZHANG Y, DING M, et al. Bridging non-co-occurrence with unlabeled in-the-wild data for incremental object detection[C]∥ Proceedings of the 35th International Conference on Neural Information Processing Systems, New York:ACM, 2021:30492-30503.
[27] LI D, TASCI S, GHOSH S, et al. RILOD: Near real-time incremental learning for object detection at the edge[C]∥ Proceedings of the 4th ACM/IEEE Symposium on Edge Computing. New York:ACM, 2019: 113-126.
[28] CERMELLI F, GERACI A, FONTANEL D, et al. Modeling missing annotations for incremental learning in object detection[C]∥ 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Piscataway: IEEE Press, 2022: 3699-3709.
[29] WANG Y J, CHEN L Q, ZHAO T M, et al. High-dimension prototype is a better incremental object detection learner[C]∥ The Thirteenth International Conference on Learning Representations, 2025: 1-17.
[30] WANG J B, CHEN Y M, ZHENG Z H, et al. CrossKD: Cross-head knowledge distillation for object detection[C]∥ 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2024: 16520-16530.
[31] HAROON M, SHAHZAD M, FRAZ M M. Multisized object detection using spaceborne optical imagery[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing202013: 3032-3046.
[32] 禹文奇,程塨,王美君,等.MAR20:遥感图像军用飞机目标识别数据集[J].遥感学报202327(12):2688-2696.
  YU W Q, CHENG G, WANG M J, et al. MAR20: A benchmark for military aircraft recognition in remote sensing images[J]. National Remote Sensing Bulletin202327(12): 2688-2696 (in Chinese).
[33] LIU L Y, KUANG Z H, CHEN Y M, et al. IncDet: In defense of elastic weight consolidation for incremental object detection[J]. IEEE Transactions on Neural Networks and Learning Systems202132(6): 2306-2319.
[34] MENEZES A G, DE MOURA G, ALVES C, et al. Continual object detection: A review of definitions, strategies, and challenges[J]. Neural Networks2023161: 476-493.
文章导航

/