Real-time Small Target Detection Networks for UAV Remote Sensing

doi:10.7527/S1000-6893.2024.30119

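Yan-Fang LIU¹, Jia-Yu SHE¹, Qiu-Fan YUAN², Rui ZHOU¹, Nai-Ming QI¹

1. Harbin Institute of Technology
2. Shanghai Institute of Aerospace System Engineering

Received: 2024-01-08 Revised: 2024-04-02 Online: 2024-04-10 Published: 2024-04-10
Corresponding author: Yan-Fang LIU
Supported by:
National Natural Science Foundation of China; Natural Science Foundation of Heilongjiang Province of China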

Abstract

Deep learning has substantially improved object detection in recent years, yet detecting targets in Unmanned Aerial Vehicle (UAV) remote sensing images remains difficult: targets have low resolution, backgrounds are complex, and existing algorithms struggle to meet real-time requirements. To address these challenges, this paper proposes a Real-Time Small Target Detection (RTSTD) method based on a Multi-scalar & Multi-depth Feature Extraction (MMFE) network that efficiently detects small targets in UAV remote sensing images. RTSTD crops an input image into multiple small images and feeds a subset of them into the lightweight MMFE network, so it can process remote sensing images of arbitrary resolution without losing fine image detail. For the MMFE network, a more effective output is proposed: the overlap vector, which represents the position and confidence of the target in the input image. To strengthen the MMFE network's ability to distinguish targets from complex backgrounds, positive and negative samples are redefined. To evaluate RTSTD, seven datasets comprising 8369 UAV remote sensing images are selected and reconstructed from the open-source datasets UAV123, DTB70 and AU-AIR, covering small target detection in ground and sea scenes. The experimental results show that, compared with existing detection methods, RTSTD improves both accuracy and speed, achieving an average F1-Score above 0.90 at more than 66 frames per second (FPS) with GPU acceleration and more than 35 FPS on a CPU alone.
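
The cropping step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the tile size, the stride, or the rule used to choose which tiles are fed to the network, so the function name `crop_into_tiles`, the 128-pixel defaults, and the selection rule below are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def crop_into_tiles(image, tile=128, stride=128):
    """Split an H x W (x C) array into equally sized square tiles.

    Returns a list of (top, left, patch) tuples. Tile size and stride are
    illustrative assumptions; the source abstract does not specify them.
    """
    height, width = image.shape[:2]
    if height < tile or width < tile:
        raise ValueError("image must be at least tile x tile pixels")

    def offsets(size):
        # Regular grid of start positions, plus one flush with the far
        # border so that no rows or columns are dropped.
        starts = list(range(0, size - tile + 1, stride))
        if starts[-1] != size - tile:
            starts.append(size - tile)
        return starts

    return [
        (top, left, image[top:top + tile, left:left + tile])
        for top in offsets(height)
        for left in offsets(width)
    ]


# Stand-in for a UAV frame; any resolution of at least one tile works.
frame = np.zeros((300, 500, 3), dtype=np.uint8)
tiles = crop_into_tiles(frame)

# Placeholder rule for choosing "a portion" of the tiles: keep the upper rows.
# The actual selection strategy is not described in the abstract.
selected = [t for t in tiles if t[0] < 150]
```

Because every patch keeps the original pixels, the input resolution never has to be reduced, which is consistent with the claim that arbitrary resolutions can be handled without losing detail. Edge tiles in this sketch overlap their neighbours rather than being padded; that is one possible design choice among several.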

Key words: Remote Sensing Images, Small Target Detection, Real-time Detection, Convolutional Neural Network, Feature Fusion
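
The accuracy figure quoted in the abstract is an F1-Score, the harmonic mean of precision and recall. The sketch below shows how such a score could be computed from detection counts and averaged over several datasets. The counts are made-up illustrative numbers, not results from the paper, and the averaging scheme (an unweighted mean per dataset) and the helper name `f1_score` are assumptions, since the abstract does not state how the average was formed or how detections were matched to ground truth.

```python
def f1_score(true_positives, false_positives, false_negatives):
    """Return F1 = 2 * P * R / (P + R) for one set of detection counts."""
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    if predicted == 0 or actual == 0:
        return 0.0  # no detections or no targets: define F1 as zero
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative (TP, FP, FN) counts for two hypothetical datasets.
counts = [(8, 2, 2), (9, 1, 3)]
scores = [f1_score(tp, fp, fn) for tp, fp, fn in counts]

# Unweighted mean over datasets (an assumed averaging scheme).
mean_f1 = sum(scores) / len(scores)
```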

CLC number: P407.8

Yan-Fang LIU, Jia-Yu SHE, Qiu-Fan YUAN, Rui ZHOU, Nai-Ming QI. Real-time Small Target Detection Networks for UAV Remote Sensing[J]. Acta Aeronautica et Astronautica Sinica, doi: 10.7527/S1000-6893.2024.30119.
