基于改进YOLOv5的无人机实时密集小目标检测算法

doi:10.7527/S1000-6893.2022.27106

Abstract

Abstract:

UAV aerial images have more complex backgrounds and a large number of dense small targets compared with natural scene images, which impose higher requirements on the detection network. On the premise of ensuring real-time object detection, a YOLOv5-based UAV real-time dense small object detection algorithm is proposed for the problem of low accuracy of dense small object detection in UAV view. First, combining Spatial Attention Module (SAM) with Channel Attention Module (CAM), the fully connected layer after feature compression in CAM is improved to reduce the computational effort. In addition, the connection structure of CAM and SAM is changed to improve the spatial dimensional feature capture capability. In summary, a Spatial-Channel Attention Module (SCAM) is proposed to improve the model's attention to the aggregated regions of small targets in the feature map; secondly, an SCAM- based Attentional Feature Fusion module (SC-AFF) is proposed to enhance the feature fusion efficiency of small targets by adaptively assigning attentional weights according to feature maps of different scales; finally, a backbone network is introduced in the Transformer in the backbone network, and use the SC-AFF to improve the feature fusion at the original residual connections to better capture global information and rich contextual information, and improve the feature extraction capability of dense small targets in complex backgrounds. Experiments are conducted on the VisDrone2021 dataset. The effects of different network scale parameters and different input resolutions on the detection accuracy and speed of YOLOv5 are first investigated. The analysis concludes that YOLOv5s is more suitable to be used as a benchmark model for UAV real-time object detection. Under the benchmark of YOLOv5s, the improved model improves mAP50 by 6.4% and mAP75 by 5.8%, and the FPS for high-resolution images can reach 46. The mAP50 of the model trained at an input resolution of 1504×1504 can reach 54.5%, which is 11.5% better than that of YOLOv4. The accuracy is improved while the detection speed FPS remains at 46, which is more suitable for real-time UAV object detection in dense small target scenarios.

Key words: UAV, small object detection, attention mechanism, self-attention mechanism, feature fusion

CLC Number:

V279

Zhiqiang FENG, Zhijun XIE, Zhengwei BAO, Kewei CHEN. Real⁃time dense small object detection algorithm for UAV based on improved YOLOv5[J]. ACTA AERONAUTICAET ASTRONAUTICA SINICA, 2023, 44(7): 327106-327106.

Figures/Tables 18

Fig. 1

Table 1

Fig. 2

Fig. 3

Fig. 4

Fig. 5

Fig. 6

Fig. 7

Fig. 8

Fig. 9

Fig. 10

Table 2

Fig. 11

Table 3

Fig. 12

Table 4

Fig. 13

Fig. 14

References 27

1	江波，屈若锟，李彦冬，等. 基于深度学习的无人机航拍目标检测研究综述［J］. 航空学报， 2021， 42（4）： 524519.
	JIANG B， QU R K， LI Y D， et al. Object detection in UAV imagery based on deep learning： Review［J］. Acta Aeronautica et Astronautica Sinica， 2021， 42（4）： 524519 （in Chinese）.
2	REN S Q， HE K M， GIRSHICK R， et al. Faster R-CNN： Towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（6）： 1137-1149.
3	REDMON J， DIVVALA S， GIRSHICK R， et al. You only look once： Unified， real-time object detection［C］∥2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2016： 779-788.
4	LIU W， ANGUELOV D， ERHAN D， et al. SSD： Single shot MultiBox detector［C］∥European Conference on Computer Vision （ECCV）. Amsterdam： Springer， 2016： 21-37.
5	REDMON J， FARHADI A. YOLO9000： Better， faster， stronger［C］∥2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2017： 6517-6525.
6	REDMON J， FARHADI A. YOLOv3： An incremental improvement［DB/OL］. arXiv preprint： 1804.02767， 2018.
7	BOCHKOVSKIY A， WANG C Y， LIAO H Y M. YOLOv4： Optimal speed and accuracy of object detection［DB/OL］. arXiv preprint： 2004.10934， 2020.
8	李科岑，王晓强，林浩，等. 深度学习中的单阶段小目标检测方法综述［J］. 计算机科学与探索， 2022， 16（1）： 41-58.
	LI K C， WANG X Q， LIN H， et al. Survey of one-stage small object detection methods in deep learning［J］. Journal of Frontiers of Computer Science and Technology， 2022， 16（1）： 41-58 （in Chinese）.
9	WANG Q C， ZHANG H， HONG X G， et al. Small object detection based on modified FSSD and model compression［J］. 2021 IEEE 6th International Conference on Signal and Image Processing （ICSIP）， 2021： 88-92.
10	GONG Y Q， YU X H， DING Y， et al. Effective fusion factor in FPN for tiny object detection［C］∥2021 IEEE Winter Conference on Applications of Computer Vision. Piscataway： IEEE Press， 2021： 1159-1167.
11	LIN T Y， DOLLÁR P， GIRSHICK R， et al. Feature pyramid networks for object detection［C］∥2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2017： 936-944.
12	刘芳，韩笑. 基于多尺度深度学习的自适应航拍目标检测［J］. 航空学报， 2022， 43（5）： 325270.
	LIU F， HAN X. Adaptive aerial object detection based on multi-scale deep learning［J］. Acta Aeronautica et Astronautica Sinica， 2022， 43（5）： 325270 （in Chinese）.
13	WOO S， PARK J， LEE J Y， et al. CBAM： Convolutional block attention module［C］∥Computer Vision – ECCV 2018， 2018.
14	WANG Q L， WU B G， ZHU P F， et al. ECA-net： Efficient channel attention for deep convolutional neural networks［C］∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. Piscataway： IEEE Press， 2020： 11531-11539.
15	LIU S， QI L， QIN H F， et al. Path aggregation network for instance segmentation［C］∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2018： 8759-8768.
16	DAI Y M， GIESEKE F， OEHMCKE S， et al. Attentional feature fusion［C］∥2021 IEEE Winter Conference on Applications of Computer Vision. Piscataway： IEEE Press， 2021： 3559-3568.
17	ZHU L L， GENG X， LI Z， et al. Improving YOLOv5 with attention mechanism for detecting boulders from planetary images［J］. Remote Sensing， 2021， 13（18）： 3776.
18	ZHU X K， LYU S C， WANG X， et al. TPH-YOLOv5： Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios［C］∥2021 IEEE/CVF International Conference on Computer Vision Workshops （ICCVW）. Piscataway： IEEE Press， 2021： 2778-2788.
19	DOSOVITSKIY A， BEYER L， KOLESNIKOV A， et al. An image is worth 16×16 words： Transformers for image recognition at scale［C］∥ International Conference on Learning Representations （ICLR）， 2021.
20	PAN X R， GE C J， LU R， et al. On the integration of self-attention and convolution［C］∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. Piscataway： IEEE Press， 2022： 805-815.
21	VASWANI A， SHAZEER N， PARMAR N， et al. Attention is all You need［DB/OL］. arXiv preprint： 1706.03762， 2017.
22	LIN T Y， GOYAL P， GIRSHICK R， et al. Focal loss for dense object detection［C］∥2017 IEEE International Conference on Computer Vision. Piscataway： IEEE Press， 2017： 2999-3007.
23	ZHANG S F， WEN L Y， BIAN X， et al. Single-shot refinement neural network for object detection［C］∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2018： 4203-4212.
24	CAI Z W， VASCONCELOS N. Cascade R-CNN： Delving into high quality object detection［C］∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE Press， 2018： 6154-6162.
25	LI Z M， PENG C， YU G， et al. Light-head R-CNN： In defense of two-stage object detector［DB/OL］. arXiv preprint： 1711. 07264， 2017.
26	LAW H， DENG J. CornerNet： Detecting objects as paired keypoints［J］. International Journal of Computer Vision， 2020， 128（3）： 642-656.
27	HE K M， ZHANG X Y， REN S Q， et al. Spatial pyramid pooling in deep convolutional networks for visual recognition［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2015， 37（9）： 1904-1916.

模型	参数量	计算量/G	mAP₅₀/%
YOLOv5s	7 037 095	15.926 8	33.04
YOLOv5s+CAM	7 099 079	16.067 1	33.21
YOLOv5s+CAM-	7 037 310	15.927 3	33.30

模型	算法	mAP₅₀/%	mAP₇₅/%	mAP_50：95/%	Pre/%	Params/M	GFLOPs	FPS_{1 504}
A	YOLOv5s	33.0	14.8	16.5	45.1	7.037 1	15.9	122
B	YOLOv5s+CBAM_neck	33.6	15.2	16.8	47.3	7.347 0	16.7	82
C	YOLOv5s+SCAM_neck	34.3	16.9	17.7	48.0	7.047 9	15.9	122
D	C+SC-AFF	37.0	18.7	19.5	49.1	7.054 4	16.0	118
E	YOLOv5s+Transformer	36.0	16.3	18.1	48.7	8.429 4	19.1	46
F	YOLOv5s+SC-Transformer	37.2	19.5	20.2	49.3	8.431 6	19.1	46
G	F+SCAM_{backbone&neck}+SC-AFF	39.4	20.6	21.4	50.9	8.457 5	19.2	46

算法	mAP₅₀	Param/M	GFLOPs	Size/MB	FPS_{1 504}
YOLOv5n⁶⁴⁰	27.9	1.777	4.2	14.8	384
YOLOv5n^{1 024}	39.3	1.777	10.8	15.0	384
YOLOv5n^{1 504}	46.7	1.777	23.3	15.3	384
YOLOv5s⁶⁴⁰	33.0	7.037	15.9	57.0	122
YOLOv5s^{1 024}	47.2	7.037	40.8	57.1	122
YOLOv5s^{1 504}	51.9	7.037	88.0	57.5	122
Proposed-s⁶⁴⁰	39.4	8.438	19.2	68.2	46
Proposed-s^{1 024}	48.6	8.438	51.2	69.2	46
Proposed-s^{1 504}	54.5	8.438	109.7	71.0	46
Proposed-m⁶⁴⁰	40.3	25.480	58.3	201.9	31
Proposed-m^{1 024}	50.5	25.480	149.2	202.9	31
Proposed-m^{1 504}	55.6	25.480	321.8	204.7	31

算法	mAP₅₀/%	mAP₇₅/%	mAP_50：95/%	FPS_{1 504}
RetinaNet^［22］	28.7	11.6	11.8
RetfineDet^［23］	28.8	14.1	14.9
Cascade-RCNN^［24］	31.9	15.6	16.1
FPN	32.2	14.9	16.5
Light-RCNN^［25］	32.8	15.1	16.5
Faster-RCNN	33.2	15.2	17.0	15
CornerNet^［26］	34.1	15.9	17.4	33
YOLOv3	41.7	22.9	24.5	31
YOLOv3-SPP	41.9	23.1	25.4	32
YOLOv4	43.0	25.2	24.9	35
YOLOv5-v6.0	44.7	26.8	26.4	35
本文算法	54.5	33.1	32.0	46

[1]	Haifeng WANG, Kunpeng LIU, Hongxin JIANG, Chenxi DU. Aerodynamic optimization method of propeller multi⁃design points and variable pitch angle strategy [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(9): 528831-528831.
[2]	Jing ZHAO, Dan SONG. Integrity monitoring method for GNSS/IMU integrated navigation system of UAV [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(7): 328943-328943.
[3]	Chuanyun WANG, Yang SU, Linlin WANG, Tian WANG, Jingjing WANG, Qian GAO. Multi-object continuous robust tracking algorithm for anti-UAV swarm [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(7): 329017-329017.
[4]	Hongyu YIN, Yu WU, Tianjiao LIANG. Cooperative path planning for patrol coverage of fixed wing UAV [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(6): 328944-328944.
[5]	Xudong LUO, Yiquan WU, Jinlin CHEN. Research progress on deep learning methods for object detection and semantic segmentation in UAV aerial images [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(6): 28822-028822.
[6]	Chuihuan KONG, Dawei WU, Zhaoguang TAN, Lijun PAN, Rubing MA, Jiangtao SI. Design of fully electric scheme for three⁃surface verification aircraft [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(6): 629618-629618.
[7]	Guangjia LI, Hongbo WANG, Kai ZHANG, Zhisheng YI. Lift enhancement and drag reduction technologies of solar powered unmanned aerial vehicles in near space: Review [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(5): 529644-529644.
[8]	Yunpeng CAI, Dapeng ZHOU, Jiangchuan DING. Intelligent collaborative control of UAV swarms with collision avoidance safety constraints [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(5): 529683-529683.
[9]	Zhu WANG, Mengtong ZHANG, Zhenpeng ZHANG, Guangtong XU. Multi-UAV cooperative path planning based on multi-index dynamic priority [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(4): 328816-328816.
[10]	Gaojie ZHENG, Xiaoming HE, Dongpo LI, Huijun TAN, Kun WANG, Zhenlong WU, Depeng WANG. Double 90° deflection inlet/volute coupling flow characteristics of tail-powered unmanned aerial vehicle [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(4): 128782-128782.
[11]	Jiqiang GAN, Xiaoping WANG. Surface defect detection of fiber placement based on virtual sample generation [J]. Acta Aeronautica et Astronautica Sinica, 2024, 45(1): 428624-428624.
[12]	Hongzhen GUO, Mou CHEN, Yongdong DAI, Maofei WANG. Distributed adaptive event⁃triggered formation control for QUAVs [J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(S2): 729917-729917.
[13]	Kunda LIU, Xueming LIU, Bo ZHU, Qingrui ZHANG. Robust safe control for multi⁃UAV formation flight through narrow corridors [J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(S2): 729768-729768.
[14]	Wenkang HAO, Suyan BAO, Qifeng CHEN. Distributed control of UAVs formation based on port⁃Hamiltonian system [J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(S2): 729868-729868.
[15]	Yuqi CAO, Haoran FU, Fei GAO, Ximin LYU. Trajectory tracking control algorithm for canard⁃equipped tail⁃sitting vertical takeoff and landing UAV based on MPCC [J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(S2): 729950-729950.

Real⁃time dense small object detection algorithm for UAV based on improved YOLOv5

RichHTML

PDF (PC)

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 18

References 27

Related Articles 15

Recommended Articles 0

Metrics

Comments