ACTA AERONAUTICA ET ASTRONAUTICA SINICA ›› 2023, Vol. 44 ›› Issue (7): 327106. doi: 10.7527/S1000-6893.2022.27106
• Electronics and Electrical Engineering and Control •
Zhiqiang FENG1, Zhijun XIE1, Zhengwei BAO2, Kewei CHEN3
Received:2022-03-04
Revised:2022-03-22
Accepted:2022-04-28
Online:2023-04-15
Published:2022-05-11
Contact:
Zhijun XIE
E-mail:xiezhijun@nbu.edu.cn
Zhiqiang FENG, Zhijun XIE, Zhengwei BAO, Kewei CHEN. Real-time dense small object detection algorithm for UAV based on improved YOLOv5[J]. ACTA AERONAUTICA ET ASTRONAUTICA SINICA, 2023, 44(7): 327106.
Table 2
Ablation experiment
| Model | Algorithm | mAP50/% | mAP75/% | mAP50:95/% | Pre/% | Params/M | GFLOPs | FPS_1504 |
|---|---|---|---|---|---|---|---|---|
| A | YOLOv5s | 33.0 | 14.8 | 16.5 | 45.1 | 7.0371 | 15.9 | 122 |
| B | YOLOv5s + CBAM_neck | 33.6 | 15.2 | 16.8 | 47.3 | 7.3470 | 16.7 | 82 |
| C | YOLOv5s + SCAM_neck | 34.3 | 16.9 | 17.7 | 48.0 | 7.0479 | 15.9 | 122 |
| D | C + SC-AFF | 37.0 | 18.7 | 19.5 | 49.1 | 7.0544 | 16.0 | 118 |
| E | YOLOv5s + Transformer | 36.0 | 16.3 | 18.1 | 48.7 | 8.4294 | 19.1 | 46 |
| F | YOLOv5s + SC-Transformer | 37.2 | 19.5 | 20.2 | 49.3 | 8.4316 | 19.1 | 46 |
| G | F + SCAM_backbone&neck + SC-AFF | 39.4 | 20.6 | 21.4 | 50.9 | 8.4575 | 19.2 | 46 |
Table 3
Effect of different input image resolutions and network size during training
| Algorithm (input size) | mAP50/% | Params/M | GFLOPs | Size/MB | FPS_1504 |
|---|---|---|---|---|---|
| YOLOv5n (640) | 27.9 | 1.777 | 4.2 | 14.8 | 384 |
| YOLOv5n (1024) | 39.3 | 1.777 | 10.8 | 15.0 | 384 |
| YOLOv5n (1504) | 46.7 | 1.777 | 23.3 | 15.3 | 384 |
| YOLOv5s (640) | 33.0 | 7.037 | 15.9 | 57.0 | 122 |
| YOLOv5s (1024) | 47.2 | 7.037 | 40.8 | 57.1 | 122 |
| YOLOv5s (1504) | 51.9 | 7.037 | 88.0 | 57.5 | 122 |
| Proposed-s (640) | 39.4 | 8.438 | 19.2 | 68.2 | 46 |
| Proposed-s (1024) | 48.6 | 8.438 | 51.2 | 69.2 | 46 |
| Proposed-s (1504) | 54.5 | 8.438 | 109.7 | 71.0 | 46 |
| Proposed-m (640) | 40.3 | 25.480 | 58.3 | 201.9 | 31 |
| Proposed-m (1024) | 50.5 | 25.480 | 149.2 | 202.9 | 31 |
| Proposed-m (1504) | 55.6 | 25.480 | 321.8 | 204.7 | 31 |
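
The GFLOPs column in Table 3 grows roughly with the pixel count of the input, that is, with the square of the input side length. The short Python sketch below is not from the paper; it only checks that rule of thumb against the reported numbers. The function names (`predicted_gflops`, `relative_errors`) are hypothetical, and square inputs are an assumption.

```python
# Values copied from Table 3: (input side in pixels, reported GFLOPs).
REPORTED = {
    "YOLOv5n": [(640, 4.2), (1024, 10.8), (1504, 23.3)],
    "YOLOv5s": [(640, 15.9), (1024, 40.8), (1504, 88.0)],
    "Proposed-s": [(640, 19.2), (1024, 51.2), (1504, 109.7)],
    "Proposed-m": [(640, 58.3), (1024, 149.2), (1504, 321.8)],
}


def predicted_gflops(base_side, base_gflops, side):
    """Scale a baseline GFLOPs figure by the ratio of pixel counts.

    Assumes square inputs, so the pixel count is proportional to side**2.
    """
    return base_gflops * (side / base_side) ** 2


def relative_errors(rows):
    """Relative error of the quadratic prediction, using the first row as baseline."""
    base_side, base_gflops = rows[0]
    return [
        abs(predicted_gflops(base_side, base_gflops, side) - reported) / reported
        for side, reported in rows[1:]
    ]


if __name__ == "__main__":
    for name, rows in REPORTED.items():
        errors = ", ".join(f"{e:.1%}" for e in relative_errors(rows))
        print(f"{name}: relative error vs. quadratic scaling = {errors}")
```

Under this assumption, every reported GFLOPs value in Table 3 lies within about 5% of the value extrapolated from the 640-pixel row.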
Table 4
Comparison experiments of different object detection algorithms
| Algorithm | mAP50/% | mAP75/% | mAP50:95/% | FPS_1504 |
|---|---|---|---|---|
| RetinaNet | 28.7 | 11.6 | 11.8 | |
| RefineDet | 28.8 | 14.1 | 14.9 | |
| Cascade-RCNN | 31.9 | 15.6 | 16.1 | |
| FPN | 32.2 | 14.9 | 16.5 | |
| Light-RCNN | 32.8 | 15.1 | 16.5 | |
| Faster-RCNN | 33.2 | 15.2 | 17.0 | 15 |
| CornerNet | 34.1 | 15.9 | 17.4 | 33 |
| YOLOv3 | 41.7 | 22.9 | 24.5 | 31 |
| YOLOv3-SPP | 41.9 | 23.1 | 25.4 | 32 |
| YOLOv4 | 43.0 | 25.2 | 24.9 | 35 |
| YOLOv5-v6.0 | 44.7 | 26.8 | 26.4 | 35 |
| Proposed algorithm | 54.5 | 33.1 | 32.0 | 46 |

References
[1] JIANG B, QU R K, LI Y D, et al. Object detection in UAV imagery based on deep learning: Review[J]. Acta Aeronautica et Astronautica Sinica, 2021, 42(4): 524519 (in Chinese).
[2] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
[3] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 779-788.
[4] LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot MultiBox detector[C]∥European Conference on Computer Vision (ECCV). Amsterdam: Springer, 2016: 21-37.
[5] REDMON J, FARHADI A. YOLO9000: Better, faster, stronger[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 6517-6525.
[6] REDMON J, FARHADI A. YOLOv3: An incremental improvement[DB/OL]. arXiv preprint: 1804.02767, 2018.
[7] BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: Optimal speed and accuracy of object detection[DB/OL]. arXiv preprint: 2004.10934, 2020.
[8] LI K C, WANG X Q, LIN H, et al. Survey of one-stage small object detection methods in deep learning[J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(1): 41-58 (in Chinese).
[9] WANG Q C, ZHANG H, HONG X G, et al. Small object detection based on modified FSSD and model compression[C]∥2021 IEEE 6th International Conference on Signal and Image Processing (ICSIP), 2021: 88-92.
[10] GONG Y Q, YU X H, DING Y, et al. Effective fusion factor in FPN for tiny object detection[C]∥2021 IEEE Winter Conference on Applications of Computer Vision. Piscataway: IEEE Press, 2021: 1159-1167.
[11] LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 936-944.
[12] LIU F, HAN X. Adaptive aerial object detection based on multi-scale deep learning[J]. Acta Aeronautica et Astronautica Sinica, 2022, 43(5): 325270 (in Chinese).
[13] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional block attention module[C]∥Computer Vision – ECCV 2018, 2018.
[14] WANG Q L, WU B G, ZHU P F, et al. ECA-net: Efficient channel attention for deep convolutional neural networks[C]∥2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2020: 11531-11539.
[15] LIU S, QI L, QIN H F, et al. Path aggregation network for instance segmentation[C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 8759-8768.
[16] DAI Y M, GIESEKE F, OEHMCKE S, et al. Attentional feature fusion[C]∥2021 IEEE Winter Conference on Applications of Computer Vision. Piscataway: IEEE Press, 2021: 3559-3568.
[17] ZHU L L, GENG X, LI Z, et al. Improving YOLOv5 with attention mechanism for detecting boulders from planetary images[J]. Remote Sensing, 2021, 13(18): 3776.
[18] ZHU X K, LYU S C, WANG X, et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]∥2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Piscataway: IEEE Press, 2021: 2778-2788.
[19] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: Transformers for image recognition at scale[C]∥International Conference on Learning Representations (ICLR), 2021.
[20] PAN X R, GE C J, LU R, et al. On the integration of self-attention and convolution[C]∥2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 805-815.
[21] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[DB/OL]. arXiv preprint: 1706.03762, 2017.
[22] LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection[C]∥2017 IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2017: 2999-3007.
[23] ZHANG S F, WEN L Y, BIAN X, et al. Single-shot refinement neural network for object detection[C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 4203-4212.
[24] CAI Z W, VASCONCELOS N. Cascade R-CNN: Delving into high quality object detection[C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 6154-6162.
[25] LI Z M, PENG C, YU G, et al. Light-head R-CNN: In defense of two-stage object detector[DB/OL]. arXiv preprint: 1711.07264, 2017.
[26] LAW H, DENG J. CornerNet: Detecting objects as paired keypoints[J]. International Journal of Computer Vision, 2020, 128(3): 642-656.
[27] HE K M, ZHANG X Y, REN S Q, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916.
Address: No.238, Baiyan Building, Beisihuan Zhonglu Road, Haidian District, Beijing, China
Postal code: 100083
E-mail: hkxb@buaa.edu.cn
Total visits: 6658907 Today visits: 1341
All copyright © editorial office of Chinese Journal of Aeronautics

