导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (14): 630119-630119.doi: 10.7527/S1000-6893.2024.30119

• special column • Previous Articles     Next Articles

Real⁃time small target detection networks for UAV remote sensing

Yanfang LIU1,2(), Jiayu SHE1, Qiufan YUAN3, Rui ZHOU1, Naiming QI1,2   

  1. 1.School of Astronautics,Harbin Institute of Technology,Harbin 150001,China
    2.Suzhou Research Institute of HIT,Suzhou 215104,China
    3.Shanghai Aerospace System Engineering Institute,Shanghai 201109,China
  • Received:2024-01-08 Revised:2024-01-18 Accepted:2024-03-26 Online:2024-07-25 Published:2024-04-10
  • Contact: Yanfang LIU E-mail:lyf04025121@126.com
  • Supported by:
    National Natural Science Foundation of China(52272390);Natural Science Foundation of Heilongjiang Province(YQ2022A009)

Abstract:

Benefiting from deep learning methods, the performance of object detection methods has greatly improved in recent years. However, significant challenges still exist in detecting targets from UAV remote sensing images. For example, the targets in UAV remote sensing images have small resolution and complex background, and the existing algorithms are difficult to meet the requirement for real-timeliness. To overcome these challenges, this paper proposes a Real-Time Small Target Detection (RTSTD) method based on a Multi-scalar & Multi-depth Feature Extraction (MMFE) network, which can efficiently detect small targets from UAV remote sensing images. The proposed RTSTD crops an input image into multiple small-size images, and feeds a portion of these small-size images into the lightweight MMFE network. Therefore, RTSTD has the capability to handle remote sensing images of arbitrary resolutions without losing image features. A more effective output is proposed for the MMFE network: an overlap vector that represents the position and confidence of the target in the input image. To enhance the MMFE network’s ability to distinguish targets from complex backgrounds, the positive and negative samples are redefined. To test the performance of RTSTD, seven datasets are selected and reconstructed from UAV123, DTB70 and AU-AIR, comprising a total of 8,369 UAV remote sensing images involving small target detection in the ground and sea scenarios. The experimental results demonstrate that compared to existing detection methods, the RTSTD method achieves improvements in both accuracy and speed. It achieves an F-Score of 0.90 or above, with a running speed of over 66 frames per second (FPS) using GPU acceleration and over 35 FPS using only CPU.

Key words: remote sensing image, small target detection, real-time detection, convolutional neural network, feature fusion

CLC Number: