导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (23): 631952.doi: 10.7527/S1000-6893.2025.31952

• special column • Previous Articles    

Wavelet time-frequency localization-based model compression for UAV object detection

Wei HUANG, Jiahao PAN, Chu HE()   

  1. School of Electronic Information,Wuhan University,Wuhan 430072,China
  • Received:2025-03-10 Revised:2025-03-28 Accepted:2025-04-25 Online:2025-05-23 Published:2025-05-08
  • Contact: Chu HE E-mail:chuhe@whu.edu.cn
  • Supported by:
    National Key Research and Development Program of China(2016YFC0803000);National Natural Science Foundation of China(41371342)

Abstract:

Current deep learning-based remote sensing object detection models rely on the powerful computing and storage resources of ground servers, and now capable of efficiently processing massive remote sensing data. However, in deployment scenarios for mobile edge devices such as Unmanned Aerial Vehicles (UAVs), the limited computational resources, storage capacity, and energy constraints pose significant challenges for effectively deploying large-scale models. Model compression techniques, such as lightweight design and model quantization, have become critical forfacilitating the practical application of remote sensing object detection algorithms. This paper proposes a wavelet time-frequency localization-based model compression framework, using the single-stage object detection network as the baseline: By integrating depthwise separable convolution with the spatial-frequency localization characteristics of discrete wavelet transforms, a Wavelet Depthwise Separable Convolution (W-DSConv) module is constructed to achieve lightweight model reconstruction while expanding its receptive field; Leveraging wavelet frequency domain decomposition, a Wavelet Frequency-Division Quantization (W-FDQ) method for quantization-aware training is proposed, enabling independent quantization of features across different frequency bands to further compress the lightweight model. Experiments are conducted using YOLO-series models on the VisDrone2021 UAV remote sensing dataset.Results demonstrate that: The W-DSConv module reduces model parameters by 45.5% and computational load by 37.4%, while limiting detection accuracy fluctuations to within 2.2%; When applying 6-bit and 4-bit quantization via W-FDQ, the quantized models retain 95.8% and 92.9% of the floating-point model’s detection performance, respectively. This research provides novel technical insights for lightweight deployment of remote sensing object detection models on mobile platforms.

Key words: object detection, Unmanned Aerial Vehicles (UAVs), model compression, lightweight, model quantization, wavelet transforms

CLC Number: