导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (14): 629490-629490.doi: 10.7527/S1000-6893.2024.29490

• special column • Previous Articles     Next Articles

Infrared small target detection based on multi⁃layer multi⁃direction transformer

Xiao WANG, Zhenbao LIU()   

  1. School of Civil Aviation,Northwestern Polytechnical University,Xi’an 710072,China
  • Received:2023-08-30 Revised:2023-10-30 Accepted:2024-01-08 Online:2024-07-25 Published:2024-01-24
  • Contact: Zhenbao LIU E-mail:liuzhenbao@nwpu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(52072309);Key Research and Development Program of Shaanxi(2019ZDLGY14-02-01);Shenzhen Fundamental Research Program(JCYJ20190806152203506);Aeronautical Science Foundation of China(ASFC-2018ZC53026)

Abstract:

The convolution neural network based infrared small target detection suffers from the problems of limited receptive field of convolution kernel, information loss caused by down sampling operation, and limited power of the convolution neural network in relative information extraction. To solve these problems, a multi-layer multi-direction Transformer based neural network is proposed. Firstly, the Transformer block is adopted as the basic operator since it has a larger receptive field and more powerful in extracting relative information. The proposed network is a U-shaped network, and fuses local and global information with multi-layers structure. Meanwhile, to enhance the network’s ability to detect the infrared small target, a dual-direction attention operator which calculates the attention information along spatial and channel directions is designed for the decoder network. Finally, an additional network is added to the backbone network to calculate the number of the detected infrared small targets. This additional network reduces the number of falsely detected targets by comparing the calculated number with ground truth. The proposed method is tested on several datasets and the evaluation metrics in comparison with state-of-the-art methods. The proposed method achieves an improvement by 35% at most, which proves the effectiveness of the proposed method.

Key words: infrared small target detection, Transformer, multi-layers fusion, dual-direction attention operator, number supervision

CLC Number: