导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (21): 532355.doi: 10.7527/S1000-6893.2025.32355

• Special Issue: 60th Anniversary of Aircraft Strength Research Institute of China • Previous Articles    

Transformer-based intelligent tracking method of aviation structure surface cracks

Jiaxin LI1,2(), Shuaishuai LYU1,2, Yezi WANG1,2, Yu YANG1,2, Ziyue LI1   

  1. 1.Aircraft Strength Research Institute of China,Xi’an 710065,China
    2.National Key Laboratory of Strength and Structural Integrity,Xi’an 710065,China
  • Received:2025-06-03 Revised:2025-07-03 Accepted:2025-08-11 Online:2025-08-29 Published:2025-08-28
  • Contact: Jiaxin LI E-mail:1056900948@qq.com
  • Supported by:
    National Level Project

Abstract:

Semantic segmentation models based on deep convolutional networks have shown good performance in structural damage detection. However, when it comes to aircraft structural damage detection, cracks usually occupy a small proportion of the image, and the multi-layer convolution and pooling operations can lead to the loss of crack information, thereby seriously reducing the segmentation accuracy. Consequently, this research is conducted on Transformer-based semantic segmentation models, and de-signs a Transformer-based Model for Intelligent Crack Tracking (TICT) for aeronautical structural surface damage detection, aiming to achieve precise segmentation and intelligent tracking of cracks. To start with, an adaptive dynamic patch partitioning mechanism is employed to divide the image into patches of different sizes with varying degrees of overlap. Next, these patches are fed into a Transformer-based encoder to extract multi-scale features containing both the contextual and local details of the crack image. Then, a lightweight multi-layer perceptron along with attention modules is utilized as a decoder to generate a crack mask image. After that, morphological operations are performed on the mask image to correct the connected regions of cracks and map them back to the original image, thus obtaining the exact crack areas. By repeating the aforementioned procedure on the images collected in real time during fatigue tests, automated and continuous tracking of cracks can be realized. The TICT model is trained and tested on datasets of fatigue test images of metal components and entire aircraft. It achieves an Mean Intersection Over Union (mIoU) of 78.31% on the test sets of crack image of various metal components and full-scale aircraft structures, which demonstrates that the TICT model can accurately segment surface cracks in aviation structures with various structural configurations, complex backgrounds, and tiny features, exhibiting good generalization and robustness.

Key words: crack tracking, Transformer, computer vision, semantic segmentation, structural health monitoring

CLC Number: