导航

Acta Aeronautica et Astronautica Sinica ›› 2025, Vol. 46 ›› Issue (23): 631694.doi: 10.7527/S1000-6893.2025.31694

• special column • Previous Articles    

Generalized few-shot segmentation for remote sensing image based on class relation mining

Runmin CONG1,2, Haoyan SUN1,2, Yuxuan LUO3(), Hao FANG1,2   

  1. 1.School of Control Science and Engineering,Shandong University,Jinan 250061,China
    2.Key Laboratory of Machine Intelligence and System Control,Ministry of Education,Jinan 250061,China
    3.Department of Computer Science,City University of Hong Kong,Hong Kong 999077,China
  • Received:2024-12-19 Revised:2025-03-04 Accepted:2025-04-23 Online:2025-05-13 Published:2025-05-06
  • Contact: Yuxuan LUO E-mail:yuxuanluo4-c@my.cityu.edu.hk
  • Supported by:
    Taishan Scholar Project of Shandong Province(tsqn202306079);National Natural Science Foundation of China(62471278)

Abstract:

With the rapid development of remote sensing satellites and unmanned aerial vehicle technologies, the acquisition of remote sensing images has become more convenient, making precise interpretation of image content a research hotspot. However, existing remote sensing image semantic segmentation methods struggle to handle new object categories that emerge due to scene changes when only a small number of samples are available, thereby affecting the overall performance of the model. To address the potential issues of new classes and new scenes in remote sensing scenarios, this paper introduces the generalized few-shot segmentation technique, aiming to rapidly adapt to new classes using a few samples and efficiently complete the semantic segmentation task for remote sensing images. Firstly, to tackle the challenge of distinguishing objects of different categories and different sizes in remote sensing images, a multi-scale feature fusion decoder is introduced to enhance the model’s segmentation ability for targets of different scales. Then, to address the background semantic shift problem in the generalized few-shot segmentation task, a background consistency modeling training strategy is designed to ensure the accurate expression of background features by the model. Furthermore, this paper proposes an inter-class relationship mining technique, which utilizes prototypes generated from a large amount of base class data to enhance the prototypes generated from a small amount of new class data, enabling them to acquire beneficial information from the base classes and achieve more precise segmentation. Finally, comparative experiments with mainstream methods are conducted on public datasets, and the mIoU segmentation metric is improved by up to 11.48%, demonstrating the effectiveness of the proposed algorithm.

Key words: remote sensing, semantic segmentation, generalized few-shot, knowledge distillation, relationship mining

CLC Number: