航空学报 > 2025, Vol. 46 Issue (2): 330406-330406   doi: 10.7527/S1000-6893.2024.30406

一种无人机视觉位姿估计的语义特征匹配算法

赵辰豪, 吴德伟(), 何晶, 吴倩   

  1. 空军工程大学 信息与导航学院,西安 710077
  • 收稿日期:2024-03-18 修回日期:2024-04-15 接受日期:2024-05-09 出版日期:2025-01-25 发布日期:2024-05-29
  • 通讯作者: 吴德伟 E-mail:wudewei74609@126.com
  • 基金资助:
    国家自然科学基金(62206310);陕西高校青年创新团队(2023015)

A semantic feature matching algorithm for UAV visual pose estimation

Chenhao ZHAO, Dewei WU(), Jing HE, Qian WU   

  1. School of Information and Navigation,Airforce Engineering University,Xi’an 710077,China
  • Received:2024-03-18 Revised:2024-04-15 Accepted:2024-05-09 Online:2025-01-25 Published:2024-05-29
  • Contact: Dewei WU E-mail:wudewei74609@126.com
  • Supported by:
    National Natural Science Foundation of China(62206310);Youth Innovation Team of Shaanxi Universities(2023015)

摘要:

传统视觉特征匹配算法提取的特征点易受光照、视角变化等外界因素的影响,复杂环境下算法匹配精度低、鲁棒性差,导致无人机视觉定位误差大。针对上述问题,提出了一种融合多类信息的语义特征匹配算法,该算法采用语义特征点替代传统特征点以提高特征点的鲁棒性与可重复提取率。首先,采用位置信息、语义信息以及描述信息建立特征点的初始表征;随后,利用具有注意力机制的图神经网络聚合其他语义特征点信息,建立最终表征,以优化表征结果的精确性;最后,根据语义信息与表征信息计算特征点对的综合匹配度,提高特征匹配的准确率。实验结果表明:在无人机飞行场景下,所提算法特征点的稳健性、特征匹配准确性均优于传统算法,且对光照、大视角等变化具有较强的鲁棒性,为后端无人机视觉导航位姿解算提供了更加可靠精准的语义特征匹配结果。

关键词: 路标检测, 语义特征提取, 语义特征匹配, 图神经网络, 注意力机制

Abstract:

The feature points extracted by traditional methods in UAV navigation are susceptible to external factors such as changes in lighting and viewing angles, and the low matching accuracy and poor robustness of the algorithm in complex environment lead to large errors in pose estimation. To address these issues, this paper proposes a semantic feature matching algorithm that integrates multiple types of information. The algorithm replaces traditional feature points with semantic feature points to enhance robustness and repeatability of point extraction. To optimize the accuracy of the description, we establish the initial representations of feature points via positional, semantic, and descriptive information. Subsequently, a graph neural network incorporating a fusion attention mechanism is used to aggregate the information from other semantic feature points, so as to create the final representation. Finally, the comprehensive matching degree of feature point pairs is calculated based on semantic and characterization information to enhance the accuracy of feature matching. Experimental results demonstrate that in UAV flight scenarios, the proposed algorithm exhibits better robustness and feature matching accuracy compared to traditional algorithms. Moreover, it shows strong robustness to changes in lighting and large viewing angles, providing more reliable and accurate landmark matching for back-end UAV visual pose estimation.

Key words: landmark detection, semantic feature extraction, semantic feature matching, graph neural network, attentional mechanism

中图分类号: