High-precision monocular vision pose measurement for large distance span in carrier landing guidance

doi:10.7527/S1000-6893.2025.31568

Electronics and Electrical Engineering and Control

Lin CHEN^1,2, Xiwen GU^3, Zhiying CHEN^1,2, Zhuo ZHANG^1,2, Xiaoliang SUN^1,2

^1. College of Aerospace Science and Engineering, National University of Defense Technology, Changsha 410073, China
^2. Hunan Province Key Laboratory of Image Measurement and Vision Navigation, National University of Defense Technology, Changsha 410073, China
^3. 91351 Troops, Xingcheng 125106, China

Received: 2024-11-25  Revised: 2024-12-17  Accepted: 2025-01-20  Online: 2025-02-24  Published: 2025-02-21
Corresponding author: Xiaoliang SUN  E-mail: alexander_sxl@nudt.edu.cn
Supported by: National Natural Science Foundation of China (12272404)

Abstract

Autonomous carrier landing guidance covers a large range of distances, so the scale of the ship target changes greatly across the image sequence captured by the airborne monocular camera. Existing pose measurement methods struggle to deliver high-precision monocular pose measurement over this entire range. Starting from existing methods based on sparse keypoint sets, this paper focuses on keypoint detection accuracy and analyzes how target size and network input size affect it. Balancing accuracy against efficiency, it then proposes a new multi-component monocular vision pose measurement method. Each component is represented in simplified form by a sparse keypoint set. A coarse pose is first estimated from the whole-ship component; on that basis, a path aggregation feature pyramid network and a hierarchical encoding module are introduced to detect the keypoints of local components with high precision. The high-precision keypoint detections of all components are then combined, and the Perspective-n-Point (PnP) problem is solved to obtain robust, high-precision pose measurements across the large distance span of landing guidance. Simulation experiments and scaled physical experiments show that the proposed method achieves robust, high-precision monocular pose measurement over this span and outperforms existing methods, with an average single-frame inference time of about 40 ms on an embedded platform.

Key words: monocular vision, carrier landing guidance, pose measurement, deep learning, keypoint detection
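
The coarse-to-fine step described in the abstract, where a coarse whole-ship pose guides where local components are examined, can be illustrated with a minimal Python sketch. It assumes an ideal pinhole camera without lens distortion. The function names `project_point` and `component_roi`, the intrinsics, the pose, and the component keypoints are all hypothetical values chosen for illustration; none of them comes from the paper, and this is not the authors' implementation.

```python
def project_point(R, t, K, X):
    """Project a 3D point X (target frame) to pixel coordinates.

    R: 3x3 rotation (list of rows), t: translation (3 values),
    K: intrinsics as (fx, fy, cx, cy). Pinhole model, no distortion (assumption).
    """
    # Camera-frame coordinates: Xc = R * X + t
    xc = [sum(R[i][j] * X[j] for j in range(3)) + t[i] for i in range(3)]
    fx, fy, cx, cy = K
    return (fx * xc[0] / xc[2] + cx, fy * xc[1] / xc[2] + cy)


def component_roi(R, t, K, keypoints_3d, margin=0.2):
    """Box (x_min, y_min, x_max, y_max) around a component's projected keypoints.

    The box is enlarged by `margin` times its width/height on every side so that
    errors in the coarse pose do not push the component outside the crop.
    """
    pts = [project_point(R, t, K, X) for X in keypoints_3d]
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    w, h = max(xs) - min(xs), max(ys) - min(ys)
    return (min(xs) - margin * w, min(ys) - margin * h,
            max(xs) + margin * w, max(ys) + margin * h)


# Hypothetical coarse pose: camera looking straight at the target from 500 m.
R_coarse = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t_coarse = [0.0, 0.0, 500.0]
K = (2000.0, 2000.0, 960.0, 540.0)  # made-up intrinsics (pixels)

# Hypothetical component described by four sparse keypoints (metres).
component = [(-10.0, -5.0, 0.0), (10.0, -5.0, 0.0),
             (10.0, 5.0, 0.0), (-10.0, 5.0, 0.0)]

roi = component_roi(R_coarse, t_coarse, K, component)
print(roi)
```

With these made-up numbers, a 20 m by 10 m component seen from 500 m projects to an 80 by 40 pixel box, which the margin enlarges before a crop would be handed to a keypoint detector. How the authors actually select and scale component regions is not stated in this abstract.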

CLC number: V249.32

Cite this article: Lin CHEN, Xiwen GU, Zhiying CHEN, Zhuo ZHANG, Xiaoliang SUN. High-precision monocular vision pose measurement for large distance span in carrier landing guidance[J]. Acta Aeronautica et Astronautica Sinica, 2025, 46(15): 331568.

Figures/Tables: 23 (Figures 1 to 16 and Tables 1 to 7)
Table 5  Pose estimation errors in the scaled physical experiment

Method	e(x) / pixel	e(R) / (°)	e(t) / %
PP-TinyPose	1.21	0.43	0.71
ViTPose	0.59	0.29	0.40
RTMPose	0.72	0.30	0.47
Proposed method	0.44	0.27	0.34
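
The page does not define the three error measures e(x), e(R), and e(t). One common reading, assumed here and not confirmed by the source, is: mean pixel distance between detected and reference keypoints, the angle of the relative rotation between estimated and reference attitude, and translation error as a percentage of the true range. The Python sketch below implements that reading; the function names and the sample values are hypothetical.

```python
import math


def mean_pixel_error(pts_est, pts_ref):
    """Assumed e(x): mean Euclidean distance between keypoint pairs, in pixels."""
    return sum(math.dist(p, q) for p, q in zip(pts_est, pts_ref)) / len(pts_ref)


def rotation_error_deg(R_est, R_ref):
    """Assumed e(R): angle of the relative rotation R_est^T * R_ref, in degrees."""
    # trace(R_est^T * R_ref) equals the element-wise product sum of the matrices.
    trace = sum(R_est[i][j] * R_ref[i][j] for i in range(3) for j in range(3))
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))  # clamp rounding noise
    return math.degrees(math.acos(cos_angle))


def translation_error_pct(t_est, t_ref):
    """Assumed e(t): translation error relative to the true range, in percent."""
    return 100.0 * math.dist(t_est, t_ref) / math.hypot(*t_ref)


def rot_z(deg):
    """Rotation about the z axis, used only to build a sample input."""
    a = math.radians(deg)
    return [[math.cos(a), -math.sin(a), 0.0],
            [math.sin(a), math.cos(a), 0.0],
            [0.0, 0.0, 1.0]]


identity = rot_z(0.0)
print(rotation_error_deg(rot_z(0.3), identity))
print(translation_error_pct((3.0, 0.0, 1004.0), (0.0, 0.0, 1000.0)))
print(mean_pixel_error([(3.0, 4.0), (10.0, 10.0)], [(0.0, 0.0), (10.0, 10.0)]))
```

Under these assumed definitions, an e(t) of 0.34% would correspond to about 3.4 m of translation error at a range of 1 km.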




Method	Input size	Frame rate / (frames·s^-1)	FLOPs / G
RTMPose	256×256	95.39	5.508
RTMPose	384×384	83.25	12.385
RTMPose	512×512	70.05	22.012
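
The FLOPs column can be checked against a simple rule of thumb: for a network built mostly from convolutions, the operation count grows roughly with the number of input pixels, that is, with the square of the side length. The short Python sketch below applies that assumption to the 256×256 row and compares the result with the reported values. The helper name `predicted_gflops` is hypothetical, and the quadratic rule is an approximation rather than something stated in the source.

```python
# Reported values from the table: input side length -> FLOPs in G.
reported = {256: 5.508, 384: 12.385, 512: 22.012}

BASE_SIZE = 256
BASE_GFLOPS = reported[BASE_SIZE]


def predicted_gflops(size):
    """Scale the 256x256 cost by the ratio of input areas (assumed rule)."""
    return BASE_GFLOPS * (size / BASE_SIZE) ** 2


for size, value in reported.items():
    pred = predicted_gflops(size)
    deviation = 100.0 * (pred - value) / value
    print(f"{size}x{size}: predicted {pred:.3f} G, reported {value:.3f} G, "
          f"deviation {deviation:+.2f}%")
```

The predictions (12.393 G and 22.032 G) agree with the reported figures to within about 0.1%. The frame rate falls much more slowly than the cost rises: quadrupling the FLOPs from 256×256 to 512×512 lowers the frame rate by only about 27%.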

Parameter	Pitch / (°)	Yaw / (°)	Roll / (°)	X / m	Y / m	Z / m
Period / s	6	5	3	5	4	4
Initial value	11	0	0	0	-2600	218.7
Disturbance amplitude	1	3	5	0~5	0~3	0~0.5
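
A period and an amplitude per axis suggest a periodic disturbance added to each initial value. The waveform is not given on this page; the Python sketch below assumes a plain sinusoid purely for illustration, and the function name `disturbed_value` is hypothetical. Only the pitch column (initial value 11°, amplitude 1°, period 6 s) is evaluated, because the translation axes list amplitude ranges rather than single values.

```python
import math


def disturbed_value(t, initial, amplitude, period):
    """Value at time t (seconds) for an assumed sinusoidal disturbance."""
    return initial + amplitude * math.sin(2.0 * math.pi * t / period)


# Pitch column of the table: initial 11 deg, amplitude 1 deg, period 6 s.
for t in (0.0, 1.5, 3.0, 4.5, 6.0):
    print(f"t = {t:3.1f} s  pitch = {disturbed_value(t, 11.0, 1.0, 6.0):.3f} deg")
```

Under this assumption the pitch swings between 10° and 12° once every 6 s. A different waveform or phase would change the trajectory but not these bounds.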

Mean error (pixels) by distance range
Method	>2.0 km	1.0~2.0 km	0.5~1.0 km	0.2~0.5 km	<0.2 km
PP-TinyPose	0.55	0.80	1.56	2.83	4.69
ViTPose	0.12	0.19	0.49	0.61	1.30
RTMPose	0.16	0.24	0.43	0.55	1.21
Proposed method	0.07	0.12	0.16	0.29	0.54
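
To put the table in relative terms, the Python snippet below computes how much lower the mean error in the last row is than the RTMPose row in each distance range. The numbers are copied from the table; the helper name `reduction_pct` is hypothetical.

```python
ranges = [">2.0 km", "1.0~2.0 km", "0.5~1.0 km", "0.2~0.5 km", "<0.2 km"]
rtmpose = [0.16, 0.24, 0.43, 0.55, 1.21]   # RTMPose row, pixels
proposed = [0.07, 0.12, 0.16, 0.29, 0.54]  # last row of the table, pixels


def reduction_pct(baseline, value):
    """Relative error reduction with respect to the baseline, in percent."""
    return 100.0 * (baseline - value) / baseline


reductions = [reduction_pct(b, p) for b, p in zip(rtmpose, proposed)]
for name, r in zip(ranges, reductions):
    print(f"{name:>11}: {r:.1f}% lower than RTMPose")
```

The reduction relative to RTMPose lies between roughly 47% and 63% in every range. The absolute gain is largest below 0.2 km, where the error drops from 1.21 to 0.54 pixels.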

Mean error (pixels) by distance range
Method	>2.0 km	1.0~2.0 km	0.5~1.0 km	0.2~0.5 km	<0.2 km
Coarse-stage result	0.16	0.25	0.42	0.55	1.10
RTMPose	0.10	0.14	0.28	0.35	0.59
+PAFPN	0.07	0.12	0.18	0.33	0.57
+HEM	0.08	0.13	0.17	0.30	0.54
+PAFPN+HEM	0.07	0.10	0.16	0.29	0.54

Parameter	Value
CPU	8-core Carmel ARMv8.2 64-bit processor
GPU	512-core Volta architecture
Memory	32 GB
Storage	32 GB
Peak compute	32 TOPS


Method	Object detection time / ms	Keypoint detection time / ms	Pose estimation time / ms	Total time / ms
RTMPose	9.2	20.8	1.8	31.8
Proposed method	9.2	28.9	1.9	40.0
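
The totals in the timing table follow directly from the three stage times. The Python snippet below re-adds them and converts each total to an equivalent frame rate, assuming the stages run one after another with no pipelining or overlap (an assumption, since the execution scheme is not described here). The dictionary layout and function names are hypothetical.

```python
# Per-stage times in milliseconds, copied from the table.
stages = {
    "RTMPose": {"detection": 9.2, "keypoints": 20.8, "pose": 1.8},
    "Proposed method": {"detection": 9.2, "keypoints": 28.9, "pose": 1.9},
}


def total_ms(method):
    """Sum of the stage times, rounded to the table's 0.1 ms resolution."""
    return round(sum(stages[method].values()), 1)


def frames_per_second(method):
    """Equivalent frame rate for strictly sequential stages (assumption)."""
    return 1000.0 / total_ms(method)


for method in stages:
    print(f"{method}: {total_ms(method)} ms per frame, "
          f"{frames_per_second(method):.1f} frames/s")
```

The proposed method costs 8.2 ms more per frame than RTMPose, of which 8.1 ms comes from keypoint detection. Its 40.0 ms total corresponds to 25 frames per second and matches the inference time quoted in the abstract.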