Adaptive hybrid dynamic programming for fault-tolerant attitude control of underactuated spacecraft

Qie LIU; Junying LU; Fei XIE; Wenbo LI

doi:10.7527/S1000-6893.2025.32531

Acta Aeronautica et Astronautica Sinica, 2026, Vol. 47, Issue 9: 532531

Special Issue on Safety Control Technologies for Advanced Flight Vehicles



1. College of Automation, Chongqing University, Chongqing 400044, China
2. Beijing Institute of Control Engineering, Beijing 100190, China

E-mail: liwenbo_bice@163.com

Received date: 2025-07-08

Revised date: 2025-08-14

Accepted date: 2025-10-27

Online published: 2025-10-30

Supported by

National Natural Science Foundation of China (62373068); Open Fund of the National Key Laboratory of Space Intelligent Control Technology (HTKJ2024KL502023); Chongqing Natural Science Foundation (CSTB2025NSCQ-LZX0013)

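Cite this article

Qie LIU, Junying LU, Fei XIE, Wenbo LI. Adaptive hybrid dynamic programming for fault-tolerant attitude control of underactuated spacecraft[J]. Acta Aeronautica et Astronautica Sinica, 2026, 47(9): 532531. DOI: 10.7527/S1000-6893.2025.32531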

Abstract

This paper addresses attitude control of underactuated spacecraft subject to actuator faults, treating tracking accuracy and control energy consumption as joint objectives, and proposes an optimal fault-tolerant control method based on Adaptive Hybrid Dynamic Programming (AHDP). First, under fault-free conditions, the controller design is formulated as an optimal control problem, and AHDP is used to construct an approximately optimal control policy that balances control energy consumption against tracking accuracy. To achieve fault tolerance under actuator faults, a scheme based on fault estimation and control compensation is established: adaptive fault observers estimate the multiplicative and additive actuator faults online, each according to its own characteristics, and a compensation control law then uses these estimates so that the actual actuator output under faults stays close to the fault-free control effect. By designing a Lyapunov function, closed-loop stability of the system under actuator faults is proven. Comparative simulations show that, relative to conventional fault-tolerant control methods, the proposed method achieves higher tracking accuracy and more accurate fault estimation.

Keywords: underactuated spacecraft; actuator faults; attitude control; Adaptive Hybrid Dynamic Programming (AHDP); fault-tolerant control; adaptive fault observer
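The estimate-then-compensate idea described in the abstract can be illustrated with a minimal sketch. The abstract gives no equations, so everything here is an assumption made for illustration: the per-axis fault model `tau = e * u + f` (a commonly used form with multiplicative effectiveness `e` and additive bias `f`), the function names `faulty_actuator` and `compensate`, the guard `e_min`, and all numeric values. It is not the authors' observer or control law, and the fault estimates are simply taken as given rather than produced by an adaptive observer.

```python
from typing import List, Sequence


def faulty_actuator(u_cmd: Sequence[float],
                    effectiveness: Sequence[float],
                    bias: Sequence[float]) -> List[float]:
    """Assumed fault model per axis: tau_i = e_i * u_i + f_i."""
    return [e * u + f for u, e, f in zip(u_cmd, effectiveness, bias)]


def compensate(u_nominal: Sequence[float],
               e_hat: Sequence[float],
               f_hat: Sequence[float],
               e_min: float = 0.05) -> List[float]:
    """Invert the assumed fault model using the fault estimates.

    e_min is a hypothetical lower bound that avoids dividing by a
    near-zero effectiveness estimate.
    """
    return [(u - f) / max(e, e_min)
            for u, e, f in zip(u_nominal, e_hat, f_hat)]


# Two control axes, as for an underactuated spacecraft (values are illustrative).
u_nominal = [0.10, -0.05]      # torque requested by the nominal controller
effectiveness = [0.6, 1.0]     # axis 1 has lost 40 % effectiveness
bias = [0.01, -0.02]           # additive fault on each axis

# With exact estimates, the compensated output reproduces the nominal command.
u_cmd = compensate(u_nominal, effectiveness, bias)
tau = faulty_actuator(u_cmd, effectiveness, bias)
```

With imperfect estimates the residual `tau - u_nominal` is nonzero, which is why the accuracy of the online fault estimation matters for the closed-loop tracking performance.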
