基于自适应混合动态规划的欠驱动航天器姿态容错控制(先进飞行器安全控制专栏)

  • 刘切 ,
  • 卢俊颖 ,
  • 谢非 ,
  • 李文博
展开
  • 1. 重庆大学
    2. 重庆大学A区
    3. 北京控制工程研究所

收稿日期: 2025-07-08

  修回日期: 2025-10-28

  网络出版日期: 2025-10-30

基金资助

国家自然科学基金;空间智能控制技术重点实验室

Adaptive Hybrid Dynamic Programming for Fault Tolerant Attitude Control of Underactuated Spacecraft

  • LIU Qie ,
  • LU Jun-Ying ,
  • XIE Fei ,
  • LI Wen-Bo
Expand

Received date: 2025-07-08

  Revised date: 2025-10-28

  Online published: 2025-10-30

Supported by

National Natural Science Foundation of China

摘要

针对欠驱动航天器在执行机构故障下的姿态跟踪控制问题,同时考虑跟踪精度与控制能耗两个目标, 提出一种基于自适应混合动态规划的最优容错控制方法。该方法首先在无执行器故障条件下将控制器设计转化成最优控制问题,并利用自适应混合动态规划方法构建近似最优控制策略,实现控制能耗和跟踪精度之间的平衡。为实现执行器故障下的容错控制,建立了基于故障估计与控制补偿的容错方案,针对执行器乘性故障和加性故障的不同特性,基于自适应故障观测器构建在线估计技术,然后通过补偿控制策略,使执行器故障下的实际输出接近无故障控制效果,达到容错控制目的。对上述控制系统,通过设计李雅普诺函数,证明了系统在执行器故障下的闭环稳定性。仿真对比实验表明,本文方法相较于传统容错控制方法,实现了更高的跟踪精度和故障估计准确度。

本文引用格式

刘切 , 卢俊颖 , 谢非 , 李文博 . 基于自适应混合动态规划的欠驱动航天器姿态容错控制(先进飞行器安全控制专栏)[J]. 航空学报, 0 : 1 -0 . DOI: 10.7527/S1000-6893.2025.32531

Abstract

Aiming at the attitude tracking control of underactuated spacecraft subject to actuator faults, with concurrent objectives of tracking accuracy and control energy consumption, this paper proposes an optimal fault tolerant control scheme based on adaptive hybrid dynamic programming (AHDP). First, under fault free conditions the control design is cast as an opti-mal control problem, and AHDP is employed to derive an approximate optimal control law that balances energy expendi-ture against tracking performance. To enable fault tolerance, an online compensation framework is developed using adaptive fault observers to separately estimate multiplicative and additive fault parameters; these estimates are then in-corporated into a compensation control law so that the faulty actuator output closely approximates nominal behavior. A Lyapunov based stability analysis proves closed loop robustness of the tracking error in the presence of actuator faults. Comparative simulations demonstrate that, compared with conventional fault tolerant methods, the proposed AHDP based controller achieves improved attitude tracking accuracy and higher fault estimation fidelity.

参考文献

[1]黄煦, 王健, 龚秋武.椭圆轨道欠驱动编队重构最优脉冲控制[J].宇航学报, 2021, 42(05):591-602
[2]SODERLUND A A, PHILLIPS S.Hybrid systems approach to autonomous rendezvous and docking of an underactuated satellite[J].Journal of Guidance, Control, and Dynamics, 2023, 46(10):1901-1918
[3]孙建业, 叶东, 肖岩.基于状态增益可达集与快速模型预测控制的非合作航天器主动观测安全轨迹规划研究[J/OL]. 航空学报, 1-17[2025-07-05].
[4]孟中杰, 卢俊杰.欠驱动航天器模糊自适应增强耦合姿态控制[J].中国空间科学技术, 2024, 44(04):11-19
[5]喻乐言, 曾喆昭, 唐钰淇.欠驱动飞行器横侧向通道的自耦控制方法[J].空间控制技术与应用, 2024, 50(03):60-67
[6]曾喆昭, 章禛昊.欠驱动飞行器的自耦控制方法[J].空间控制技术与应用, 2023, 49(06):38-46
[7] 杨铭宣, 王保防, 蔡明洁, 等.欠驱动无人船预设性能轨迹跟踪控制[J/OL]. 控制理论与应用, 1-9[2025-07-05].
[8]王晓芳, 许家萁.基于预设性能的欠驱动飞行器横侧向通道控制方法[J].兵工学报, 2024, 45(08):2749-2760
[9]王健, 胡震, 张华.考虑海流及航速影响的欠驱动无人潜航器自适应路径跟踪控制方法[J].中国造船, 2024, 65(06):54-65
[10]胡庆雷, 邵小东, 杨昊旸, 等.航天器多约束姿态规划与控制: 进展与展望[J].航空学报, 2022, 43(10):403-431
[11]张猛, 王利楠, 郑德智, 等.航天器姿态系统的分数阶滑模容错控制器设计[J/OL]. 电子与信息学报, 1-11[2025-07-05].
[12]秦海斌, 张艳红, 梁祯, 等.航天器姿态容错控制技术研究现状与发展[J].遥测遥控, 2022, 43(06):47-55
[13]沈强, 赵泽阳, 康泽禹, 等.航天器模型预测控制方法综述[J].南京航空航天大学学报, 2023, 55(06):941-955
[14]张秀云, 冷嘉俊, 刘文静, 等.基于鲁棒对抗强化学习的航天器智能容错控制[J].宇航学报, 2025, 46(02):244-252
[15]肖冰, 张海朝.航天器姿态稳定强化学习鲁棒最优控制方法[J].航空学报, 2024, 45(01):58-72
[16]梁小辉, 胡昌华, 周志杰, 等.基于自适应动态规划的运载火箭智能姿态容错控制[J].航空学报, 2021, 42(04):511-524
[17]BIAN T, JIANG Z P.Value iteration, adaptive dynam-ic programming, and optimal control of nonlinear sys-tems[C]//2016 IEEE 55th Conference on Decision and Control (CDC). IEEE, 2016: 3375-3380.
[18]LIU D, WEI Q.Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems[J].IEEE Transactions on Neural Networks and Learning Systems, 2013, 25(3):621-634
[19]ZHAO B, LIU D, LI Y.Observer based adaptive dy-namic programming for fault tolerant control of a class of nonlinear systems[J].Information Sciences, 2017, 384(—):21-33
[20]ZHANG Y W, ZHAO B, LIU D R, et al.Distributed fault tolerant consensus control of nonlinear multia-gent systems via adaptive dynamic programming[J].IEEE Transactions on Neural Networks and Learning Systems, 2024, 35(7):9041-9053
[21]RAHIMI F.Adaptive dynamic programming-based fault tolerant control for nonlinear time-delay systems[J].Chaos, Solitons &?Fractals, 2024, 188(—):115544-115544
[22]王明泽, 戈新生.欠驱动航天器姿态最优控制的近似动态规划方法[J].北京信息科技大学学报自然科学版, 2017, 32(04):10-17
[23]WEI Q, LEWIS F L, LIU D, et al.Discrete-time local value iteration adaptive dynamic programming: Con-vergence analysis[J].IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2016, 48(6):875-891
[24]ZHANG H, LUO Y, LIU D, et al.Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints[J].IEEE Transactions on Neural Networks, 2009, 20(9):1490-1503
[25]ZHANG H, LUO Y, LIU D, et al.Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints[J].IEEE Transactions on Neural Networks, 2009, 20(9):1490-1503
[26]ELFWING S, UCHIBE E, DOYA K.Sigmoid-weighted linear units for neural network function approximation in reinforcement learning[J]. Neural Networks, 2018, 107: 3-11.[J].Neural Networks, 2018, 107(—):3-11
[27]LIAN C, XU X, CHEN H, et al.Near-optimal tracking control of mobile robots via receding-horizon dual heuristic programming[J].IEEE Transactions on Cybernetics, 2015, 46(11):2484-2496
文章导航

/