Acta Aeronautica et Astronautica Sinica
Abstract: To address the integrated penetration/strike design problem in the terminal guidance phase of hypersonic reentry glide missiles under constraints such as the detection field-of-view angle, this paper proposes a Lagrange-Proximal Policy Optimization intelligent penetration decision algorithm based on the Constrained Markov Decision Process, together with an adaptive training method for multiple scenarios. The glide missile is assumed to strike the target using a biased proportional guidance law in the terminal guidance phase. The state space consists of the relative motion states between the interceptor and the glide missile and between the glide missile and the target, and the action space is the rate of change of the bias acceleration. The reward function jointly accounts for the penetration/strike outcome of the glide missile, the control energy consumption, the degree of constraint satisfaction, and the velocity vector lead angle of the interceptor. A constraint cost function on the field-of-view angle is constructed, which establishes the Constrained Markov Decision Process for the penetration/strike problem. The constraints are introduced into the loss function of the policy network through a Lagrange multiplier, and a constraint-cost Critic network is added to construct the penetration network. The network is trained with the Proximal Policy Optimization algorithm to obtain the bias acceleration. Rules for classifying the complexity of combat scenarios are established, and an adaptive scenario sampling training method, characterized as "progressive learning in the early stage and more learning on difficult points in the later stage", is proposed to improve the convergence speed of the penetration strategy and its generalization across combat scenarios. Simulation results show that the integrated intelligent penetration/strike strategy enables the glide missile to penetrate successfully and strike the target at the specified impact angle while satisfying the field-of-view angle constraint throughout the engagement, and that it generalizes well.
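The two mechanisms named in the abstract, the Lagrange-multiplier policy loss and the two-stage scenario sampling, can be illustrated with a minimal Python sketch. The abstract does not give the actual loss, multiplier update rule, or sampling rule, so everything below is an assumption: the function names, the clip range, the learning rate, the division by (1 + lam), and the linear blend between "easy first" and "hard first" weights are all hypothetical choices in the style of common PPO-Lagrangian implementations, not the authors' formulation.

```python
import random

def clipped_surrogate(ratio, advantage, clip_eps=0.2):
    """Standard PPO clipped surrogate for a single sample."""
    clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)

def lagrangian_policy_loss(ratios, reward_advs, cost_advs, lam, clip_eps=0.2):
    """Policy loss with the constraint cost weighted by the multiplier lam.

    reward_advs come from the reward Critic, cost_advs from the
    constraint-cost Critic (here: the field-of-view cost). Assumed form.
    """
    n = len(ratios)
    reward_term = sum(clipped_surrogate(r, a, clip_eps)
                      for r, a in zip(ratios, reward_advs)) / n
    cost_term = sum(r * c for r, c in zip(ratios, cost_advs)) / n
    # Dividing by (1 + lam) keeps the loss scale bounded as lam grows
    # (an assumed normalization, not taken from the paper).
    return -(reward_term - lam * cost_term) / (1.0 + lam)

def update_multiplier(lam, mean_episode_cost, cost_limit, lr=0.05):
    """Projected gradient ascent on the multiplier, keeping lam >= 0."""
    return max(0.0, lam + lr * (mean_episode_cost - cost_limit))

def scenario_sampling_weights(difficulties, success_rates, progress, eps=0.05):
    """Sampling probabilities over combat scenarios.

    difficulties, success_rates, progress are all in [0, 1] (hypothetical
    scales). Early in training (progress near 0) easy scenarios are
    favored; late in training (progress near 1) scenarios with a low
    success rate are favored.
    """
    progress = min(1.0, max(0.0, progress))
    raw = [(1.0 - progress) * (1.0 - d) + progress * (1.0 - s) + eps
           for d, s in zip(difficulties, success_rates)]
    total = sum(raw)
    return [w / total for w in raw]

def sample_scenario(weights, rng=random):
    """Draw one scenario index according to the sampling probabilities."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]
```

In a training loop under these assumptions, `update_multiplier` would be called once per batch with the measured mean constraint cost, and `scenario_sampling_weights` would be recomputed as per-scenario success rates are refreshed.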
Key words: hypersonic glide missile, integrated penetration and strike, field of view angle constraint, Constrained Markov Decision Process, scene adaptive sampling
CLC Number: V249.1
URL: https://hkxb.buaa.edu.cn/EN/10.7527/S1000-6893.2026.33053