导航

Acta Aeronautica et Astronautica Sinica ›› 2024, Vol. 45 ›› Issue (20): 630629.doi: 10.7527/S1000-6893.2024.30629

• Aeronautics Computing and Simulation Technique • Previous Articles    

Autonomous evasive maneuver method for unmanned combat aerial vehicle in air combat with multiple tactical requirements

Zhen YANG1(), Lin LI1,2, Shiyuan CHAI1, Jichuan HUANG1,3, Haiyin PIAO4, Deyun ZHOU1   

  1. 1.School of Electronics and Information,Northwestern Polytechnical University,Xi’an 710072,China
    2.AVIC Shenyang Aircraft Design and Research Institute,Shenyang 110035,China
    3.The 93147th Unit of Chinese PLA,Chengdu 610091,China
    4.School of Artificial Intelligence,Jilin University,Changchun 130012,China
  • Received:2024-04-30 Revised:2024-05-31 Accepted:2024-06-28 Online:2024-07-17 Published:2024-07-12
  • Contact: Zhen YANG E-mail:yz@nwpu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62006193);Key Research and Development Program of Shaanxi Province(2024GX-YBXM-115);Aeronautical Science Foundation of China(2022Z023053001);Fundamental Research Funds for the Central Universities(D5000230150)

Abstract:

Air combat is usually a continuous process involving multiple rounds of missile confrontation. Unmanned Combat Aerial Vehicle (UCAV) should comprehensively consider the impact of maneuvering on the entire air combat mission in the process of evading incoming air-to-air missiles, instead of focusing only on safety factors. In this paper, a UCAV autonomous evasive maneuver method is proposed under the condition of multi-tactical requirements such as miss distance, energy consumption and terminal superiority. A three-dimensional pursuit and escape model of UCAV-missile and a model for the state space, action space and reward function of UCAV autonomous evasion are established. An algorithm based on the LSTM-Dueling DDQN (Long Short-Term Memory-Dueling Double Deep-Q Network) is proposed for this model. The algorithm fuses Double DQN and Dueling DQN network models, and uses LSTM network to extract timing features. Based on the concept of exploratory course learning, temporal fusion of dense and sparse reward functions is carried out to promote joint guidance of artificial experience and strategy exploration in the process of maneuver learning. The Chebyshev method is introduced to solve the Pareto solution set for different degree of tactical demands, so as to reflect the contradiction and coupling of multiple tactical requirements. Simulation experiments and result analysis show that the proposed method has good convergence speed and learning effect, and is feasible and effective to solve the problem of autonomous evasive maneuver in air combat under multiple tactical requirements. The obtained evasive maneuvers can reflect different evasive tactical requirements while ensuring UCAV’s own safety.

Key words: air combat maneuver, autonomous evasion, tactical requirement, Unmanned Combat Aerial Vehicle (UCAV), LSTM-Dueling DDQN

CLC Number: