首页 >

基于时空频谱感知的无人机群电磁对抗智能决策

刘金杰1,李奎贤2,高琦凌1,林云2   

  1. 1. 哈尔滨工程大学信息与通信工程学院
    2. 哈尔滨工程大学
  • 收稿日期:2025-11-28 修回日期:2026-04-22 出版日期:2026-04-27 发布日期:2026-04-27
  • 通讯作者: 高琦凌

#br#

  • Received:2025-11-28 Revised:2026-04-22 Online:2026-04-27 Published:2026-04-27

摘要: 无人机集群面临通信受限与非完美信息的双重约束,实现高效协同对抗是亟待解决的关键挑战。针对星型拓扑约束及局部观测条件下异构无人机集群电磁对抗中的协同决策难题,本文提出一种基于时空频谱感知的无人机群电磁对抗智能决策方法以提升异构集群电磁对抗的协同效能。首先构建了融合异构节点角色差异、“通信-侦察-干扰”闭环交互机制与智能对手博弈的Dec-POMDP模型,其中多智能体输出包含通信、运动、侦察等维度的连续决策向量。进而提出时空频谱感知多智能体近端策略优化算法(STSA-MAPPO),该算法通过频谱感知与空间协同模块并行解耦高维观测中的频域特征与空域拓扑关系,利用串行交叉注意力机制分阶段聚合敌我交互信息,并引入GRU记忆单元建模时序依赖以推断全局态势并预测对手策略。仿真结果表明,所提算法在电磁优势度及频谱冲突率等关键指标上显著优于基线算法。

关键词: 异构无人机集群, 电磁对抗, 多智能体强化学习, 星型拓扑, 时空频谱感知

Abstract: UAV swarms face the dual constraints of communication limitations and imperfect information, making efficient collaborative countermeasures a critical challenge to be addressed. To tackle the collaborative decision-making problem in electromagnetic countermeasures for heterogeneous UAV swarms under star topology constraints and partial observation conditions, this paper proposes an intelligent decision-making method based on spatio-temporal spectrum awareness to enhance the collaborative effectiveness of heterogeneous swarms in electromagnetic confrontation. First, a Dec-POMDP model is constructed that integrates role differentiation of heterogeneous nodes, a closed-loop "communication-reconnaissance-jamming" interaction mechanism, and intelligent adversarial gaming, where multi-agent outputs contain continuous decision vectors across communication, motion, reconnaissance, and other dimensions. Subsequently, a Spatio-Temporal Spectrum Awareness Multi-Agent Proximal Policy Optimization (STSA-MAPPO) algorithm is proposed, which decouples frequency-domain features and spatial topological relationships from high-dimensional observations through parallel spectrum awareness and spatial coordination modules, aggregates friend-enemy interaction information in stages via serial cross-attention mechanisms, and incorporates GRU memory units to model temporal dependencies for inferring global situations and predicting adversarial strategies. Simulation results demonstrate that the proposed algorithm significantly outperforms baseline algorithms in key metrics including electromagnetic advantage degree and spectrum conflict rate.

Key words: heterogeneous UAV swarm, electronic countermeasures, multi-agent reinforcement learning, star topology, spatio-temporal spectrum awareness