首页 >

面向局部观测与通信受限下的察打一体自主决策方法(智能高速飞行器前沿技术专刊)

张栋1,傅晋博2,王孟阳3,沈潼2   

  1. 1. 西北工业大学航天学院
    2. 西北工业大学
    3. 西安电子科技大学
  • 收稿日期:2026-01-15 修回日期:2026-05-07 出版日期:2026-05-08 发布日期:2026-05-08
  • 通讯作者: 张栋
  • 基金资助:
    自然科学基金;2025年度空基信息感知与融合全国重点实验室开放课题基金

Autonomous decision-making method for integrated reconnaissance and strike operations under local observation and limited communication

  • Received:2026-01-15 Revised:2026-05-07 Online:2026-05-08 Published:2026-05-08
  • Contact: ZHANG Dong

摘要: 针对强对抗环境下通信网络动态碎片化与战场实体时变导致的无人机集群协同决策不连续及维度失配难题,提出一种异构图时空推理决策方法(Heterogeneous Graph Spatio-Temporal Reasoning, HG-STR)。首先,构建以单机为中心的局部动态异构图,利用元关系驱动的异构图Transformer提取无人机、动态目标与搜索区域间的语义拓扑特征,并通过门控循环单元构建时序记忆以补偿局部观测中断带来的决策震荡;其次,引入可学习的注意力通信机制,在物理链路受限及网络拓扑频繁割裂条件下实现关键协同信息的自适应筛选与高置信度聚合;最后,建立“上层战术博弈—下层指令执行”的分层架构,设计指针式多头策略网络,在统一框架内解决变长对象指派与资源量化分配的联合决策问题。构建了多区域察打任务的典型场景,仿真实验表明,相比传统规则算法任务完成率提升了37.14%;相比全局优化算法,单步决策耗时从秒级降低至毫秒级;且在通信半径极度受限的弱连通条件下仍能保持94%的任务成功率。

关键词: 无人机集群, 异构图注意力网络, 多智能体强化学习, 分层决策, 分布式协同, 察打协同

Abstract: To address the challenges of discontinuous collaborative decision-making and dimensional mismatch in UAV swarms caused by the dynamic fragmentation of communication networks and the time-varying nature of battlefield entities in highly adversarial environments, a heterogeneous graph spatio-temporal reasoning (HG-STR) method is proposed. First, a local dynamic heterogeneous graph centered on individual UAVs is constructed. A meta-relation-driven heterogeneous graph Transformer is used to extract semantic topological features between UAVs, dynamic targets, and the search area. Temporal memory is constructed using gated recurrent units to compensate for decision-making oscillations caused by local observation interruptions. Second, a learnable attention communication mechanism is introduced to achieve adaptive filtering and high-confidence aggregation of key collaborative information under conditions of limited physical links and frequent network topology fragmentation. Finally, a hierarchical architecture of "upper-level tactical game—lower-level command execution" is established, and a pointer-based multi-head policy network is designed to solve the joint decision-making problem of variable-length object assignment and resource quantification allocation within a unified framework. A typical scenario for multi-area reconnaissance and strike missions was constructed. Simulation experiments show that the task completion rate is improved by 37.14% compared to traditional rule-based algorithms; compared to global optimization algorithms, the single-step decision-making time is reduced from seconds to milliseconds; and a 94% task success rate is maintained even under weak connectivity conditions with extremely limited communication radius.

Key words: UAV swarm, heterogeneous graph transformer, multi-agent reinforcement learning, hierarchical decision-making, distributed collaboration, reconnaissance and strike collaboration

中图分类号: