针对集群攻击的飞行器智能协同拦截策略

doi:10.7527/S1000-6893.2023.28301

电子电气工程与控制

本期目录 | 过刊浏览 | 高级检索

前一篇 | 后一篇

针对集群攻击的飞行器智能协同拦截策略

高树一¹, 林德福¹, 郑多¹(), 胡馨予²

^1.北京理工大学宇航学院，北京 100081
^2.北京理工大学徐特立学院，北京 100081

收稿日期:2022-11-23 修回日期:2022-12-20 接受日期:2023-02-22 出版日期:2023-09-25 发布日期:2023-03-03
通讯作者: 郑多 E-mail:zhengduohello@126.com
基金资助:
国家自然科学基金青年基金项目(61903350);教育部产学研创新项目(2021ZYA02002);北京理工大学青年教师学术启动计划(3010011182130)

Intelligent cooperative interception strategy of aircraft against cluster attack

Shuyi GAO¹, Defu LIN¹, Duo ZHENG¹(), Xinyu HU²

^1.School of Aerospace Engineering，Beijing Institute of Technology，Beijing 100081，China
^2.XUTELI School，Beijing Institute of Technology，Beijing 100081，China

Received:2022-11-23 Revised:2022-12-20 Accepted:2023-02-22 Online:2023-09-25 Published:2023-03-03
Contact: Duo ZHENG E-mail:zhengduohello@126.com
Supported by:
National Natural Science Foundation of China(61903350);Ministry of Education's industry-university-researchinnovation project(2021ZYA02002);Beijing Institute of Technology Research Fund Program for Young Scholars(3010011182130)

摘要/Abstract

摘要：

无人集群间拦截博弈对抗是未来智能化战争的重要作战场景。针对飞行器集群攻击的协同拦截博弈对抗问题，提出了一种基于近端策略优化方法的多智能体深度强化学习协同拦截策略，将单智能体近端策略优化算法和集中式评价分布式执行算法架构相结合，设计了一种多智能体强化学习智能机动策略，在此基础上为解决算法收敛慢的问题，引入广义优势函数提升算法的收敛性能。仿真结果表明，多机智能协同拦截策略赋予飞行器自主学习的属性，能够根据实时战场态势智能自主分配拦截任务，且通过约束策略更新幅度提升了算法收敛速率。经过不断迭代自学习，能够实现拦截策略的自主优化，在不同的场景下自学习提升协同拦截效能。

关键词: 群目标协同拦截, 近端策略优化, 多智能体强化学习, 集中式评价-分布式执行, 深度学习

Abstract:

The attack defense confrontation and interception between unmanned clusters is an important operational scenario in the future intelligent war. Aiming at the problem of cooperative interception of game confrontation against aircraft cluster attacks， a multi-agent deep reinforcement learning cooperative interception strategy based on the near end strategy optimization method is proposed. Combining the single agent near end strategy optimization algorithm with the centralized evaluation distributed execution algorithm architecture， a multi-agent reinforcement learning intelligent maneuver strategy is designed. On this basis， to solve the problem of slow algorithm convergence， the generalized dominance function is introduced to improve the convergence performance of the algorithm. Simulation results show that the multi aircraft intelligent cooperative interception strategy endows the UAV with the attribute of autonomous learning， which can intelligently and autonomously assign interception tasks according to the real-time battlefield situation， and improves the algorithm convergence rate by constraining the update range of the strategy. Through continuous iterative self-learning， this strategy can realize the autonomous optimization of game interception strategy. Improve collaborative interception efficiency by self-learning in different scenarios.

Key words: multi-target cooperative interception, proximal policy optimization, multi-agent reinforcement learning, centralized evaluation-distributed execution, deep learning

中图分类号:

V279

高树一, 林德福, 郑多, 胡馨予. 针对集群攻击的飞行器智能协同拦截策略[J]. 航空学报, 2023, 44(18): 328301-328301.

Shuyi GAO, Defu LIN, Duo ZHENG, Xinyu HU. Intelligent cooperative interception strategy of aircraft against cluster attack[J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(18): 328301-328301.

图/表 22

图1

图2

图3

图4

图5

图6

图7

图8

图9

图10

图11

表1

图12

图13

表2

表3

图14

图15

图16

表 4

图17

表5

参考文献 26

1	GUO D， LIANG Z X， JIANG P， et al. Weapon-target assignment for multi-to-multi interception with grouping constraint［J］. IEEE Access， 2019， 7： 34838-34849.
2	GUO J G， HU G J， GUO Z Y， et al. Evaluation model， intelligent assignment， and cooperative interception in multimissile and multitarget engagement［J］. IEEE Transactions on Aerospace and Electronic Systems， 2022， 58（4）： 3104-3115.
3	KHOSRAVI M， AGHDAM A G. Cooperative receding horizon control for multi-target interception in uncertain environments［C］∥ 53rd IEEE Conference on Decision and Control. Piscataway： IEEE Press， 2015： 4497-4502.
4	MENG X Q， SUN B， ZHU D Q. Harbour protection： Moving invasion target interception for multi-AUV based on prediction planning interception method［J］. Ocean Engineering， 2021， 219： 108268.
5	SUN Z Y， YANG J Y. Multi-missile interception for multi-targets： Dynamic situation assessment， target allocation and cooperative interception in groups［J］. Journal of the Franklin Institute， 2022， 359（12）： 5991-6022.
6	ZHU R， SUN D， ZHOU Z Y. Cooperation strategy of unmanned air vehicles for multitarget interception［J］. Journal of Guidance， Control， and Dynamics， 2005， 28（5）： 1068-1072.
7	JEON I S， LEE J I， TAHK M J. Impact-time-control guidance law for anti-ship missiles［J］. IEEE Transactions on Control Systems Technology， 2006， 14（2）： 260-266.
8	吕腾，吕跃勇，李传江，等.带空间协同的多导弹时间协同制导律［J］.航空学报，2018，39（10）：322115.
	LYU T， LYU Y Y， LI C J， et al. Time cooperative guidance law for multiple missiles with space coopera-tion ［J］. Acta Aeronautica et Astronautica Sinica， 2018， 39（10）： 322115 （in Chinese）.
9	SINHA A， KUMAR S R. Supertwisting control-based cooperative salvo guidance using leader-follower approach［J］. IEEE Transactions on Aerospace and Electronic Systems， 2020， 56（5）： 3556-3565.
10	ZHANG P， ZHANG X Y. Multiple missiles fixed-time cooperative guidance without measuring radial velocity for maneuvering targets interception［J］. ISA Transactions， 2022， 126： 388-397.
11	SHAFERMAN V， SHIMA T. Linear quadratic guidance laws for imposing a terminal intercept angle［J］. Journal of Guidance， Control， and Dynamics， 2008， 31（5）： 1400-1412.
12	SUN X J， ZHOU R， HOU D L， et al. Consensus of leader-followers system of multi-missile with time-delays and switching topologies［J］. Optik， 2014， 125（3）： 1202-1208.
13	ERER K， MERTTOPÇUOGLU O. Indirect control of impact angle against stationary targets using biased PPN： AIAA-2010-8184［R］. Reston： AIAA， 2010.
14	HARL N， BALAKRISHNAN S N. Impact time and angle guidance with sliding mode control［J］. IEEE Transactions on Control Systems Technology， 2012， 20（6）： 1436-1449.
15	KUMAR S R， RAO S， GHOSE D. Nonsingular terminal sliding mode guidance with impact angle constraints［J］. Journal of Guidance， Control， and Dynamics， 2014， 37（4）： 1114-1130.
16	DONG X F， REN Z. Impact angle constrained distributed cooperative guidance against maneuvering targets with undirected communication topologies［J］. IEEE Access， 2020， 8： 117867-117876.
17	KANG S， KIM H J. Differential game missile guidance with impact angle and time constraints［J］. IFAC Proceedings Volumes， 2011， 44（1）： 3920-3925.
18	WANG B L， LI S G， GAO X Z， et al. UAV swarm confrontation using hierarchical multiagent reinforcement learning［J］. International Journal of Aerospace Engineering， 2021， 2021： 1-12.
19	陈灿，莫雳，郑多，等. 非对称机动能力多无人机智能协同攻防对抗［J］. 航空学报， 2020， 41（12）： 324152.
	CHEN C， MO L， ZHENG D， et al. Cooperative attack-defense game of multiple UAVs with asymmetric maneuverability［J］. Acta Aeronautica et Astronautica Sinica， 2020， 41（12）： 324152 （in Chinese）.
20	IMADO F， KURODA T. Family of local solutions in a missile-aircraft differential game［J］. Journal of Guidance， Control， and Dynamics， 2011， 34（2）： 583-591.
21	Bowling M， Veloso M. Rational and convergent learning in stochastic games［C］∥ International Joint Conference On Artificial Intelligence. Hillsdale： Lawrence Erlbaum Associates Ltd， 2001： 1021-1026.
22	罗德林，段海滨，吴顺详，等. 基于启发式蚁群算法的协同多目标攻击空战决策研究［J］. 航空学报， 2006， 27（6）： 1166-1170.
	LUO D L， DUAN H B， WU S X， et al. Research on air combat decision-making for cooperative multiple target attack using heuristic ant colony algorithm［J］. Acta Aeronautica et Astronautica Sinica， 2006， 27（6）： 1166-1170 （in Chinese）.
23	裴培，何绍溟，王江，等. 一种深度强化学习制导控制一体化算法［J］. 宇航学报， 2021， 42（10）： 1293-1304.
	PEI P， HE S M， WANG J， et al. Integrated guidance and control for missile using deep reinforcement learning［J］. Journal of Astronautics， 2021， 42（10）： 1293-1304 （in Chinese）.
24	LEE S M， KIM H， MYUNG H， et al. Cooperative coevolutionary algorithm-based model predictive control guaranteeing stability of multirobot formation［J］. IEEE Transactions on Control Systems Technology， 2015， 23（1）： 37-51.
25	WU X， LIU Y， XIE S R， et al. Collaborative defense with multiple USVs and UAVs based on swarm intelligence［J］. Journal of Shanghai Jiaotong University （Science）， 2020， 25（1）： 51-56.
26	LUO Y X， SONG J A， ZHAO K， et al. UAV-cooperative penetration dynamic-tracking interceptor method based on DDPG［J］. Applied Sciences， 2022， 12（3）： 1618.

E-mail：hkxb@buaa.edu.cn

关于我们

期刊社服务

专业学科

封面文章

友情链接

主管单位：中国科学技术协会主办单位：中国航空学会北京航空航天大学

参数	数值
PPO裁剪系数	0.2
熵奖励系数	0.02
GAE参数	0.98
衰减因子	0.998
神经网络优化器	Adam
Mini_batch数量	4
缓存区大小	4 096
学习率	0.000 3

场景	防御飞行器数量	进攻飞行器数量	防护目标数量
1	5	1	1
2	5	2	1
3	5	3	1
4	5	4	1

防御vs进攻	防御飞行器			进攻飞行器			目标位置/m
防御vs进攻	初始位置/m	初始速度 /（m·s^-1）	初始航向角 /rad	初始位置/m	初始速度 /（m·s^-1）	初始航向角 /rad	目标位置/m
5 vs 1	（74.8，101.5）	18.1	0.02	（578.2，321.3）	16.1	-2.18	（99.3，135.3）
	（69.4，95.6）	19.5	-0.34
	（98.7，100.5）	19.4	-0.29
	（81.9，63.5）	17.2	-0.47
	（76.7，151.5）	18.1	-0.26
5 vs 2	（-72.8，131.2）	17.3	-0.71	（187.2，-411.6）（-593，121.8）	18.7 19.3	2.33 -0.56	（33.5，145.3）
	（-87.3，56.9）	17.6	-1.28
	（-97.9，65.2）	18.1	3.27
	（-17.5，6.6）	18.7	2.75
	（-99.6，89.4）	15.9	2.91
5 vs 3	（-117.4，2.5）	17.9	0.71	（621.2，263.2）（93.7，-571.3）（-183.6，-493.2）	20.8 16.3 17.9	-2.38 1.58 1.33	（-21.7，112.7）
	（37.4，26.6）	17.5	0.14
	（87.7，71.5）	18.5	-1.41
	（56.9，47.5）	20.3	-1.87
	（57.7，-2.5）	16.1	-2.16
5 vs 4	（-31.4，39.5）	17.1	-0.41	（541.2，-188.5）（-331.8，531）（-119.2，-421.1）（468.4，438.2）	21.4 22.5 17.3 15.6	2.89 -1.16 1.36 -2.13	（-1.5，193.9）
	（-53.4，38.6）	16.8	-0.26
	（-19.7，12.5）	20.3	2.51
	（-135.9，40.5）	16.9	-1.86
	（-11.7，93.5）	20.5	1.13

防御方	智能任务分配
防御方	1架进攻	2架进攻	3架进攻	4架进攻
D-UAV1	A-UAV1	A-UAV2	A-UAV2	A-UAV1
D-UAV2	A-UAV1	A-UAV1	A-UAV1	A-UAV1
D-UAV3	A-UAV1	A-UAV2	A-UAV1	A-UAV2
D-UAV4	A-UAV1	A-UAV2	A-UAV2	A-UAV3
D-UAV5	A-UAV1	A-UAV2	A-UAV3	A-UAV4

防御vs进攻	成功率/%	平均脱靶量/m
5 vs 1	100	1.1
5 vs 2	84	1.7
5 vs 3	78	2.1
5 vs 4	70	2.8

针对集群攻击的飞行器智能协同拦截策略

Intelligent cooperative interception strategy of aircraft against cluster attack

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

图/表 22

参考文献 26

相关文章 15

编辑推荐

Metrics

本文评价

[1]	柳家齐, 陈荣钱, 楼锦华, 韩旭, 吴昊, 尤延铖. 基于深度学习的高速直升机旋翼翼型气动优化设计[J]. 航空学报, 2024, 45(9): 529828-529828.
[2]	罗旭东, 吴一全, 陈金林. 无人机航拍影像目标检测与语义分割的深度学习方法研究进展[J]. 航空学报, 2024, 45(6): 28822-028822.
[3]	刘海桥, 刘萌, 龚子超, 董晶. 基于深度学习的图像匹配方法综述[J]. 航空学报, 2024, 45(3): 28796-028796.
[4]	苏鑫, 管润程, 王桥, 苑伟政, 吕湘连, 何洋. 基于深度学习的结冰区域和厚度检测方法[J]. 航空学报, 2023, 44(S2): 729283-729283.
[5]	陈立群, 邹旭, 张磊, 朱颖盼, 王港, 陈金勇. 基于国产商用器件的星载智能目标检测技术[J]. 航空学报, 2023, 44(S2): 728860-728860.
[6]	刘鹏宇, 朱雪耀. 基于深度学习的融合空域空管指令语义解析技术[J]. 航空学报, 2023, 44(S1): 727592-727592.
[7]	符小卫, 徐哲, 朱金冬, 王楠. 基于PER-MATD3的多无人机攻防对抗机动决策[J]. 航空学报, 2023, 44(7): 327083-327083.
[8]	何磊, 钱炜祺, 董康生, 易贤, 柴聪聪. 基于卷积神经网络的结冰翼型气动特性建模[J]. 航空学报, 2023, 44(5): 126434-126434.
[9]	丁鹏, 宋亚飞. 代价敏感的空中目标意图识别方法[J]. 航空学报, 2023, 44(24): 328551-328551.
[10]	李晓航, 周建江. 基于自适应记忆长度的多尺度模态融合网络[J]. 航空学报, 2023, 44(22): 628977-628977.
[11]	罗皓文, 何绍溟, 金天宇, 刘子超. 基于迁移学习的角度约束时间最短制导算法[J]. 航空学报, 2023, 44(19): 328400-328400.
[12]	赵鋆赫, 王生楠. 基于深度学习的权函数法应力强度因子求解[J]. 航空学报, 2023, 44(19): 228367-228367.
[13]	苑玉彬, 吴一全, 赵朗月, 陈金林, 赵其昌. 基于深度学习的无人机航拍视频多目标检测与跟踪研究进展[J]. 航空学报, 2023, 44(18): 28334-028334.
[14]	张荣升, 吴燕生, 秦旭东, 张普卓. 基于深度学习的高空风在线估计及预报方法[J]. 航空学报, 2023, 44(13): 327860-327860.
[15]	王强, 吴乐天, 王勇, 王欢, 杨万扣. 基于关键点检测的红外弱小目标检测[J]. 航空学报, 2023, 44(10): 328173-328173.