非对称机动能力多无人机智能协同攻防对抗 Cooperative attack-defense game of multiple UAVs with asymmetric maneuverabi lity期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

非对称机动能力多无人机智能协同攻防对抗

引用本文：	陈灿,莫雳,郑多,程子恒,林德福.非对称机动能力多无人机智能协同攻防对抗[J].航空学报,2020,41(12):324152-324152.

作者姓名：	陈灿莫雳郑多程子恒林德福

作者单位：	1. 北京理工大学宇航学院, 北京 100081;2. 北京理工大学无人机自主控制技术北京市重点实验室, 北京 100081

摘要：	协同攻防对抗是未来军用无人机的重要作战场景。针对不同机动能力无人机群体间的攻防对抗问题，建立了多无人机协同攻防演化模型，基于多智能体强化学习理论，研究了多无人机协同攻防的自主决策方法，提出了基于执行-评判（Actor-Critic）算法的集中式评判和分布式执行的算法结构，保证算法稳定收敛的同时，提升执行效率。无人机的评判模块使用全局信息评价决策优劣引导策略学习，而执行时只需要依赖局部感知信息进行自主决策，提高了多机攻防对抗的效能。仿真结果表明，所提的多无人机强化学习方法具备较强的自进化属性，赋予了无人机一定智能，即稳定的自主学习能力，通过不断演化，能自主学习提升协同对抗的决策效能。
关键词：	多无人机协同攻防对抗强化学习集中式评判分布式执行
收稿时间：	2020-04-29
修稿时间：	2020-05-22
Cooperative attack-defense game of multiple UAVs with asymmetric maneuverabi lity

CHEN Can,MO Li,ZHENG Duo,CHENG Ziheng,LIN Defu.Cooperative attack-defense game of multiple UAVs with asymmetric maneuverabi lity[J].Acta Aeronautica et Astronautica Sinica,2020,41(12):324152-324152.

Authors:	CHEN Can MO Li ZHENG Duo CHENG Ziheng LIN Defu

Institution:	1. School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China;2. Beijing Key Laboratory of UAV Autonomous Control, Beijing Institute of Technology, Beijing 100081, China

Abstract:	The attack-defense game is an important combat scenario of future military Unmanned Aerial Vehicles (UAVs). This paper studies an attack-defense game between groups of UAVs with different maneuverability, establishing a multi-UAV cooperative attack and defense evolution model. Based on the multi-agent reinforcement learning theory, the autonomous decision-making method of multi-UAV cooperative attack-defense game is studied, and a centralized critic and distributed actor algorithm structure is proposed based on the actor-critic algorithm, guaranteeing the convergence of the algorithm and improving the efficiency of decision-making. The critic module of UAVs uses the global information to evaluate the decision-making quality during training, while the actor module only needs to rely on the local perception information to make autonomous decisions during execution, hence improving the effectiveness of the multi-UAV attack-defense game. The simulation results show that the proposed multi-UAV reinforcement learning method has a strong self-evolution property, endowing the UAV certain intelligence, that is, the stable autonomous learning ability. Through continuous training, the UAVs can autonomously learn cooperative attack or defense policies to improve the effectiveness of decision-making.

Keywords:	multi-UAV coordination attack-defense games reinforcement learning centralized critic distributed actors
本文献已被万方数据等数据库收录！
	点击此处可从《航空学报》浏览原始摘要信息
	点击此处可从《航空学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏