Continuous decision-making method for autonomous air combat (自主空战连续决策方法)

Cite this article:	单圣哲, 杨孟超, 张伟伟, 高传强. 自主空战连续决策方法[J]. 航空工程进展, 2022, 13(5): 47-58.

Authors:	Shan ShengZhe (单圣哲), Yang MengChao (杨孟超), Zhang WeiWei (张伟伟), Gao ChuanQiang (高传强)

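Affiliations:	School of Aeronautics, Northwestern Polytechnical University / Unit 93995 of the Chinese People's Liberation Army; School of Aeronautics, Northwestern Polytechnical University; School of Aeronautics, Northwestern Polytechnical University; School of Aeronautics, Northwestern Polytechnical University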

Funding:	National Defense Science and Technology Key Laboratory Fund (6142219190302)

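Abstract (translated from the Chinese):	Future air combat is moving toward unmanned, autonomous operation, and autonomous air combat decision-making methods are among its key enabling technologies. Because of dimensionality limits, traditional air combat decision-making methods cannot handle continuous actions or far-sighted decisions. A unified architecture for continuous air combat decision-making is proposed based on the Actor-Critic method. The state space, action space, rewards, and training subjects are designed according to air combat training experience, and the learning performance of several continuous-action-space reinforcement learning algorithms is tested in highly uncertain air combat scenarios and verified through visualization. The results show that, with the proposed architecture, far-sighted value optimization under continuous actions can be achieved, the agent can make optimal decisions in complex air combat situations, it attains a high kill rate against randomly maneuvering targets, and its air combat maneuver trajectories are highly reasonable.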
Keywords:	autonomous air combat; reinforcement learning; artificial intelligence; deep neural network
Received:	2021/11/25
Revised:	2022/1/24
Continuous decision-making method for autonomous air combat

Shan ShengZhe, Yang MengChao, Zhang WeiWei and Gao ChuanQiang. Continuous decision-making method for autonomous air combat[J]. Advances in Aeronautical Science and Engineering, 2022, 13(5): 47-58.

Authors:	Shan ShengZhe, Yang MengChao, Zhang WeiWei and Gao ChuanQiang

Abstract:	Future air warfare is developing toward unmanned and autonomous operation, and autonomous air combat decision-making methods are an important supporting technology for it. Due to dimensional limitations, traditional air combat decision-making methods cannot handle continuous actions or long-sighted decision-making. Based on the Actor-Critic method, this paper proposes a unified architecture for continuous decision-making in air combat. Drawing on air combat training experience, the state space, action space, reward, and training subjects are rationally designed, and the learning effect of a variety of continuous-action-space reinforcement learning algorithms is tested in high-uncertainty air combat scenarios and visually verified. The results show that, based on the method architecture proposed in this paper, long-sighted value optimization under continuous actions can be realized: the agent can make optimal decisions in complex air combat situations, has a high kill rate against randomly maneuvering flying targets, and produces highly reasonable air combat maneuver trajectories.

Keywords:	autonomous air combat; reinforcement learning; artificial intelligence; deep neural network
