主管单位:中华人民共和国工业和信息化部
主办单位:西北工业大学  中国航空学会
地       址:西北工业大学友谊校区航空楼
自主空战连续决策方法
作者:
作者单位:

1.西北工业大学航空学院/中国人民解放军93995部队;2.西北工业大学航空学院

作者简介:

通讯作者:

中图分类号:

V212.1

基金项目:

国防科技重点实验室基金(6142219190302)


Continuous decision-making method for autonomous air combat
Author:
Affiliation:

1.School of Aeronautics Northwestern Polytechnical University/93995 Unit of Chinese People’s Liberation Army;2.School of Aeronautics Northwestern Polytechnical University

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    未来空战正朝着无人化、自主化方向发展,自主空战决策方法是未来空战的重要支撑手段之一。传统空战决策方法由于维度限制,存在无法处理连续动作与远视决策的问题。基于Actor-Critic 方法提出空战连续决策的统一架构,依据空战训练经验对状态空间、动作空间、奖励及训练科目进行合理设计,测试多种连续动作空间强化学习算法在高不确定性空战场景下的学习效果并进行可视化验证。结果表明:基于本文提出的方法架构,可以实现连续动作下的远视价值寻优,智能体可以在复杂空战态势下做出最优决策,对随机机动飞行目标有较高的击杀率,且空战机动轨迹具有较高的合理性。

    Abstract:

    The future air warfare is developing in the direction of unmanned and autonomous, and autonomous air warfare decision-making methods are one of the important support methods for future air warfare. Due to dimensional limitations, traditional air combat decision-making methods cannot handle continuous action and long-sighted decision-making problems. Based on the Actor-Critic method, this paper proposes a unified architecture for continuous decision-making in air combat. Combining air combat training experience, the state space, action space, reward and training subjects are rationally designed, and a variety of continuous action space reinforcement learning algorithms are tested in high uncertainty. The learning effect in the air combat scenario has been visually verified. The results show that: based on the method architecture proposed in this paper, long-sighted value optimization under continuous actions can be realized, the agent can make optimal decisions in complex air combat situations, and has a high kill rate against random maneuvering flying targets. And the air combat maneuver trajectory is highly reasonable.

    参考文献
    相似文献
    引证文献
引用本文

单圣哲,杨孟超,张伟伟,高传强.自主空战连续决策方法[J].航空工程进展,2022,13(5):47-58

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-11-25
  • 最后修改日期:2022-01-24
  • 录用日期:2022-01-30
  • 在线发布日期: 2022-07-25
  • 出版日期: