首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Propulsionless planar phasing of multiple satellites using deep reinforcement learning
Authors:Brenton Smith  Rasit Abay  Joshua Abbey  Sudantha Balage  Melrose Brown  Russell Boyce
Institution:School of Engineering and Information Technology, UNSW Canberra, Northcott Drive, Campbell, ACT, Australia
Abstract:This work creates a framework for solving highly non-linear satellite formation control problems by using model-free policy optimisation deep reinforcement learning (DRL) methods. This work considers, believed to be for the first time, DRL methods, such as advantage actor-critic method (A2C) and proximal policy optimisation (PPO), to solve the example satellite formation problem of propellantless planar phasing of multiple satellites. Three degree-of-freedom simulations, including a novel surrogate propagation model, are used to train the deep reinforcement learning agents. During training, the agents actuated their motion through cross-sectional area changes which altered the environmental accelerations acting on them. The DRL framework designed in this work successfully coordinated three spacecraft to achieve a propellantless planar phasing manoeuvre. This work has created a DRL framework that can be used to solve complex satellite formation flying problems, such as planar phasing of multiple satellites and in doing so provides key insights into achieving optimal and robust formation control using reinforcement learning.
Keywords:Reinforcement learning  Formation control
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号