首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Multi-agent Q-Learning control of spacecraft formation flying reconfiguration trajectories
Institution:1. Department of Electrical Engineering, Iran University of Science and Technology, Tehran 1684613114, Iran;2. Department of Computer Engineering, Iran University of Science and Technology, Tehran 1684613114, Iran
Abstract:This paper presents a novel approach based on multi-agent reinforcement learning for spacecraft formation flying reconfiguration tracking problems. In this scheme, spacecrafts learn the control strategy via transfer learning. For this matter, a new generalized discounted value function is introduced for the tracking problems. Due to the digital nature of spacecraft computer systems, local optimal controllers are developed for the spacecrafts in discrete-time. The stability of the controller is proven. Two Q-learning algorithms are proposed, in each of which the optimal control solution is learned on-line without knowledge about the system dynamics. In the first algorithm, each agent learns the optimal control independently. In the second one, each agent shares the learned information with other agents. Next, the collision avoidance capability is provided. The effectiveness of the presented schemes is verified through simulations and compared with each other.
Keywords:Reinforcement learning  Spacecraft formation flying  Q-learning  Optimal adaptive control  Multi-agent systems
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号