首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于CGAN的避扰通信决策网络离线式训练方法
引用本文:江民民,李大朋,邱昕,慕福奇,柴旭荣,孙志浩.基于CGAN的避扰通信决策网络离线式训练方法[J].北京航空航天大学学报,2020,46(7):1412-1421.
作者姓名:江民民  李大朋  邱昕  慕福奇  柴旭荣  孙志浩
作者单位:1.中国科学院大学 微电子学院, 北京 100029
摘    要:基于强化学习的避扰通信,由于需要不断地与环境交互从中学习到最优决策,其决策网络的训练时间受环境反馈速率的约束,通常耗时严重。针对这一问题,提出了一种离线式训练方法。构建出一种频谱虚拟环境生成器,可以快速生成大量的逼真合成频谱瀑布图,用于避扰通信决策网络训练。由于所提方法脱离真实环境反馈,形成离线式训练,进而显著提高模型训练效率。实验结果表明:与实时在线训练方法比较,所提离线式训练方法的训练时间可以减少50%以上。 

关 键 词:强化学习    避扰通信    频谱瀑布图    条件生成对抗网络(CGAN)    离线式训练
收稿时间:2019-08-16

An offline training method using CGAN for anti-jamming communication decision network
Institution:1.School of Microelectronics, University of Chinese Academy of Sciences, Beijing 100029, China2.Institute of Microelectronics of the Chinese Academy of Sciences, Beijing 100029, China
Abstract:Due to the continuous interaction with the environment to learn the optimal decision, the training time of the decision network based on reinforcement learning is restricted by the feedback rate of the environment, which usually consumes a lot of time. To solve this problem, an offline training method is proposed. A spectrum virtual environment generator is constructed, which can quickly generate a large number of realistic synthetic spectrum waterfall images for the training of anti-jamming communication decision network. Because the method is separated from the real environment feedback, the offline training is formed and the efficiency of model training is improved significantly. Experimental results show that the training time of this offline method is reduced by more than 50% compared with the online real-time training method. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号