首页 | 本学科首页   官方微博 | 高级检索  
     

基于分布式平台的FDTD并行算法
引用本文:冯圆,代小霞,唐晓斌,龚晓燕. 基于分布式平台的FDTD并行算法[J]. 北京航空航天大学学报, 2016, 42(9): 1874-1883. DOI: 10.13700/j.bh.1001-5965.2015.0593
作者姓名:冯圆  代小霞  唐晓斌  龚晓燕
作者单位:1.中国电子科学研究院 预警机研究所, 北京 100041
基金项目:国家“863”计划(2012AA01A308),国家“973”计划(613206),National High-tech Research and Development Program of China(2012AA01A308),National Basic Research Program of China(613206)
摘    要:
基于分布式平台开展一种新的时域有限差分(FDTD)并行算法研究,该算法基于VC++、CUDA5.0平台开发,调用Intel MPI 4.1.0库进行测试,在上海交通大学高性能计算中心图形处理单元(GPU)集群、上海超级计算机中心的“魔方”商用超级计算机以及国家超级计算济南中心的“神威蓝光”国产超级计算机等平台开展软件调试。通过对纯CPU、GPU以及CPU和GPU的混合测试,线程调度水平、核心函数处理速度得到明显提升,同时减少了通信执行时间比例,提高了加速比和并行效率,最后以2×2微带阵列为验证模型进行拓扑优化测试,结果证明该算法准确、有效。 

关 键 词:Mur   消息传递接口   图形处理单元(GPU)   时域有限差分(FDTD)   分布式平台
收稿时间:2015-09-10

FDTD parallel algorithm based on distributed platform
FENG Yuan,DAI Xiaoxia,TANG Xiaobin,GONG Xiaoyan. FDTD parallel algorithm based on distributed platform[J]. Journal of Beijing University of Aeronautics and Astronautics, 2016, 42(9): 1874-1883. DOI: 10.13700/j.bh.1001-5965.2015.0593
Authors:FENG Yuan  DAI Xiaoxia  TANG Xiaobin  GONG Xiaoyan
Affiliation:1.Institute of Early Warning Aircraft, China Academy of Electronics and Information Technology, Beijing 1000412. Department of Radar Technology, Academy of Air Force Early Warning, Wuhan 4300193. Department of Command, Rocket Army Command Academy, Wuhan 430012
Abstract:
A new finite difference time domain (FDTD) parallel algorithm is developed based on distributed platform, which is based on VC++, CUDA5.0 development platform, calling Intel MPI 4.1.0 library for testing, developing software debugging on the platforms of high performance computing center graphics processing units (GPU) cluster in Shanghai Jiao Tong University, "Rubik's Cube" commercial super computer at Shanghai Supercomputer Center, and "Divinity Blue" domestic super computer at the National Supercomputing Center in Jinan. By pure CPU, GPU, and CPU and GPU hybrid test, thread scheduling level and kernel function processing speed improve significantly, while the proportion of the execution time of communication reduces, and the acceleration ratio and operation efficiency improve. Finally, the topology optimization of the model is verified by 2×2 micro-strip arrays. The results show that the algorithm is accurate and effective.
Keywords:Mur  message passing interface  graphics processing units (GPU)  finite difference time domain (FDTD)  distributed platform
本文献已被 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号