FDTD parallel algorithm based on distributed platform



Funding:	National High-tech Research and Development Program of China ("863" Program) (2012AA01A308); National Basic Research Program of China ("973" Program) (613206)

Received:	2015-09-10

FENG Yuan, DAI Xiaoxia, TANG Xiaobin, GONG Xiaoyan. FDTD parallel algorithm based on distributed platform[J]. Journal of Beijing University of Aeronautics and Astronautics, 2016, 42(9): 1874-1883. DOI: 10.13700/j.bh.1001-5965.2015.0593

Authors:	FENG Yuan, DAI Xiaoxia, TANG Xiaobin, GONG Xiaoyan

Affiliation:	1. Institute of Early Warning Aircraft, China Academy of Electronics and Information Technology, Beijing 100041; 2. Department of Radar Technology, Academy of Air Force Early Warning, Wuhan 430019; 3. Department of Command, Rocket Army Command Academy, Wuhan 430012

Abstract:	A new parallel finite-difference time-domain (FDTD) algorithm is developed for distributed platforms. It is implemented with VC++ and CUDA 5.0 and tested with the Intel MPI 4.1.0 library. The software was debugged on the graphics processing unit (GPU) cluster of the High Performance Computing Center at Shanghai Jiao Tong University, the "Magic Cube" (魔方) commercial supercomputer at the Shanghai Supercomputer Center, and the "Sunway BlueLight" (神威蓝光) domestic supercomputer at the National Supercomputing Center in Jinan. In CPU-only, GPU-only, and hybrid CPU-GPU tests, thread scheduling and kernel function processing speed improved markedly, the share of execution time spent on communication decreased, and both speedup and parallel efficiency increased. Finally, a topology optimization test was run with a 2×2 microstrip array as the validation model; the results show that the algorithm is accurate and effective.

Keywords:	Mur; message passing interface; graphics processing unit (GPU); finite-difference time-domain (FDTD); distributed platform
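
The abstract describes an FDTD solver distributed over message-passing processes and evaluated by speedup and parallel efficiency. The sketch below is not the authors' implementation; it is a minimal 1D illustration, under stated assumptions, of the boundary ("ghost cell") exchange that domain-decomposed FDTD codes of this kind typically rely on. The two subdomains are simulated inside one Python process, the plain assignments marked "exchange" stand in for what would be MPI send/receive calls, simple fixed (PEC) boundaries replace the Mur condition, and all names (`run_serial`, `run_split`, `speedup`, `efficiency`) are hypothetical. The metric helpers assume the conventional definitions S = T1/Tp and E = S/p, which the paper may or may not use in exactly this form.

```python
import math

C = 0.5  # Courant number in normalized units (illustrative choice)

def source(t):
    # Gaussian pulse used as a soft source (assumed, not from the paper)
    return math.exp(-((t - 30.0) / 10.0) ** 2)

def run_serial(n, steps, src):
    """Reference 1D FDTD on a single domain with fixed end points."""
    ez = [0.0] * n
    hy = [0.0] * (n - 1)          # hy[i] sits between ez[i] and ez[i+1]
    for t in range(steps):
        for i in range(n - 1):
            hy[i] += C * (ez[i + 1] - ez[i])
        for i in range(1, n - 1):
            ez[i] += C * (hy[i] - hy[i - 1])
        ez[src] += source(t)
    return ez

def run_split(n, steps, src, m):
    """Same update on two subdomains split at cell m (1 <= m <= n-2)."""
    ez_l = [0.0] * (m + 1)        # owns ez[0..m-1]; ez_l[m] is a ghost copy of ez[m]
    hy_l = [0.0] * m              # owns hy[0..m-1]
    ez_r = [0.0] * (n - m)        # owns ez[m..n-1]
    hy_r = [0.0] * (n - m)        # hy_r[0] is a ghost copy of hy[m-1]; rest owns hy[m..n-2]
    for t in range(steps):
        ez_l[m] = ez_r[0]         # exchange: right sends its first E value to left
        for i in range(m):
            hy_l[i] += C * (ez_l[i + 1] - ez_l[i])
        for k in range(1, n - m):
            hy_r[k] += C * (ez_r[k] - ez_r[k - 1])
        hy_r[0] = hy_l[m - 1]     # exchange: left sends its last H value to right
        for i in range(1, m):
            ez_l[i] += C * (hy_l[i] - hy_l[i - 1])
        for j in range(n - m - 1):
            ez_r[j] += C * (hy_r[j + 1] - hy_r[j])
        if src < m:
            ez_l[src] += source(t)
        else:
            ez_r[src - m] += source(t)
    return ez_l[:m] + ez_r        # drop the ghost cell when gathering

def speedup(t1, tp):
    """Speedup S = T1 / Tp (serial time over parallel time)."""
    return t1 / tp

def efficiency(t1, tp, p):
    """Parallel efficiency E = S / p for p processing units."""
    return speedup(t1, tp) / p
```

Calling `run_split(200, 300, 50, 100)` and `run_serial(200, 300, 50)` should give the same field, which is the property a decomposed solver has to preserve; only one E value and one H value cross the interface per time step, which is why the communication share of run time can be kept small relative to the interior updates.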
This article is indexed in Wanfang Data and other databases.