题名 | Multiple-GPU accelerated high-order gas-kinetic scheme for direct numerical simulation of compressible turbulence |
作者 | |
通讯作者 | Pan,Liang |
发表日期 | 2023-03-01
|
DOI | |
发表期刊 | |
ISSN | 0021-9991
|
EISSN | 1090-2716
|
卷号 | 476 |
摘要 | High-order gas-kinetic scheme (HGKS) has become a workable tool for the direct numerical simulation (DNS) of turbulence. In this paper, to accelerate the computation, HGKS is implemented with the graphical processing unit (GPU) using the compute unified device architecture (CUDA). Due to the limited available memory size, the computational scale is constrained by single GPU. For large-scale DNS of turbulence, we develop a multi-GPU HGKS simulation using message passing interface (MPI) and CUDA. The benchmark cases for compressible turbulence, including Taylor-Green vortex and turbulent channel flows, are presented to assess the numerical performance of HGKS with Nvidia TITAN RTX and Tesla V100 GPUs. For single-GPU computation, compared with the parallel central processing unit (CPU) code running on the Intel Core i7-9700 with open multi-processing (OpenMP) directives, 7x speedup is achieved by TITAN RTX and 16x speedup is achieved by Tesla V100. For multiple-GPU computation, multiple-GPU accelerated HGKS code scales properly with the increasing number of GPU. The computational time of parallel CPU code running on 1024 Intel Xeon E5-2692 cores with MPI is approximately 3 times longer than that of GPU code using 8 Tesla V100 GPUs with MPI and CUDA. Numerical results confirm the excellent performance of multiple-GPU accelerated HGKS for large-scale DNS of turbulence. Besides reducing memory access pressure, we also exploit single precision floating point arithmetic to accelerate HGKS on GPUs. Reasonably, compared to the computation with FP64 precision, the efficiency is improved and the memory cost is reduced with FP32 precision. Meanwhile, the differences in accuracy for statistical turbulent quantities appear. For turbulent channel flows, difference in long-time statistical turbulent quantities is acceptable between FP32 and FP64 precision solutions. While the obvious discrepancy in instantaneous turbulent quantities can be observed, which shows that FP32 precision is not safe for DNS in compressible turbulence. The choice of precision should be depended on the requirement of accuracy and the available computational resources. |
关键词 | |
相关链接 | [Scopus记录] |
收录类别 | |
语种 | 英语
|
学校署名 | 其他
|
资助项目 | National Natural Science Foundation of China[11701038]
; Fundamental Research Funds for the Central Universities[2018NTST19]
|
WOS研究方向 | Computer Science
; Physics
|
WOS类目 | Computer Science, Interdisciplinary Applications
; Physics, Mathematical
|
WOS记录号 | WOS:000923255500001
|
出版者 | |
EI入藏号 | 20230213380923
|
EI主题词 | Application programming interfaces (API)
; Benchmarking
; Channel flow
; Compressibility of gases
; Digital arithmetic
; Graphics processing unit
; Kinetic theory
; Kinetics
; Memory architecture
; Message passing
; Numerical models
; Program processors
; Turbulence
|
EI分类号 | Fluid Flow, General:631.1
; Semiconductor Devices and Integrated Circuits:714.2
; Computer Theory, Includes Formal Logic, Automata Theory, Switching Theory, Programming Theory:721.1
; Computer Circuits:721.3
; Computer Systems and Equipment:722
; Computer Software, Data Handling and Applications:723
; Computer Programming:723.1
; Data Processing and Image Processing:723.2
; Computer Applications:723.5
; Mathematics:921
; Numerical Methods:921.6
; Classical Physics; Quantum Theory; Relativity:931
; Physical Properties of Gases, Liquids and Solids:931.2
|
ESI学科分类 | PHYSICS
|
Scopus记录号 | 2-s2.0-85146050650
|
来源库 | Scopus
|
引用统计 |
被引频次[WOS]:4
|
成果类型 | 期刊论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/442644 |
专题 | 前沿与交叉科学研究院 |
作者单位 | 1.Laboratory of Mathematics and Complex Systems,School of Mathematical Sciences,Beijing Normal University,Beijing,China 2.Academy for Advanced Interdisciplinary Studies,Southern University of Science and Technology,Shenzhen,China |
推荐引用方式 GB/T 7714 |
Wang,Yuhang,Cao,Guiyu,Pan,Liang. Multiple-GPU accelerated high-order gas-kinetic scheme for direct numerical simulation of compressible turbulence[J]. JOURNAL OF COMPUTATIONAL PHYSICS,2023,476.
|
APA |
Wang,Yuhang,Cao,Guiyu,&Pan,Liang.(2023).Multiple-GPU accelerated high-order gas-kinetic scheme for direct numerical simulation of compressible turbulence.JOURNAL OF COMPUTATIONAL PHYSICS,476.
|
MLA |
Wang,Yuhang,et al."Multiple-GPU accelerated high-order gas-kinetic scheme for direct numerical simulation of compressible turbulence".JOURNAL OF COMPUTATIONAL PHYSICS 476(2023).
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论