导航

ACTA AERONAUTICAET ASTRONAUTICA SINICA ›› 2018, Vol. 39 ›› Issue (9): 121944-121953.doi: 10.7527/S1000-6893.2018.21944

• Fluid Mechanics and Flight Mechanics • Previous Articles     Next Articles

Multi-GPU parallel compressible flow solver and its performance analysis

LAI Jianqi, LI Hua, ZHANG Ran, CHANG Qing   

  1. College of Aerospace Science and Engineering, National University of Defense Technology, Changsha 410073, China
  • Received:2017-12-19 Revised:2018-02-08 Online:2018-09-15 Published:2018-04-02
  • Supported by:
    National Natural Science Foundation of China (11472004)

Abstract: To achieve efficient numerical solutions for large-scale compressible flow problems, Graphics Processing Units (GPU)-based parallel computing is studied. A multi-GPU parallel compressible flow solver based on Message Passing Interface + Compute Unified Device Architecture (MPI+CUDA)is built on the NVIDIA GTX 1070. This solver is applicable to structured meshes, and an upwind finite volume scheme AUSM+UP is used for spatial discretization. A one-dimensional domain decomposition method is used to divide the computational grid into the same size, so as to obtain load balancing among GPUs. According to the case of the supersonic inlet, the parallel performance of single GPU and scalability of multi-GPU are analyzed for this solver. The numerical results show that for single GPU, parallel computing can get a speedup ratio of 37 to 46 times, greatly improving computational efficiency. For four GPUs, the speedup ratio increases from 47 to 143 times and parallel efficiency maintains above 70%, demonstrating good scalability of the solver.

Key words: Graphics Processing Units (GPU), Compute Unified Device Architecture (CUDA), parallel computing, speedup ratio, parallel efficiency

CLC Number: