Explore chapters and articles related to this topic
MPI Parallel Implementation for Pseudo-Spectral Simulations for Turbulent Channel Flow
Published in International Journal of Computational Fluid Dynamics, 2020
Oh-Kyoung Kwon, Jin Lee, Junghoon Lee, Ji-Hoon Kang, Jung-Il Choi
The strong scalability test for the L550 case of grid points indicated that the non-blocking collective operations with 256 nodes using 16,384 MPI ranks provided 15.76 times performance improvement over 8 nodes using 512 MPI ranks. In addition, we evaluated which transposition algorithm is appropriate for the present study. Applying the non-blocking collective operations leads to 1.76 and 3.55 times better computation in terms of performance for the S180 and L550 cases compared to the baseline, respectively. Moreover, a weak scaling test confirmed that the non-blocking collective operations can be scaled up to 512 nodes using a 32,768 MPI ranks. This is because the non-blocking collective operations make performance be improved due to the latency mitigation by overlapping computation and communication.