Keywords:
Turbulence; Mixing; High Schmidt number; Compact finite differences; Nested OpenMP; Blue Waters
Abstract:
A new dual-communicator algorithm with very favorable performance characteristics has been developed for direct numerical simulation (DNS) of turbulent mixing of a passive scalar governed by an advection-diffusion equation. We focus on the regime of high Schmidt number (Sc), where, because of low molecular diffusivity, the grid-resolution requirements for the scalar field are stricter than those for the velocity field by a factor of √Sc. Computational throughput is improved by simulating the velocity field on a coarse grid of N_v^3 points with a Fourier pseudo-spectral (FPS) method, while the passive scalar is simulated on a fine grid of N_θ^3 points with a combined compact finite difference (CCD) scheme which computes first and second derivatives at eighth-order accuracy. A static three-dimensional domain decomposition and a parallel solution algorithm for the CCD scheme are used to avoid the heavy communication cost of memory transposes. A kernel is used to evaluate several approaches to optimizing the performance of the CCD routines, which account for 60% of the overall simulation cost. On the petascale supercomputer Blue Waters at the University of Illinois at Urbana-Champaign, scalability is improved substantially with a hybrid MPI-OpenMP approach in which a dedicated thread per NUMA domain overlaps communication calls with computational tasks performed by a separate team of threads spawned using OpenMP nested parallelism. At a target production problem size of 8192^3 (0.5 trillion) grid points on 262,144 cores, CCD timings are reduced by 34% compared to a pure-MPI implementation. Timings for 16384^3 (4 trillion) grid points on 524,288 cores encouragingly maintain scalability greater than 90%, although the wall-clock time is too high for production runs at this size. Performance monitoring with CrayPat for problem sizes up to 4096^3 shows that the CCD routines can achieve nearly 6% of the peak flop rate. The new DNS code is built upon two existing FPS and CCD codes. With the grid ratio N_θ/N_v = 8, the disparity in the computational requirements for the velocity and scalar problems is addressed by splitting the global communicator MPI_COMM_WORLD into disjoint communicators for the velocity and scalar fields. Inter-communicator transfer of the velocity field from the velocity communicator to the scalar communicator is handled with discrete send and non-blocking receive calls, which are overlapped with other operations on the scalar communicator. For production simulations at N_θ = 8192 and N_v = 1024 on 262,144 cores for the scalar field, the DNS code achieves 94% strong scaling relative to 65,536 cores and 92% weak scaling relative to N_θ = 1024 and N_v = 128 on 512 cores.
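
Note added for context (not part of the abstract): since the stated resolution penalty scales as √Sc, a scalar grid finer than the velocity grid by a factor of N_θ/N_v = 8 nominally accommodates Schmidt numbers up to 8^2 = 64 times larger than the velocity grid alone would support.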
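
The following is a minimal illustrative sketch in C (not the authors' code) of the dual-communicator construction described above: MPI_COMM_WORLD is split with MPI_Comm_split into disjoint velocity and scalar communicators. The 1:8 rank split and the color assignment are assumptions made only for this example; the real group sizes follow the velocity and scalar problem sizes.

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int world_rank, world_size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    /* Illustrative 1:8 split between velocity and scalar groups; the
     * production partitioning is not reproduced here. */
    int nv_ranks = (world_size + 8) / 9;
    int color    = (world_rank < nv_ranks) ? 0 : 1;   /* 0: velocity, 1: scalar */

    MPI_Comm sub_comm;                                /* disjoint sub-communicator */
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &sub_comm);

    int sub_rank, sub_size;
    MPI_Comm_rank(sub_comm, &sub_rank);
    MPI_Comm_size(sub_comm, &sub_size);
    printf("world rank %d -> %s communicator, rank %d of %d\n",
           world_rank, color == 0 ? "velocity" : "scalar", sub_rank, sub_size);

    MPI_Comm_free(&sub_comm);
    MPI_Finalize();
    return 0;
}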
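
Below is a minimal sketch of the overlapped transfer of the velocity field from the velocity group to the scalar group, assuming the point-to-point messages between the two groups travel over MPI_COMM_WORLD and that a non-blocking receive is posted on the scalar side before other scalar-communicator work proceeds. The two-rank setup, message size, tag, and placeholder work loop are illustrative assumptions, not the production communication pattern.

#include <mpi.h>
#include <stdio.h>

#define NU    4096          /* illustrative size of one velocity-field chunk */
#define TAG_U 100           /* illustrative message tag                       */

int main(int argc, char **argv)
{
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    if (size < 2) MPI_Abort(MPI_COMM_WORLD, 1);   /* needs a sender and a receiver */

    static double u_chunk[NU];

    if (rank == 0) {                     /* stands in for a velocity-communicator rank */
        for (int i = 0; i < NU; i++) u_chunk[i] = (double)i;
        MPI_Send(u_chunk, NU, MPI_DOUBLE, 1, TAG_U, MPI_COMM_WORLD);
    } else if (rank == 1) {              /* stands in for a scalar-communicator rank   */
        MPI_Request req;

        /* Post the non-blocking receive for the velocity data first. */
        MPI_Irecv(u_chunk, NU, MPI_DOUBLE, 0, TAG_U, MPI_COMM_WORLD, &req);

        /* Overlap with scalar-side operations that do not need the velocity
         * field (trivial placeholder work here). */
        double s = 0.0;
        for (int i = 0; i < 1000000; i++) s += 1e-6;

        /* Block only when the velocity field is actually required. */
        MPI_Wait(&req, MPI_STATUS_IGNORE);
        printf("scalar rank received %d velocity values (s = %.1f)\n", NU, s);
    }

    MPI_Finalize();
    return 0;
}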
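
Finally, a minimal sketch of the hybrid MPI-OpenMP overlap strategy mentioned in the abstract: one thread of a two-thread outer team (here the master thread, so MPI_THREAD_FUNNELED suffices) carries out a non-blocking exchange while a nested inner team advances interior work that does not depend on the incoming data. The pairwise neighbor choice, buffer sizes, inner thread count, and the stand-in compute loop are assumptions made for illustration; the production code's dedicated thread per NUMA domain and its CCD kernels are not reproduced here.

#include <mpi.h>
#include <omp.h>

/* Overlap a non-blocking halo exchange with interior computation using a
 * two-thread outer OpenMP team and a nested inner team for the compute work. */
void exchange_and_compute(double *halo_send, double *halo_recv, int halo_n,
                          double *u, long n, int nbr, int inner_threads,
                          MPI_Comm comm)
{
    omp_set_max_active_levels(2);              /* enable nested parallelism */

    #pragma omp parallel num_threads(2)
    {
        if (omp_get_thread_num() == 0) {
            /* Master thread acts as the dedicated communication thread. */
            MPI_Request req[2];
            MPI_Irecv(halo_recv, halo_n, MPI_DOUBLE, nbr, 0, comm, &req[0]);
            MPI_Isend(halo_send, halo_n, MPI_DOUBLE, nbr, 0, comm, &req[1]);
            MPI_Waitall(2, req, MPI_STATUSES_IGNORE);
        } else {
            /* The other outer thread spawns a nested team for interior work
             * that does not depend on the incoming halo data. */
            #pragma omp parallel for num_threads(inner_threads)
            for (long i = 0; i < n; i++)
                u[i] = 2.0 * u[i];             /* stand-in for CCD stencil work */
        }
    }   /* implicit barrier: communication and interior work both complete */
}

int main(int argc, char **argv)
{
    int provided, rank, size;
    MPI_Init_thread(&argc, &argv, MPI_THREAD_FUNNELED, &provided);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    enum { HALO = 1024, N = 1 << 20 };
    static double send_buf[HALO], recv_buf[HALO], u[N];

    int nbr = rank ^ 1;                        /* pairwise partner (illustrative) */
    if (nbr >= size) nbr = rank;               /* odd rank count: exchange with self */

    exchange_and_compute(send_buf, recv_buf, HALO, u, N, nbr, 4, MPI_COMM_WORLD);

    MPI_Finalize();
    return 0;
}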