A performance portable implementation of the semi-Lagrangian algorithm in six dimensions

Schild, Nils; Räth, Mario; Eibl, Sebastian; Hallatschek, Klaus; Kormann, Katharina

doi:10.1016/j.cpc.2023.108973

Schild, N., Räth, M., Eibl, S., Hallatschek, K., & Kormann, K. (2023). A performance portable implementation of the semi-Lagrangian algorithm in six dimensions. Computer Physics Communications, 295, 108973. doi:10.1016/j.cpc.2023.108973.

Item is public

Basic information

Item permalink: https://hdl.handle.net/21.11116/0000-000D-FEB8-6
Version permalink: https://hdl.handle.net/21.11116/0000-000D-FEB9-5

Resource type: Journal article

Files

A performance portable implementation of the semi-Lagrangian algorithm in six dimensions.pdf (Full text (general)), 2MB

File permalink:
-

File name:
A performance portable implementation of the semi-Lagrangian algorithm in six dimensions.pdf

Description:
-

OA-Status:

Access restriction:
Not public

MIME type / checksum:
application/pdf

Technical metadata:

Copyright date:
-

Copyright information:
-

CC license:
-

Creators

Creators:
Schild, Nils, Author
Räth, Mario, Author
Eibl, Sebastian¹, Author
Hallatschek, Klaus, Author
Kormann, Katharina, Author

Affiliations:
¹ Max Planck Computing and Data Facility, Max Planck Society, ou_2364734

Content description

Keywords: -

Abstract: This paper describes our approach to developing a simulation software application for the fully kinetic 6D-Vlasov equation, which will be used to explore physics beyond the reduced gyrokinetic model. Simulating the fully kinetic Vlasov equation requires efficient utilization of compute and storage capabilities due to the high dimensionality of the problem. In addition, the implementation needs to be extensible regarding the physical model and flexible regarding the hardware for production runs. We start on the algorithmic background to simulate the 6-D Vlasov equation using a semi-Lagrangian algorithm. The performance portable software stack, which enables production runs on pure CPU as well as AMD or Nvidia GPU accelerated nodes, is presented. The extensibility of our implementation is guaranteed through the described software architecture of the main kernel, which achieves a memory bandwidth of almost 500 GB/s on a V100 Nvidia GPU and around 100 GB/s on an Intel Xeon Gold CPU using a single code base. We provide performance data on multiple node-level architectures discussing utilized and further available hardware capabilities. Finally, the network communication bottleneck of 6-D grid-based algorithms is quantified. A verification of physics beyond gyrokinetic theory, for the example of ion Bernstein waves, concludes the work.
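
For readers unfamiliar with the method named in the abstract, the following is a minimal one-dimensional sketch of a semi-Lagrangian step for the advection equation f_t + v f_x = 0 on a periodic grid. It is not taken from the paper: the function name `semi_lagrangian_step`, the constant velocity, and the use of linear interpolation are all assumptions chosen for brevity, and the abstract does not state which interpolation or splitting scheme the authors use. The idea shown is only the core one: trace each grid point back along its characteristic and interpolate the old solution at the foot point.

```python
import math


def semi_lagrangian_step(f, velocity, dt, dx):
    """Advance f_t + v * f_x = 0 by one time step on a periodic uniform grid.

    Hypothetical helper for illustration only; linear interpolation is an
    assumption, not the scheme described in the paper.
    """
    n = len(f)
    shift = velocity * dt / dx  # displacement measured in grid cells
    new_f = [0.0] * n
    for i in range(n):
        # Foot of the characteristic that arrives at grid point i.
        foot = i - shift
        left = math.floor(foot)
        weight = foot - left  # fractional offset within the cell, in [0, 1)
        # Linear interpolation of the old values, with periodic index wrap.
        new_f[i] = (1.0 - weight) * f[left % n] + weight * f[(left + 1) % n]
    return new_f
```

When the displacement is a whole number of cells, the step reduces to an exact periodic shift of the data; for fractional displacements, linear interpolation introduces numerical diffusion, which is one reason practical codes tend to use higher-order interpolation.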

Details

Language:

Dates: Published online: 2023-10-20

Publication status: Published online

Pages: -

Publishing information: -

Table of contents: -

Peer review: -

Identifiers (DOI, ISBN, etc.): DOI: 10.1016/j.cpc.2023.108973

Degree: -

Legal case

Project information

Publication 1

Publication title: Computer Physics Communications

Abbreviation: Comput. Phys. Commun.

Type: Journal

Authors/editors:

Affiliations:

Publisher, place of publication: Amsterdam : Elsevier B.V.

Pages: -
Volume/issue: 295
Sequence number: 108973
Start/end page: -
Identifiers (ISBN, ISSN, DOI, etc.): ISSN: 0010-4655
CoNE: https://pure.mpg.de/cone/journals/resource/954925392326
