日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  Performance Evaluation of Large Scale Electron Dynamics Simulation under Many-core Cluster based on Knights Landing

Hirokawa, Y., Boku, T., Sato, S., & Yabana, K. (2018). Performance Evaluation of Large Scale Electron Dynamics Simulation under Many-core Cluster based on Knights Landing. In Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2018) (pp. 183-191). New York: ACM. doi:10.1145/3149457.3149465.

Item is

基本情報

表示: 非表示:
アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-0002-CCE5-6 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0002-CCE6-5
資料種別: 会議論文

ファイル

表示: ファイル
非表示: ファイル
:
p183-hirokawa.pdf (出版社版), 418KB
ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-0002-CCE7-4
ファイル名:
p183-hirokawa.pdf
説明:
-
OA-Status:
閲覧制限:
公開
MIMEタイプ / チェックサム:
application/pdf / [MD5]
技術的なメタデータ:
著作権日付:
2018
著作権情報:
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).
CCライセンス:
-

関連URL

表示:
非表示:
説明:
-
OA-Status:

作成者

表示:
非表示:
 作成者:
Hirokawa, Y.1, 著者
Boku, T.2, 著者
Sato, S.3, 著者           
Yabana, K.2, 著者
所属:
1Graduate School of Systems and Information Engineering, University of Tsukuba, ou_persistent22              
2Center for Computational Sciences, University of Tsukuba, ou_persistent22              
3Theory Group, Theory Department, Max Planck Institute for the Structure and Dynamics of Matter, Max Planck Society, ou_2266715              

内容説明

表示:
非表示:
キーワード: Intel Xeon Phi, Knights Landing, Electron Dynamics Simulation
 要旨:

We have been developing an advanced scientific code called "ARTED" for an electron dynamics simulation using the first-order computation of materials to be ported to various large-scale parallel systems including the "K" Computer, which was previously Japan's fastest supercomputer. In this paper, the implementation and performance evaluation of the ARTED code used in Intel's latest many-core processor, the Knights Landing (KNL) stand-alone cluster, are described based on past research on porting the code to the Knights Corner (KNC) accelerator. Our target system is Oakforest-PACS, which is currently the fastest supercomputer in Japan. For performance tuning on KNL, the largest issue is how to utilize multiple levels of parallelism, such as the instruction level (512-bit SIMD instruction), hardware thread (4 threads/core), and large number of cores. We focus on the dominant computation part of the code, where 25 points of a 3D stencil computation are required.

We successfully optimize this part to achieve 758.4 GFLOPS per node, which corresponds to 24.8% of the theoretical peak on the node of Oakforest-PACS using an Intel Xeon Phi 7250 (3046 GFLOPS peak). It is also shown that the KNL sustained performance is better than that of the two KNC accelerator cards. The entire ARTED code implies other time step computing, and was designed for a large-scale parallel execution using MPI, whereas single-node parallelization is achieved using OpenMP. We finally evaluate the entire parallel execution performance with up to 128 nodes.

資料詳細

表示:
非表示:
言語: eng - English
 日付: 20182018
 出版の状態: 出版
 ページ: 9
 出版情報: -
 目次: -
 査読: 査読あり(内部)
 識別子(DOI, ISBNなど): DOI: 10.1145/3149457.3149465
 学位: -

関連イベント

表示:
非表示:
イベント名: International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia)
開催地: Tokyo, Japan
開始日・終了日: 2018-01-28 - 2018-01-31

訴訟

表示:

Project information

表示:

出版物 1

表示:
非表示:
出版物名: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2018)
種別: 会議論文集
 著者・編者:
所属:
出版社, 出版地: New York : ACM
ページ: - 巻号: - 通巻号: - 開始・終了ページ: 183 - 191 識別子(ISBN, ISSN, DOIなど): -