
Item details

  Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL Cluster

Hirokawa, Y., Boku, T., Uemoto, M., Sato, S., & Yaban, K. (2018). Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL Cluster. In R. Yokota, M. Weiland, J. Shalf, & S. Alam (Eds.), High Performance Computing. Basel, Switzerland: Springer International Publishing. doi:10.1007/978-3-319-92040-5_11.

Basic information

Item permalink: https://hdl.handle.net/21.11116/0000-0002-8520-3
Version permalink: https://hdl.handle.net/21.11116/0000-0002-8521-2
Genre: Conference paper

Files

Hirokawa2018_Chapter_PerformanceOptimizationAndEval.pdf (publisher version), 2MB
File permalink: -
File name: Hirokawa2018_Chapter_PerformanceOptimizationAndEval.pdf
Description: -
OA-Status:
Visibility: Private
MIME type / checksum: application/pdf
Technical metadata:
Copyright date: -
Copyright info: -
CC license: -

Related URLs

Description: -
OA-Status:

Creators

Creators:
Hirokawa, Y.1, author
Boku, T.1, 2, author
Uemoto, M.2, author
Sato, S.3, author
Yaban, K.2, author
Affiliations:
1 Graduate School of Systems and Information Engineering, University of Tsukuba, ou_persistent22
2 Center for Computational Sciences, University of Tsukuba, ou_persistent22
3 Theory Group, Theory Department, Max Planck Institute for the Structure and Dynamics of Matter, Max Planck Society, ou_2266715

Content

Keywords: -
Abstract: “ARTED” is an advanced scientific code for electron dynamics simulation which has been ported to various large-scale parallel systems including the “K” Computer, formerly the fastest supercomputer in the world, and many other MPP and cluster systems.

In this paper, we describe ARTED’s code optimization and performance evaluation applied to a large-scale cluster with Intel’s latest many-core processor, KNL (Knights Landing), based on past research regarding porting ARTED to the KNC (Knights Corner) coprocessor. Code optimization for dominant computation has been thoroughly carried out in KNL to achieve the highest performance with detailed optimization such as memory access, vectorization for the AVX-512 instruction set, cache utilization, etc. For further tuning, we investigated various KNL-dedicated techniques such as combining MCDRAM/DDR4 memories and parallel vector summation.

After detailed performance tuning on each core to achieve up to 25% of theoretical peak in the kernel part with 3-D stencil computation, we evaluated the application performance on the full system (25 PFLOPS of theoretical peak) of the KNL cluster “Oakforest-PACS”, which is the largest KNL-based cluster in the world using the Intel Omni-Path Architecture. It shows excellent weak scaling with a dominant Hamiltonian performance of up to 4 PFLOPS (16% efficiency of the system) in double precision irrespective of simulation size, as well as reasonable strong scaling on material simulations requiring a high degree of parallelism.

Details

Language: eng - English
Date: 2018
Publication status: Published
Pages: 21
Publishing info: -
Table of contents: -
Review method: Peer reviewed (internal)
Identifiers (DOI, ISBN, etc.): DOI: 10.1007/978-3-319-92040-5_11
Degree: -

Event

Event name: 33rd International Conference on High Performance Computing (ISC High Performance)
Place of event: Frankfurt/Main, Germany
Start/end date: 2018-06-24 - 2018-06-28

Source 1

Title: High Performance Computing
Subtitle: ISC High Performance 2018 International Workshops, Frankfurt/Main, Germany, June 24 - 28, 2018, Revised Selected Papers
Source genre: Book
Creators:
Yokota, R.1, editor
Weiland, M.1, editor
Shalf, J.1, editor
Alam, S.1, editor
Affiliations:
1 external, ou_persistent22
Publisher, place: Basel, Switzerland : Springer International Publishing
Pages: -
Volume / issue: 11203
Sequence number: -
Start / end page: -
Identifiers (ISBN, ISSN, DOI, etc.): ISBN: 978-3-030-02464-2
DOI: 10.1007/978-3-030-02465-9