Resources


"In the present work, we optimized the PCG (Preconditioned Conjugate Gradient) method on a KNL cluster using OpenMP. The target application is a 3D static linear-elastic problem in solid mechanics, based on GeoFEM/Cube. We introduce a calculation-communication overlapping technique with dynamic scheduling for the SpMV routine in the PCG iteration. As the KNL cluster, we use the Oakforest-PACS system, which was introduced by JCAHPC (Joint Center for Advanced HPC) under a collaboration between ITC, University of Tokyo and CCS, University of Tsukuba, and has a 68-core KNL in each node. We investigate performance under various combinations of memory mode and clustering mode (Flat+Quadrant, Cache+Quadrant, Flat+SNC-4, and Cache+SNC-4), using 32 nodes with MCDRAM only. In the current results, we observed the best performance with 64 threads per MPI process in Flat+Quadrant mode, and dynamic scheduling for OpenMP is effective at such high thread counts."
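The overlapping idea named in the abstract can be illustrated with a minimal C/OpenMP sketch. This is not the authors' code: the function names, the CSR layout, the split into "interior" rows (touching only local entries of x) and "boundary" rows (touching halo entries), the `halo_exchange_fn` callback standing in for the MPI exchange, and the chunk size of 64 are all assumptions made for illustration.

```c
/* Hypothetical callback standing in for the MPI halo exchange.
   It is expected to fill the halo entries at the tail of x. */
typedef void (*halo_exchange_fn)(double *x, void *ctx);

/* Dot product of CSR row i with x. */
static double csr_row_dot(const int *row_ptr, const int *col,
                          const double *val, const double *x, int i)
{
    double sum = 0.0;
    for (int k = row_ptr[i]; k < row_ptr[i + 1]; ++k)
        sum += val[k] * x[col[k]];
    return sum;
}

/* y = A*x for a CSR matrix whose rows are ordered so that
   rows [0, n_interior) reference only local entries of x and
   rows [n_interior, n_rows) may reference halo entries.
   exchange may be a null pointer for a single-process run. */
void spmv_overlap(int n_interior, int n_rows,
                  const int *row_ptr, const int *col, const double *val,
                  double *x, double *y,
                  halo_exchange_fn exchange, void *ctx)
{
    #pragma omp parallel
    {
        /* Only the master thread communicates. "master" has no implied
           barrier, so the other threads proceed to the loop below. */
        #pragma omp master
        {
            if (exchange)
                exchange(x, ctx);
        }

        /* Dynamic scheduling hands out chunks on demand, so the master
           thread simply picks up whatever interior rows are left once
           the exchange returns. The chunk size 64 is an arbitrary guess. */
        #pragma omp for schedule(dynamic, 64)
        for (int i = 0; i < n_interior; ++i)
            y[i] = csr_row_dot(row_ptr, col, val, x, i);

        /* The implicit barrier at the end of the loop above guarantees
           that the halo entries are in place before boundary rows run. */
        #pragma omp for schedule(static)
        for (int i = n_interior; i < n_rows; ++i)
            y[i] = csr_row_dot(row_ptr, col, val, x, i);
    }
}
```

With a static schedule the master thread's share of interior rows would wait until communication finished, which is one plausible reason dynamic scheduling helps at high thread counts. If the callback calls MPI from the master thread, MPI would need to be initialized with at least `MPI_THREAD_FUNNELED`.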

Event Name

IXPUG BoF SC16

Keywords

ixpug

Video Name

NA