Skip to main content
eScholarship
Open Access Publications from the University of California

Optimizing Fusion PIC Code Performance at Scale on Cori Phase Two

Published Web Location

https://www.ixpug.org/documents/14982562688_Koskela_XGC1.pdf
No data is associated with this publication.
Abstract

In this paper we present the results of optimizing the performance of the gyrokinetic full-f fusion PIC code XGC1 on the Cori Phase Two Knights Landing system. The code has undergone substantial development to enable the use of vector instructions in its most expensive kernels within the NERSC Exascale Science Applications Program. We study the single-node performance of the code on an absolute scale using the roofline methodology to guide optimization efforts. We have obtained 2× speedups in single node performance due to enabling vectorization and performing memory layout optimizations. On multiple nodes, the code is shown to scale well up to 4000 nodes, near half the size of the machine. We discuss some communication bottlenecks that were identified and resolved during the work.

Item not freely available? Link broken?
Report a problem accessing this item