Search

Article
Peer Reviewed

Biot Savart Law integrator BioSaw

LBL Publications (2015)

This contribution documents the methods used in the BioSaw code. The code is inteded to be a flexible tool for calculating magnetic fields due to coils in magnetic confinement fusion devices. It assumes the conductors are infinitesimally thin and can be described as either point sequences or circular coils. The code can calculate both the magnetic field as well as the vector potential due to the coils. The fields can be reduced very near the coils to avoid singular behaviour caused by the thin conductor approximation.

Cover page: Biot Savart Law integrator BioSaw

Article
Peer Reviewed

Optimizing Fusion PIC Code Performance at Scale on Cori Phase Two

LBL Publications (2017)

In this paper we present the results of optimizing the performance of the gyrokinetic full-f fusion PIC code XGC1 on the Cori Phase Two Knights Landing system. The code has undergone substantial development to enable the use of vector instructions in its most expensive kernels within the NERSC Exascale Science Applications Program. We study the single-node performance of the code on an absolute scale using the roofline methodology to guide optimization efforts. We have obtained 2× speedups in single node performance due to enabling vectorization and performing memory layout optimizations. On multiple nodes, the code is shown to scale well up to 4000 nodes, near half the size of the machine. We discuss some communication bottlenecks that were identified and resolved during the work.

Article
Peer Reviewed

Evaluating and Optimizing the NERSC Workload on Knights Landing

LBL Publications (2016)

NERSC has partnered with 20 representative application teams to evaluate performance on the Xeon-Phi Knights Landing architecture and develop an application-optimization strategy for the greater NERSC workload on the recently installed Cori system. In this article, we present early case studies and summarized results from a subset of the 20 applications highlighting the impact of important architecture differences between the Xeon-Phi and traditional Xeon processors. We summarize the status of the applications and describe the greater optimization strategy that has formed.

Cover page: Evaluating and Optimizing the NERSC Workload on Knights Landing