Fusion PIC Code Performance Analysis on The Cori KNL System
Skip to main content
eScholarship
Open Access Publications from the University of California

Fusion PIC Code Performance Analysis on The Cori KNL System

Abstract

We study the attainable performance of Particle-In-Cell codes on the Cori KNL system by analyzing a miniature particle push application based on the fusion PIC code XGC1. We start from the most basic building blocks of a PIC code and build up the complexity to identify the kernels that cost the most in performance and focus optimization efforts there. Particle push kernels operate at high AI and are not likely to be memory bandwidth or even cache bandwidth bound on KNL. Therefore, we see only minor benefits from the high bandwidth memory available on KNL, and achieving good vectorization is the most beneficial optimization path and can theoretically yield up to 8x speedup on KNL, but is in practice limited by the data layout to 4x.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View