Skip to main content
eScholarship
Open Access Publications from the University of California

Fusion PIC code performance analysis on the Cori KNL system

Published Web Location

https://cug.org/proceedings/cug2017_proceedings/includes/files/pap152s2-file1.pdf
No data is associated with this publication.
Abstract

We study the attainable performance of Particle-In-Cell codes on the Cori KNL system by analyzing a miniature particle push application based on the fusion PIC code XGC1. We start from the most basic building blocks of a PIC code and build up the complexity to identify the kernels that cost the most in performance and focus optimization efforts there. Particle push kernels operate at high AI and are not likely to be memory bandwidth or even cache bandwidth bound on KNL. Therefore, we see only minor benefits from the high bandwidth memory available on KNL, and achieving good vectorization is shown to be the most beneficial optimization path with theoretical yield of up to 8x speedup on KNL. In practice we are able to obtain up to a 4x gain from vectorization due to limitations set by the data layout and memory latency.

Item not freely available? Link broken?
Report a problem accessing this item