Skip to main content
eScholarship
Open Access Publications from the University of California

Performance evaluation and enhancement of SuperLU_DIST 2.0

Abstract

We present the runtime comparison of the two versions of Super LU_DIST, using up to 128 processors of the IBM SP at NERSC. One version provides the global input interface, and another provides the distributed input interface. The comparison includes the total runtime of the solver with both 32-bit and 64-bit addressing modes, the time breakdown for different phases of the solver. We also present an in-depth comparison off our sparse matrix-vector multiplication methods in the context of iterative refinement. Finally, we describe our Fortran 90 interface that enhances the usability of the software.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View