Lawrence Berkeley National Laboratory
Performance evaluation and enhancement of SuperLU_DIST 2.0
- Author(s): Li, Xiaoye S.
- Wang, Yu
- et al.
We present the runtime comparison of the two versions of Super LU_DIST, using up to 128 processors of the IBM SP at NERSC. One version provides the global input interface, and another provides the distributed input interface. The comparison includes the total runtime of the solver with both 32-bit and 64-bit addressing modes, the time breakdown for different phases of the solver. We also present an in-depth comparison off our sparse matrix-vector multiplication methods in the context of iterative refinement. Finally, we describe our Fortran 90 interface that enhances the usability of the software.