LeDell, Erin; Petersen, Maya; van der Laan, Mark

doi:10.1214/15-ejs1035

Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates

2015

Published Web Location

https://doi.org/10.1214/15-ejs1035

Abstract

In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence-curve-based approach to obtaining a variance estimate for cross-validated AUC.
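
The abstract does not spell out the estimator, so the following is only a minimal Python sketch of what an influence-curve-based confidence interval for cross-validated AUC might look like. It assumes i.i.d. observations, 0/1 labels, held-out predictions already computed for each fold, and an influence curve of the form "class weight times (fraction of opposite-class scores ranked correctly, minus the fold AUC)". All function names (auc, fold_ic_mean_square, cv_auc_ci) and the default z = 1.96 are hypothetical choices for illustration, not taken from the paper.

```python
import math


def auc(scores, labels):
    """Empirical AUC: share of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. Labels are assumed to be 0/1 and
    both classes must be present.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def fold_ic_mean_square(scores, labels, w1, w0):
    """Mean squared (assumed) influence-curve value within one validation fold.

    w1 and w0 are inverse class proportions, 1/P(Y=1) and 1/P(Y=0),
    estimated from the pooled data.
    """
    a = auc(scores, labels)
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    total = 0.0
    for s, y in zip(scores, labels):
        if y == 1:
            # fraction of negatives scored strictly below this positive
            frac = sum(q < s for q in neg) / len(neg)
            ic = w1 * (frac - a)
        else:
            # fraction of positives scored strictly above this negative
            frac = sum(p > s for p in pos) / len(pos)
            ic = w0 * (frac - a)
        total += ic * ic
    return total / len(scores)


def cv_auc_ci(fold_scores, fold_labels, z=1.96):
    """Cross-validated AUC, its standard error, and a Wald-type interval.

    fold_scores[v] and fold_labels[v] hold the held-out predictions and
    labels for validation fold v. No resampling is performed.
    """
    n = sum(len(lab) for lab in fold_labels)
    n_pos = sum(sum(lab) for lab in fold_labels)
    w1 = n / n_pos
    w0 = n / (n - n_pos)
    folds = list(zip(fold_scores, fold_labels))
    cv_auc = sum(auc(s, lab) for s, lab in folds) / len(folds)
    sigma2 = sum(fold_ic_mean_square(s, lab, w1, w0) for s, lab in folds) / len(folds)
    se = math.sqrt(sigma2 / n)
    # truncate the interval to the valid AUC range [0, 1]
    return cv_auc, se, (max(0.0, cv_auc - z * se), min(1.0, cv_auc + z * se))
```

Under these assumptions the interval needs only a single pass over the cross-validated predictions, whereas a bootstrap interval would repeat the whole cross-validation procedure once per resample.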

UC Berkeley
