Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Monte Carlo studies of bootstrap variability in ROC analysis with data dependency

Published

Author(s)

Jin Chu Wu, Alvin F. Martin, Raghu N. Kacker

Abstract

ROC analysis involving two large datasets is an important method for analyzing statistics of interest for decision making of a classifier in many disciplines. And data dependency due to multiple use of the same subjects exists ubiquitously in order to generate more samples because of limited resources. Hence, a two-layer data structure is constructed and the nonparametric two-sample two-layer bootstrap is employed to estimate standard errors of statistics of interest derived from two sets of data, such as a weighted sum of two probabilities. In this article, to reduce the bootstrap variance and ensure the accuracy of computation, Monte Carlo studies of bootstrap variability were carried out to determine the appropriate number of bootstrap replications in ROC analysis with data dependency. It is suggested that with a tolerance 0.02 of the coefficient of variation, 2,000 bootstrap replications be appropriate under such circumstances.
Citation
Communications in Statistics Part B-Simulation and Computation
Volume
48

Keywords

Bootstrap variability, Bootstrap replications, ROC analysis, Data dependency, Large datasets, Standard error

Citation

, J. , Martin, A. and Kacker, R. (2019), Monte Carlo studies of bootstrap variability in ROC analysis with data dependency, Communications in Statistics Part B-Simulation and Computation (Accessed April 18, 2024)
Created August 1, 2019