An Empirical Study of Sample Size in ROC-Curve Analysis of Fingerprint Data

Published

Author(s)

Jin Chu Wu, Charles Wilson

Abstract

Fingerprint datasets may in many cases exceed millions of samples, so the size of the test sample needed for a biometric evaluation is an important issue for both accuracy and efficiency. In this article, an empirical study that combines Chebyshev's inequality with simple random sampling is applied to determine the sample size for biometric applications. No parametric model is assumed, since the underlying distribution functions of the similarity scores are unknown. The performance of a fingerprint-image matcher is measured by a Receiver Operating Characteristic (ROC) curve. Two criteria are employed: the area under the ROC curve and the True Accept Rate (TAR) at an operational False Accept Rate (FAR). Chebyshev's greater-than-95% intervals for these two criteria, based on 500 Monte Carlo iterations, are computed for different sample sizes and for both high- and low-quality fingerprint-image matchers. The stability of these Monte Carlo calculations with respect to the number of iterations is also explored. The choice of sample size depends on the quality of the matcher as well as on which performance criterion is invoked. In general, for 6,000 match similarity scores, 50,000 to 70,000 scores randomly selected from 35,994,000 nonmatch similarity scores can ensure the accuracy with greater-than-95% probability.
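The procedure summarized in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code. The function names, the multiplier k = 4.5 (chosen only because 1 - 1/k^2 then exceeds 0.95), the use of the Mann-Whitney statistic for the area under the ROC curve, and the synthetic Gaussian scores are all assumptions made for illustration. Because the mean and standard deviation are estimated from the Monte Carlo iterations rather than known, the Chebyshev bound holds only approximately here.

```python
import bisect
import random
import statistics


def auc(match_scores, nonmatch_scores):
    """Area under the ROC curve via the Mann-Whitney statistic (ties count 1/2)."""
    ordered = sorted(nonmatch_scores)
    total = 0.0
    for s in match_scores:
        below = bisect.bisect_left(ordered, s)          # nonmatch scores strictly below s
        ties = bisect.bisect_right(ordered, s) - below  # nonmatch scores equal to s
        total += below + 0.5 * ties
    return total / (len(match_scores) * len(ordered))


def chebyshev_interval(values, k=4.5):
    """Return mean -/+ k sample standard deviations.

    Chebyshev's inequality bounds the probability of falling inside
    mean -/+ k*sigma from below by 1 - 1/k**2 (about 0.9506 for k = 4.5).
    """
    center = statistics.mean(values)
    spread = statistics.stdev(values)
    return center - k * spread, center + k * spread


def monte_carlo_auc(match_scores, nonmatch_scores, sample_size,
                    iterations=500, seed=0):
    """AUC values from repeated simple random samples of the nonmatch scores."""
    rng = random.Random(seed)
    return [
        auc(match_scores, rng.sample(nonmatch_scores, sample_size))
        for _ in range(iterations)
    ]


if __name__ == "__main__":
    # Synthetic stand-in data (assumption): real similarity scores are not Gaussian.
    data_rng = random.Random(42)
    match = [data_rng.gauss(2.0, 1.0) for _ in range(300)]
    nonmatch = [data_rng.gauss(0.0, 1.0) for _ in range(20000)]
    for n in (500, 2000, 8000):
        values = monte_carlo_auc(match, nonmatch, n, iterations=100)
        low, high = chebyshev_interval(values)
        print(f"sample size {n}: interval width {high - low:.5f}")
```

In a sketch like this, the interval width would be compared against a target accuracy, and the smallest sample size that meets the target would be chosen. The same loop could be repeated with TAR at a fixed FAR in place of the area under the curve.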
Proceedings Title
Proceedings on SPIE Conference
Volume
6202
Conference Dates
April 10-14, 2006
Conference Location
Orlando, FL

Keywords

Empirical Study, Chebyshev's Inequality, Simple Random Sampling, Sample Size, Receiver Operating Characteristic (ROC) Curve, Data Analysis, Stability Metric, Monte Carlo Calculation, Biometrics, Fingerprint Matching

Citation

Wu, J. and Wilson, C. (2006), An Empirical Study of Sample Size in ROC-Curve Analysis of Fingerprint Data, Proceedings on SPIE Conference, Orlando, FL, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=152122 (Accessed October 8, 2024)

Created December 4, 2006, Updated February 19, 2017