The speaker recognition evaluation is an ongoing series of evaluations conducted by NIST. A detection cost function is computed over the sequence of trials provided and used for all speaker detection tests while measuring speaker detection performance. The detection cost function is defined as a weighted sum of probabilities of type I error and type II error. The sampling variability can result in measurement uncertainties of the detection cost function. Hence, while evaluating and comparing the performances of speaker recognition systems, the measurement uncertainties must be taken into account. In this article, the uncertainties of measures of detection cost functions in terms of standard errors (SE) and confidence intervals are computed using the nonparametric two-sample bootstrap methods based on our extensive bootstrap variability studies on large datasets conducted before. The data independence is assumed because the bootstrap results of SEs matched very well with the analytical results of SEs using the Mann-Whitney statistic if the metric of area under ROC curve is employed. Examples are provided.
Citation: NIST Interagency/Internal Report (NISTIR) - 7722Report Number:
NIST Pub Series: NIST Interagency/Internal Report (NISTIR)
Pub Type: NIST Pubs
speaker recognition evaluation, biometrics, bootstrap, uncertainty, standard error, confidence interval