Data Dependency on Measurement Uncertainties in Speaker Recognition Evaluation
Jin Chu Wu, Alvin F. Martin, Craig S. Greenberg, Raghu N. Kacker
The National Institute of Standards and Technology conducts an ongoing series of Speaker Recognition Evaluations (SRE). Speaker detection performance is measured using a detection cost function defined as a weighted sum of the probabilities of type I and type II errors. The sampling variability can result in measurement uncertainties. In our prior study, the data independency was assumed in using the nonparametric two-sample bootstrap method to compute the standard errors (SE) of the detection cost function based on our extensive bootstrap variability studies in ROC analysis on large datasets. In this article, the data dependency caused by multiple uses of the same subjects is taken into account. The data are grouped into target sets and non-target sets, and each set contains multiple scores. One-layer and two-layer bootstrap methods are proposed based on whether the two-sample bootstrap resampling takes place only on target sets and non-target sets, or subsequently on target scores and non-target scores within the sets, respectively. The SEs of the detection cost function using these two methods along with those with the assumption of data independency are compared. It is found that the data dependency increases both estimated SEs and the variations of SEs. Some suggestions regarding the test design are provided.
Proc. of SPIE Vol. 8382, Active and Passive Signatures III
, Martin, A.
, Greenberg, C.
and Kacker, R.
Data Dependency on Measurement Uncertainties in Speaker Recognition Evaluation, Proc. of SPIE Vol. 8382, Active and Passive Signatures III, Baltimore, MD, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=911725
(Accessed February 29, 2024)