Significance Test in Speaker Recognition Data Analysis with Data Dependency
Jin Chu Wu, Alvin F. Martin, Craig S. Greenberg, Raghu N. Kacker, Vincent M. Stanford
To evaluate the performance of speaker recognition systems, a detection cost function defined as a weighted sum of the probabilities of type I and type II errors is employed. The speaker datasets may have data dependency due to multiple uses of the same subjects. Using the standard errors of the detection cost function computed by means of the two-layer nonparametric two-sample bootstrap method, a significance test is performed to determine whether the difference between the measured performance levels of two speaker recognition algorithms is statistically significant. While conducting the significance test, the correlation coefficient between two detection cost functions for two algorithms, respectively, is taken into account. Examples are provided.
, Martin, A.
, Greenberg, C.
, Kacker, R.
and Stanford, V.
Significance Test in Speaker Recognition Data Analysis with Data Dependency, NIST Interagency/Internal Report (NISTIR), National Institute of Standards and Technology, Gaithersburg, MD, [online], https://doi.org/10.6028/NIST.IR.7884
(Accessed February 22, 2024)