The NIST 1999 Speaker Recognition Evaluation - An Overview

Alvin F. Martin; Mark A. Przybocki

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

The NIST 1999 Speaker Recognition Evaluation - An Overview

Published

December 31, 1999

Author(s)

Alvin F. Martin, Mark A. Przybocki

Abstract

This article summarizes the 1999 NIST Speaker Recognition Evaluation. It discusses the overall research objectives, the three task definitions, the development and evaluation data sets, the specified performance measures and their manner of presentation, the overall quality of the results, and the officially declared winners.More than a dozen sites from the United States, Europe, and Asia participated in this evaluation. There were three primary tasks for which automatic systems could be designed: one-speaker detection, two-speaker detection, and speaker tracking. All three tasks were performed in the context of mu-law encoded conversational telephone speech. The one-speaker detection task used single channel data, while the other two tasks used summed two-channel data.About 500 target speakers were specified, with two minutes of training speech data provided for each. Both multiple and single speaker test segments were selected from about 2000 conversations that were not used for training material. The duration of the multiple speaker test data was nominally one minute, while the duration of the single speaker test segments varied from near zero up to sixty seconds. For each task, systems had to make independent decisions for selected combinations of a test segment and a hypothesized target speaker. The data sets for each task were designed to be large enough to provide statistically meaningful results on test subsets of interest. Results were analyzed with respect to various conditions including, duration, pitch differences, and handset types.

Citation

Digital Signal Processing

Volume

Issue

No. 1-3

Pub Type

Journals

Download Paper

https://doi.org/10.1006/dspr.1999.0355

Local Download

Keywords

DET curve, NIST evaluations, performance evaluation, speaker identification, speaker recognition, speaker verification

Information technology

Citation

Martin, A. and Przybocki, M. (1999), The NIST 1999 Speaker Recognition Evaluation - An Overview, Digital Signal Processing, [online], https://doi.org/10.1006/dspr.1999.0355, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=151447 (Accessed July 26, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created December 31, 1999, Updated July 8, 2025

Was this page helpful?

The NIST 1999 Speaker Recognition Evaluation - An Overview

Author(s)

Abstract

Download Paper

Keywords

Citation

Additional citation formats

Issues