An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Results of the 1999 Topic Detection and Tracking Evaluation in Mandarin and English
Published
Author(s)
Jonathan G. Fiscus, G R. Doddington
Abstract
The National Institute of Standards and Technology (NIST) administered the second open evaluation of Topic Detection and Tracking (TDT) technologies in 1999. The TDT project supports development of technologies that automatically organize event-related news stories. The program leverages expertise in core technologies, Automatic Speech Recognition (ASR), Document Retrieval (DR), and Machine Translation (MT) to build the TDT technologies.The 1999 TDT project extended the 1998 TDT project in two dimensions, first by adding Mandarin Chinese audio and text sources and second by adding two new evaluation tasks. Through experimental controls and conditioned analysis of system performance, the 1999 evaluation yielded numerous insights into the effects of multilingual texts on TDT technologies. Three notable generalizations arise from the evaluation: (1) English and Mandarin story segmentation performance is similar, (2) cross-lingual topic tracking performance is 44 % worse than monolingual tracking, and (3) multilingual topic detection performance is 37 % worse than monolingual topic detection.
Citation
International Conference on Spoken Language Processing
Fiscus, J.
and Doddington, G.
(2000),
Results of the 1999 Topic Detection and Tracking Evaluation in Mandarin and English, International Conference on Spoken Language Processing
(Accessed November 2, 2024)