Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Results of the 1999 Topic Detection and Tracking Evaluation in Mandarin and English

Published

Author(s)

Jonathan G. Fiscus, G R. Doddington

Abstract

The National Institute of Standards and Technology (NIST) administered the second open evaluation of Topic Detection and Tracking (TDT) technologies in 1999. The TDT project supports development of technologies that automatically organize event-related news stories. The program leverages expertise in core technologies, Automatic Speech Recognition (ASR), Document Retrieval (DR), and Machine Translation (MT) to build the TDT technologies.The 1999 TDT project extended the 1998 TDT project in two dimensions, first by adding Mandarin Chinese audio and text sources and second by adding two new evaluation tasks. Through experimental controls and conditioned analysis of system performance, the 1999 evaluation yielded numerous insights into the effects of multilingual texts on TDT technologies. Three notable generalizations arise from the evaluation: (1) English and Mandarin story segmentation performance is similar, (2) cross-lingual topic tracking performance is 44 % worse than monolingual tracking, and (3) multilingual topic detection performance is 37 % worse than monolingual topic detection.
Citation
International Conference on Spoken Language Processing
Volume
CD-ROM

Keywords

benchmark evaluation, document retrieval, language, spoken language, topic detection and tracking

Citation

Fiscus, J. and Doddington, G. (2000), Results of the 1999 Topic Detection and Tracking Evaluation in Mandarin and English, International Conference on Spoken Language Processing (Accessed April 19, 2024)
Created January 1, 2000, Updated February 17, 2017