The Impact of Scenario Development on the Performance of Speech Translation Systems Prescribed by the SCORE Framework

Brian A. Weiss; Craig I. Schlenoff

Published

September 25, 2009

Abstract

The Defense Advanced Research Projects Agency's (DARPA) Spoken Language Communication and Translation for Tactical Use (TRANSTAC) program is a focused advanced technology research and development program. Its intent is to demonstrate capabilities to quickly develop and implement free-form, two-way, speech-to-speech translation systems that allow speakers of different languages to communicate with each other in real-world tactical situations without an interpreter. The National Institute of Standards and Technology (NIST), with support from the MITRE Corporation and Appen Pty Limited, has been funded by DARPA to evaluate the TRANSTAC technologies since 2006. The NIST-led Independent Evaluation Team (IET) has numerous responsibilities in this ongoing effort, including collecting and processing training data, designing and implementing performance evaluations, and analyzing the test data. To design and execute fair and relevant evaluations, the NIST IET has employed the System, Component and Operationally-Relevant Evaluation (SCORE) framework. SCORE is a unified set of criteria and tools built around the premise that, to understand how a technology would perform in its intended environment, the technology must be evaluated at both the component and system levels and further tested in operationally relevant environments, with both quantitative and qualitative performance data captured. Because one evaluation goal of the TRANSTAC program is to capture quantitative performance data on the translation technologies, the IET developed and implemented SCORE-inspired live evaluation scenarios. These scenarios take two forms, and each form has a distinct impact on the quantitative performance data. This paper presents the TRANSTAC program and the SCORE methodology, then focuses on the evaluation scenarios and their influence on system performance.

Proceedings Title

Proceedings of the Performance Metrics for Intelligent Systems (PerMIS) 2009

Conference Dates

September 21-23, 2009

Conference Location

Gaithersburg, MD

Pub Type

Conferences

Keywords

SCORE, TRANSTAC, Speech-to-Speech Translation System, Performance Metrics, Evaluation

Manufacturing and Manufacturing systems design and analysis

Citation

Weiss, B. and Schlenoff, C. (2009), The Impact of Scenario Development on the Performance of Speech Translation Systems Prescribed by the SCORE Framework, Proceedings of the Performance Metrics for Intelligent Systems (PerMIS) 2009, Gaithersburg, MD, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=903628 (Accessed April 23, 2024)

Created September 25, 2009, Updated February 19, 2017
