Publications

Displaying 1 - 18 of 18

2024 NIST GenAI (Pilot Study): Text-to-Text Evaluation Overview and Results

June 25, 2025

Author(s)

Hariharan Iyer, Seungmin Seo, Lukas Diduch, Kay Peterson, George Awad, Yooyoung Lee

The 2024 NIST Generative AI (GenAI) Pilot Study focuses on evaluating text-to-text (T2T) generation and discrimination tasks to assess the capabilities and limitations of generative AI models and AI detectors. The study aims to measure the effectiveness of

NIST GenAI Webinar (Text-to-Text)

August 1, 2024

Author(s)

Yooyoung Lee, Hariharan Iyer, Seungmin Seo, Kay Peterson, George Awad, Lukas Diduch

2024 NIST Generative AI (GenAI): Data Creation Specification for Text-to-Text (T2T) Generators

April 1, 2024

Author(s)

Yooyoung Lee, George Awad, Asad Butt, Lukas Diduch, Kay Peterson, Seungmin Seo, Ian Soboroff, Hariharan Iyer

Generator (G) teams will be tested on their system ability to generate content that is indistinguishable from human-generated content. For the pilot study, the evaluation will help determine strengths and weaknesses in their approaches including insights

2024 NIST Generative AI (GenAI): Evaluation Plan for Text-to-Text (T2T) Discriminators

April 1, 2024

Author(s)

Yooyoung Lee, George Awad, Asad Butt, Lukas Diduch, Kay Peterson, Seungmin Seo, Ian Soboroff, Hariharan Iyer

Generator (G) teams will be tested on their system's ability to generate content that is indistinguishable from human-generated content. For the pilot study, the evaluation will help determine strengths and weaknesses in their approaches including insights

OpenASR21: The Second Open Challenge for Automatic Speech Recognition of Low-Resource Languages

September 22, 2022

Author(s)

Kay Peterson, Audrey N. Tong, Jennifer Yu

In 2021, the National Institute of Standards and Technology (NIST), in cooperation with the Intelligence Advanced Research Project Activity (IARPA), conducted OpenASR21, the second cycle of an open challenge series of automatic speech recognition (ASR)

OpenASR20: An Open Challenge for Automatic Speech Recognition ofConversational Telephone Speech in Low-Resource Languages

September 1, 2021

Author(s)

Kay Peterson, Audrey N. Tong, Jennifer Yu

In 2020, the National Institute of Standards and Technology (NIST), in cooperation with the Intelligence Advanced Research Project Activity (IARPA), conducted an open challenge on automatic speech recognition (ASR) technology for low-resource languages on

Overview of the NIST 2016 LoReHLT Evaluation

November 13, 2017

Author(s)

Audrey N. Tong, Lukasz L. Diduch, Jonathan G. Fiscus, Yasaman Haghpanah, Shudong Huang, David M. Joy, Kay Peterson, Ian M. Soboroff

Initiated in conjunction with DARPA's Low Resource Languages for Emergent Incidents (LORELEI) Program, the NIST LoReHLT (Low Re-source Human Language Technology) evaluation series seeks to incubate research on fundamental natural language processing tasks

Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

July 20, 2010

Author(s)

Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Mark A. Przybocki, Omar F. Zaidan

This paper presents the results of the WMT10 and MetricsMATR10 shared tasks, which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 104 machine translation systems and 41 system

The NIST 2008 Metrics for Machine Translation Challenge - Overview, Methodology, Metrics, and Results

March 10, 2010

Author(s)

Mark A. Przybocki, Kay Peterson, P. S. Bronsart, Gregory A. Sanders

This paper discusses the evaluation of automated metrics developed for the purpose of evaluating machine translation (MT) technology. A general discussion of the usefulness of automated metrics is offered. The NIST MetricsMATR evaluation of MT metrology is

Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction

August 27, 2008

Author(s)

S. Strassle, Mark A. Przybocki, Kay Peterson, Zhiyi Song, Kazuaki Maeda

The NIST Automatic Content Extraction (ACE) Evaluation expands its focus in 2008 to encompass the challenge of cross-document and cross-language global integration and reconciliation of information. While past ACE evaluations were limited to local (within

Translation Adequacy and Preference Evaluation Tool (TAP-ET)

May 28, 2008

Author(s)

Mark A. Przybocki, Kay Peterson, P. S. Bronsart

Evaluation of Machine Translation (MT) technology is often tied to the requirement for tedious manual judgments of translation quality. While automated MT metrology continues to be an active area of research, a well known and often accepted standard metric

Search Publications by: Kay Peterson (Fed)