Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction



S. Strassel, Mark A. Przybocki, Kay Peterson, Zhiyi Song, Kazuaki Maeda


The NIST Automatic Content Extraction (ACE) Evaluation expands its focus in 2008 to encompass the challenge of cross-document and cross-language global integration and reconciliation of information. While past ACE evaluations were limited to local (within-document) detection and disambiguation of entities, relations and events, the current evaluation adds global (cross-document and cross-language) entity disambiguation tasks for Arabic and English. This paper presents the 2008 ACE XDoc evaluation task and associated infrastructure. We describe the creation of development and test data to support the evaluation, focusing on new approaches required in data selection, annotation task definition and annotation software; and we conclude with a discussion of the metrics developed to support the evaluation.
Conference Dates
May 28-30, 2008
Conference Location
Marrakech, Morocco
Conference Title
The sixth international conference on Language Resources and Evaluation, LREC 2008


automatic content extraction, cross-document
Created August 27, 2008, Updated February 19, 2017