Skip to main content

NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.

Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.

U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Report on the TREC-5 Confusion Track

Published

Author(s)

Paul B. Kantor, Ellen M. Voorhees

Abstract

This paper is the track report for the TREC-5 confusion track. For TREC-5, retrieval from corrupted data was studied through retrieval of specific target documents from a corpus that was corrupted by applying OCR techniques to page images of varying qualities. Methods that attempted probabilistic estimation of the original clean text fared better than methods that simply accepted corrupted version of the query text.
Citation
Report on the TREC-5 Confusion Track

Keywords

information retrieval, test retrieval conference

Citation

Kantor, P. and Voorhees, E. (2005), Report on the TREC-5 Confusion Track, Report on the TREC-5 Confusion Track, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=151345 (Accessed October 14, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created October 23, 2005, Updated October 12, 2021
Was this page helpful?