Report on the TREC-5 Confusion Track

Paul B. Kantor; Ellen M. Voorhees

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Report on the TREC-5 Confusion Track

Published

October 24, 2005

Author(s)

Paul B. Kantor, Ellen M. Voorhees

Abstract

This paper is the track report for the TREC-5 confusion track. For TREC-5, retrieval from corrupted data was studied through retrieval of specific target documents from a corpus that was corrupted by applying OCR techniques to page images of varying qualities. Methods that attempted probabilistic estimation of the original clean text fared better than methods that simply accepted corrupted version of the query text.

Citation

Report on the TREC-5 Confusion Track

Pub Type

Others

Download Paper

Local Download

Keywords

information retrieval, test retrieval conference

Citation

Kantor, P. and Voorhees, E. (2005), Report on the TREC-5 Confusion Track, Report on the TREC-5 Confusion Track, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=151345 (Accessed August 2, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created October 23, 2005, Updated October 12, 2021

Was this page helpful?