Overview of the TREC 2004 Terabyte Track

Author(s)

Charles L. Clarke, Nick Craswell, Ian Soboroff

Abstract

The Terabyte Track explores how adhoc retrieval and evaluation techniques can scale to terabyte-sized collections. For TREC 2004, our first year, 50 new adhoc topics were created and evaluated over a 426GB collection of 25 million documents taken from the .gov Web domain. A total of 70 runs were submitted by 17 groups. Along with the top documents, each group reported average query times, indexing times, index sizes, and hardware and software characteristics for their systems.
Citation
Special Publication (NIST SP) - sp
Report Number
sp

Keywords

information retrieval evaluation, large-scale collections, TREC

Citation

Clarke, C., Craswell, N. and Soboroff, I. (2005), Overview of the TREC 2004 Terabyte Track, Special Publication (NIST SP), National Institute of Standards and Technology, Gaithersburg, MD, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=151608 (Accessed December 13, 2024)

Created October 2, 2005, Updated October 12, 2021