Dynamic Test Collections: Measuring Search Effectiveness on the Live Web

Ian M. Soboroff

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Dynamic Test Collections: Measuring Search Effectiveness on the Live Web

Published

January 22, 2007

Author(s)

Ian M. Soboroff

Abstract

Existing methods for measuring the quality of search algorithms use a static collection of documents. A set of queries and a mapping from the queries to the relevant documents allow the experimenter to see how well different search engines or engine configurations retrieve the correct answers. This methodology assumes that the document set and thus the set of relevant documents are unchanging. In this paper, we abandon the static collection requirement. We begin with a recent TEXT REtrieval Conference (TREC) collection created from a web crawl, and analyze how the documents in that collection have changed over time. We determine how the decayed collection to measure a live web search system. We employ novel measures of search effectiveness that are robust despite incomplete relevance information. Lastly, we propose a methodology of "collection maintenance" which supports measuring search effectiveness both for a single system and between systems run at different points in time.

Proceedings Title

Proceedings of the Annual International ACM SIGIR Conference on Research and Development inInformation Retrieval

Conference Title

29th Annual Conference on Research adn Development in Information Retrieval (SIGIR 2006)

Pub Type

Conferences

Download Paper

Local Download

Keywords

dynamic document collections, information retrieval evaluation, test collections, web search evaluation

Citation

Soboroff, I. (2007), Dynamic Test Collections: Measuring Search Effectiveness on the Live Web, Proceedings of the Annual International ACM SIGIR Conference on Research and Development inInformation Retrieval, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=50846 (Accessed May 29, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created January 22, 2007, Updated February 17, 2017

Was this page helpful?