Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Search Publications by: Ian Soboroff (Fed)

Search Title, Abstract, Conference, Citation, Keyword or Author
Displaying 76 - 100 of 131

Overview of the TREC 2008 Enterprise Track

January 1, 2010
Author(s)
Ian M. Soboroff, Krisztian Balog, Nick Craswell, Arjen de Vries, Paul Thomas, Peter Bailey
The goal of the enterprise track is to conduct experiments with enterprise data that reflect the experiences of users in real organizations. This year, we continued with the CERC collection introduced in TREC 2007. Topics were developed in conjunction with

Is spam an issue for opinionated blog post search?

July 19, 2009
Author(s)
Ian M. Soboroff, Craig Macdonald, Iadh Ounis
In opinion-finding, the retrieval system is tasked with re- trieving not just relevant documents, but those that also express an opinion towards the query target entity. This task has been studied in the context of the blogosphere by groups participating

A Guide to the RIA Workshop Data Archive

July 18, 2009
Author(s)
Ian M. Soboroff
During the course of the Reliable Information Access (RIA) workshop, a data archive was created to hold the outputs of the many experiments being done. This archive was designed to serve both as an organizational structure to support the researchers at the

Overview of the TREC 2007 Blog Track

December 17, 2008
Author(s)
Ian Soboroff, Craig Macdonald, Iadh Ounis
The goal of the Blog track is to explore the information seeking behaviour in the blogosphere. It aims to create the required in- frastructure to facilitate research into the blogosphere and to study retrieval from blogs and other related applied tasks

Overview of the TREC 2007 Enterprise Track

December 17, 2008
Author(s)
Ian M. Soboroff, Peter Bailey, Nick Craswell, Arjen de Vries
The goal of the enterprise track is to conduct experiments with enterprise data that reflect the experiences of users in real organizations. This year, the track has introduced a new corpus with the goal to be more representative of real-world enterprise

Limits of Opinion-Finding Baseline Systems

July 21, 2008
Author(s)
Ian M. Soboroff, Craig Macdonald, Ben He, Iadh Ounis
In opinion-finding, the retrieval system is tasked with re- trieving not just relevant documents, but which also express an opinion towards the query target entity. Most opinion- finding systems are based on a two-stage approach, where initially the system

Relevance assessment: are judges exchangeable and does it matter?

July 21, 2008
Author(s)
Ian M. Soboroff, Peter Bailey, Nick Craswell, Alan Smeaton, Emine Yilmaz, Paul Thomas
We investigate to what extent people making relevance judgments for a reusable IR test collection are exchangeable. We consider three classes of judge: gold standard judges, who are topic origi- nators and are experts in a particular information seeking

On The TREC Blog Track

April 30, 2008
Author(s)
Iadh Ounis, Craig Macdonald, Ian Soboroff
The rise of blogging as a new grassroots publishing medium and the many interesting peculiarities that characterize blogs compared to other genres of documents opened up several new interesting research areas in the information retrieval field. The Blog

The TREC 2006 Terabyte Track

March 24, 2008
Author(s)
Stefan Buttcher, Charles L. Clarke, Ian Soboroff
The primary goal of the Terabyte Track is to develop an evaluation methodology for terabyte-scale document collections. In addition, we are interested in efficiency and scalability issues, which can be studied more easily in the context of a larger

Overview of the TREC 2006 Enterprise Track

February 25, 2008
Author(s)
Ian M. Soboroff, Arjen de Vries, Nick Craswell
The goal of the enterprise track is to conduct experiments with enterprise data --- intranet pages, email archives, document repositories --- that reflect the experiences of users in real organizations, such that for example, an email ranking technique

Overview of the TREC 2006 Blog Track

November 27, 2007
Author(s)
Iadh Ounis, Maarten de Rijke, Craig Macdonald, Gilad Mishne, Ian Soboroff
The Blog track began this year, with the aim to explore the information seeking behaviour in the blogosphere. For this purpose, a new large-scale test collection, namely the TREC Blog06 collection, has been created. In the first pilot run of the track in

A Comparison of Pooled and Sampled Relevance Judgments

August 29, 2007
Author(s)
Ian M. Soboroff
Test collections are most useful when they are reusable, that is, when they can be reliably used to rank systems that did not contribute to the pools. Pooled relevance judgments for very large collections may not be reusable for two reasons: they will be

The TREC 2005 Terabyte Track

August 27, 2007
Author(s)
Charles L. Clarke, Falk Scholer, Ian Soboroff
The Terabyte Track explores how retrieval and evaluation techniques can scale to terabyte-sized collections, examining both efficiency and effectiveness issues. TREC 2005 is the second year for the track. The track was introduced as part of TREC 2004, with