NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Creating a web-scale video collection for research
Published
Author(s)
Paul D. Over, George M. Awad, Alan Smeaton, Colum Foley, James Lanagan
Abstract
This paper begins by considering a number of important design questions for a web-scale, widely available, multimedia test collection intended to support long-term scientific evaluation and comparison of content-based video analysis and exploitation systems. Such exploitation systems would include the kinds of functionality already explored within the annual TREC Video Retrieval Evaluation (TRECVid) benchmarking activity such as search, semantic concept detection, and automatic summarization. We then report on our progress in creating such a multimedia collection from publicly available Internet Archive videos with Creative Commons licenses (IACC.1), which we hope will be a useful approximation of a web-scale collection and will support a next generation of benchmarking activities for content-based video operations. We also report on some possibilities for putting this collection to use in multimedia system evaluation.
Proceedings Title
The 1st International Workshop on Web-Scale Multimedia Corpus (WSMC09)
Over, P.
, Awad, G.
, Smeaton, A.
, Foley, C.
and Lanagan, J.
(2009),
Creating a web-scale video collection for research, The 1st International Workshop on Web-Scale Multimedia Corpus (WSMC09), Beijing, -1, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=903000
(Accessed October 25, 2025)