Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

High-Level Feature Detection from Video in TRECVid: a 5-Year Retrospective of Achievements

Published

Author(s)

Alan Smeaton, Paul D. Over, Wessel Kraaij

Abstract

Successful and effective content-based access to digital video requires fast, accurate and scalable methods to determine the video content automatically. A variety of contemporary approaches to this rely on text taken from speech within the video, or on matching one video frame against others using low-level characteristics like colour, texture, or shapes, or on determining and matching objects appearing within the video. Possibly the most important technique, however, is one which determines the presence or absence of a high-level or semantic feature, within a video clip or shot. By utilizing dozens, hundreds or even thousands of such semantic features we can support many kinds of content-based video navigation. Critically however, this depends on being able to determine whether each feature is or is not present in a video clip. The last 5 years has seen much progress in the development of techniques to determine the presence of semantic features within video. This progress can be tracked in the annual TREC Video Retrieval Evaluation (TRECVid) benchmarking activity where dozens of research groups measure the effectiveness of their techniques on common data and using an open, metrics-based approach. In this chapter we summarise the work done on the TRECVid high level feature task, showing the progress made year-on-year fairly comprehensive statement on where the state-of-the-art is regarding this important task, not just for one rgroup or for one approach, but across the spectrum. We then use this past, and on-going, work as a basis for highlighting the trends that are emerging in this area, and the questions which remain to be addressed before we can achieve large-scale, fast and reliable high level feature detection on video.
Citation
Book chapter in Multimedia Content Analysis, Theory and Appl

Keywords

evaluation, high-level feature detection, multimedia content analysis, TRECVid

Citation

Smeaton, A. , Over, P. and Kraaij, W. (2009), High-Level Feature Detection from Video in TRECVid: a 5-Year Retrospective of Achievements, Book chapter in Multimedia Content Analysis, Theory and Appl, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=51277 (Accessed May 28, 2024)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created March 1, 2009, Updated October 12, 2021