NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
Published
Author(s)
Stephanie Strassel, Amanda Morris, Jonathan G. Fiscus, Christopher Caruso, Haejoong Lee, Paul D. Over, James Fiumara, Barbara L. Shaw, Brian Antonishek, Martial Michel
Abstract
Linguistic Data Consortium and the National Institute of Standards and Technology are collaborating to create a large, heterogeneous annotated multimodal corpus to support research in multimodal event detection and related technologies. The HAVIC (Heterogeneous Audio Visual Internet Collection) Corpus will ultimately consist of several thousands of hours of unconstrained user-generated multimedia content. HAVIC has been designed with an eye toward providing increased challenges for both acoustic and video processing technologies, focusing on multi-dimensional variation inherent in user-generated multimedia content. To date the HAVIC corpus has been used to support the NIST 2010 and 2011 TRECVID Multimedia Event Detection (MED) Evaluations. Portions of the corpus are expected to be released in LDC's catalog in the coming year, with the remaining segments being published over time after their use in the ongoing MED evaluations.
Proceedings Title
Language Resources and Evaluation (LREC) 2012 Proceedings
Strassel, S.
, Morris, A.
, Fiscus, J.
, Caruso, C.
, Lee, H.
, Over, P.
, Fiumara, J.
, Shaw, B.
, Antonishek, B.
and Michel, M.
(2012),
Creating HAVIC: Heterogeneous Audio Visual Internet Collection, Language Resources and Evaluation (LREC) 2012 Proceedings, Istanbul, TR, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=911993
(Accessed October 25, 2025)