Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

User Guide for NIST Media Forensic Challenge (MFC) Datasets

Published

Author(s)

Haiying Guan, Andrew Delgado, Yooyoung Lee, Amy Yates, Daniel Zhou, Timothée N. Kheyrkhah, Jonathan G. Fiscus

Abstract

NIST released a set of Media Forensic Challenge (MFC) datasets developed in DARPA MediFor (Media Forensics) project to the public in the past 5 years. More than 300 individuals, 150 organizations, from 26 countries and regions worldwide use our datasets for their research. During this period, we received many questions from users about the datasets. The questions vary from what the datasets contain, what is the definition of different kinds of metadata, how to use them, to how to modify/extract the data for their own research purposes, etc. This document aims to provide a systematical user guide for MFC datasets to answer those questions. In this document, first, an introduction about the MFC is given. Second, to provide users a deep understanding of what data and metadata are collected and how they are used in MFC datasets, a simple example with the manipulation journal graph is shown. The corresponding MFC dataset reference files of the given example are explained in detail. Third, the historical evolution of MFC datasets is introduced, and the releasable MFC datasets are summarized. Two commonly used approaches to illustrate the manipulation operation histogram of a dataset are presented. Moreover, a flexible approach for the special subset selection, selective scoring, is introduced to sample the test probes from the full test set for special task evaluation. Finally, suggestions to use MFC datasets beyond of MFC evaluations are presented for user interests.
Citation
NIST Interagency/Internal Report (NISTIR) - 8377
Report Number
8377

Keywords

Media Forensics, NIST Media Forensic Challenge (MFC) Evaluation, Journaling Tool (JT), Manipulation journal graph, Image Manipulation Detection, Image Manipulation Localization, Manipulation Localization Reference Mask, JPEG 2000, Manipulation Reference Ground-truth, Localization Mask, DARPA MediFor (Media Forensic) program

Citation

Guan, H. , Delgado, A. , Lee, Y. , Yates, A. , Zhou, D. , Kheyrkhah, T. and Fiscus, J. (2021), User Guide for NIST Media Forensic Challenge (MFC) Datasets, NIST Interagency/Internal Report (NISTIR), National Institute of Standards and Technology, Gaithersburg, MD, [online], https://doi.org/10.6028/NIST.IR.8377, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=931782 (Accessed November 30, 2021)
Created July 6, 2021, Updated October 14, 2021