Skip to main content
U.S. flag

An official website of the United States government

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Https

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Multimodal Information Group

We research and develop measurement and evaluation methods to advance and promote the use of technologies that provide more effective access to multimedia and multi-lingual information.

The group focuses on measurement and evaluation methods to facilitate the use of technologies that provide easier access to multimedia and multi-lingual information and that improve human-computer interface modalities. These technologies include recognizing and/or transforming information in speech, text, images, video, and other multimedia modalities, and the fusion of heterogeneous media content.  Focus areas include speech recognition, speaker recognition, language recognition, machine translation, image processing, image understanding, video processing, visual recognition, 2-D and 3D shape analysis, image quality assessment, and interoperable digital media access.

Projects and Programs

Video Analytics

The Multimodal Information Group's (MIG) video analytics program includes several activities contributing to the development of technologies that extract

Speaker and Language Recognition

Our Speaker and Language Recognition program includes several activities contributing to speaker and language recognition technology and metrology advancements

Data Science

The NIST Information Access Division (IAD) initiated a Data Science Research Program (DSRP) aimed to advance the measurement science for big data and data

Interdisciplinary Projects

Some of the Multimodal Information Group's project areas span across multiple research areas within the group or to other groups in IAD. These interdisciplinary

Publications

NIST 2020 CTS Speaker Recognition Challenge Evaluation Plan

Author(s)
Seyed Omid Sadjadi, Craig S. Greenberg, Elliot Singer, Douglas A. Reynolds, Lisa Mason
Following the success of the 2019 Conversational Telephone Speech (CTS) Speaker Recognition Challenge, which received 1347 submissions from 67 academic and

Contacts