Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Search Publications

NIST Authors in Bold

Displaying 1 - 25 of 128

The 2019 NIST Audio-Visual Speaker Recognition Evaluation

May 18, 2020
Author(s)
Seyed Omid Sadjadi, Craig S. Greenberg, Elliot Singer, Douglas A. Reynolds, Lisa Mason, Jaime Hernandez-Cordero
… (CTS) data from the Call My Net 2 (CMN2) corpus, and 2) an Audio-visual (AV) evaluation using video material extracted … (VAST) corpus. This paper presents an overview of the Audio-Visual SRE19 including the task, the performance … protocol, results and system performance analyses. The Audio-Visual SRE19 was organized in a similar manner to the …

Designing Usable Audio for Voting Systems: Best Practices and a Test Approach

January 31, 2025
Author(s)
Lynn Baumeister, Whitney Quesenbery, Sharon J. Laskowski
… The best practices outlined in this document focus on the audio and tactile controller experience. These best practices … voting system designers methods to improve their existing audio or for creating audio for a new voting system. These best practices describe …

Results of the 2006 Spoken Term Detection Evaluation

Author(s)
Jonathan G. Fiscus, Jerome G. Ajot, John S. Garofolo, George Doddington
… is a sequence of words consecutively spoken, in a large audio corpus of heterogeneous speech material. The paper … audio indexing, audio mining, multilingual, speech retrieval …

The TREC Spoken Document Retrieval Track: A Success Story

April 1, 2000
Author(s)
John S. Garofolo, C G. Auzanne, Ellen M. Voorhees
… involves the search and retrieval of excerpts from spoken audio recordings using a combination of automatic speech … this technology can be successfully applied to realistic audio collections using a combination of existing … audio, broadcast, document, indexing, multimedia, news, …

APPROACHES AND BEST PRACTICES: Data Collection of Audio Dialogues to Support the Training of Speech-to-Speech Translation Systems

June 25, 2010
Author(s)
Brian A. Weiss, Craig I. Schlenoff, Ann M. Virts
… effectively capture two-way, free-form speech-to-speech audio dialogues within recording studios. These dialogues, … NIST personnel have collected over 500 hours of bilingual audio data sets encompassing more than 1100 dialogues across … designed and employed allowing the successful capture of audio data. In addition to the data collection protocols …

Fused quad audio/visual and tracking data collection to enhance mobile robot and operator performance analyses

March 20, 2008
Author(s)
Brian A. Weiss, Brian Antonishek, Richard J. Norcross
… As a robot maneuvers through a performance test, video and audio data streams are simultaneously collected and fed into … a quad compressor providing real-time display. This fused audio/visual data provide a complete picture of what the … Fused quad audio/visual and tracking data collection to enhance mobile …

The CLEAR 2006 Evaluation

Author(s)
Rainer Stiefelhagen, Keni Bernardin, Rachel J. Bowers, John S. Garofolo, Djamel Mostefa, K Soundararajan
… were conducted, which included acoustic, visual and audio-visual analysis for many of the main tasks, as well as … Evaluation, Video, Multimedia, Audio, Metrics …

Applications of a 3D Range Camera Towards Healthcare Mobility Aids

October 3, 2006
Author(s)
Roger V. Bostelman, Peter Russo, James S. Albus, Tsai Hong Hong, Rajmohan Madhavan
… efforts allowed NIST to combine the 3D camera with stereo audio feedback to help the blind or visually impaired to … the control algorithm that combines the camera with stereo audio to help guide people around objects, including the … 3D range camera, audio guidance, control, guidance for the blind, healthcare, …

Creating HAVIC: Heterogeneous Audio Visual Internet Collection

May 21, 2012
Author(s)
Stephanie Strassel, Amanda Morris, Jonathan G. Fiscus, Christopher Caruso, Haejoong Lee, Paul D. Over, James Fiumara, Barbara L. Shaw, Brian Antonishek, Martial Michel
… and related technologies. The HAVIC (Heterogeneous Audio Visual Internet Collection) Corpus will ultimately … Creating HAVIC: Heterogeneous Audio Visual Internet Collection …

Java-Based Multimedia Collaboration and Application Sharing Environment

December 1, 1998
Author(s)
H Abdel-Wahab, Okhee Kim, P Kabore, J Favreau
… Multimedia desktop conferencing systems that include audio, video and application sharing are gaining momentum and … in Java using its Abstract Windows Toolkit (AWT). The audio and video components of JCE are based on the new Java … protocol (RTP) within JMF for sending and receiving many audio and video streams. Our solutions to these problems are …

Optimal Transmit Volume Conditions for Mission Critical Voice Quality of Experience Measurement Systems

September 22, 2021
Author(s)
Chelsea Greene, Jesse Frey, William Magrogan, Cara O'Malley, Jaden Pieper
… Research (PSCR) Division. As noted in prior publications, audio volume levels have an impact on output consistency … (SUT). To achieve this, a measurement that characterizes audio distortion levels, specifically caused by overdriven … Analog, Audio, A-weight, Communications, Direct mode, Distortion, …

What Makes a Good Podcast Summary?

July 11, 2022
Author(s)
Rezvaneh Rezapour, Sravana Reddy, Rosie Jones, Ian Soboroff
… automatic summarization, speech processing, audio

Calculable Coaxial Resistors for Precision Measurements

May 1, 1999
Author(s)
Randolph E. Elmquist
… between resistors, capacitors, and inductors in the audio frequency range. The design is based on the principle … resistors and of the quantum Hall resistance in the audio frequency range. …

Eval-ware: Multimodal Interaction

March 5, 2007
Author(s)
John S. Garofolo, Richard T. Rose, Rainer Stiefelhagen
Audio, Multimodal interaction, Speech recognition, Video …

Augmenting Deep Learning Models for Speech Emotion Recognition

October 19, 2020
Author(s)
Ram Sriram, Dinesh Manocha, Sarala Padi
… learning model to recognize the underlying emotion of an audio signal. Our proposed multi-window augmentation approach … speech signal by employing multiple window sizes in the audio feature extraction process. We show that our … of finding the best window size is an essential step in audio feature extraction. We perform extensive experimental …
Displaying 1 - 25 of 128
Was this page helpful?