An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
The National Institute of Standards and Technology (NIST) has constructed a discrete manufacturing workcell to support its operational technology (OT) and critical infrastructure research. This work is an improvement on the "Collaborative Robotics Testbed"
This manuscript establishes information-theoretic limitations for robustness of AI security and alignment by extending Gödel's incompleteness theorem to AI. Knowing these limitations and preparing for the challenges they bring is critically important for
It remains challenging to integrate artificial intelligence (AI) and natural language processing (NLP) into complex engineering systems. Popular models and techniques are designed for use with everyday, non-technical text, and therefore perform poorly when
The NIST Generative AI (GenAI) Text Challenge is an evaluation program designed to probe the capabilities and limitations of generative AI models from three complementary perspectives: Generator, Prompter, and Discriminator. The challenge will focus on two
Seungmin Seo, Oleg Aulov, Afzal Godil, Kevin Mangold
Speaker de-identification aims to conceal a speaker's identity while preserving intelligibility of the underlying speech. We introduce a benchmark that quantifies residual identity leak- age with three complementary error rates: equal error rate (EER)
Zongxia Li, Lorena Calvo Bartolome, Alexander Hoyle, Daniel Stephens, Paiheng Xu, Alden Dima, Jordan Boyd-Graber, Juan Fung
A common use of NLP is to facilitate the understanding of large document collections, with models based on Large Language Models (LLMs) replacing probabilistic topic models. Yet the effectiveness of LLM-based approaches in real-world applications remains
Measurement science for AI evaluations is a growing field. The National Institute of Standards and Technology (NIST) recently conducted a pilot evaluation of generative AI (GAI), specifically large language models (LLMs), in the ARIA (Assessing Risks and
Jack Chuang, Raied Caromi, Jelena Senic, Samuel Berweger, Neeraj Varshney, Jian Wang, Anuraag Bodi, Camillo Gentile, Nada Golmie
We describe a quasi-determinstic channel propagation model for human gesture recognition reduced from real-time measurements with our context aware channel sounder, considering four human subjects and 20 distinct body motions, for a total of 120,000
Lan Zhang, Anoop Singhal, Qingtian Zou, Xiaoyan Sun, Peng Liu
This article reviews the current human–large language models collaboration approach to bug fixing and points out the research directions toward (the development of) autonomous program repair artificial intelligence agents.
Hariharan Iyer, Seungmin Seo, Lukas Diduch, Kay Peterson, George Awad, Yooyoung Lee
The 2024 NIST Generative AI (GenAI) Pilot Study focuses on evaluating text-to-text (T2T) generation and discrimination tasks to assess the capabilities and limitations of generative AI models and AI detectors. The study aims to measure the effectiveness of
Basel Abdeen, S M Tahmid Siddiqui, Meah Tahmeed Ahmed, Anoop Singhal, Latifur Khan, Punya Modi, Ehab Al-Shaer
Large language models (LLMs) have emerged as a powerful tool for retrieving knowledge through seamless, human-like interactions. Despite their advanced text generation capabilities, LLMs exhibit hallucination tendencies, where they generate factually
Ryan O'Loughlin, Nicholas Skuda, Bakhrom Oripov, Sonia Buckley, Adam McCaughan
The brain implements recurrent neural networks (RNNs) efficiently, and modern computing hardware does not. Although specialized neuromorphic hardware is well suited for recurrent implementations in the inference phase, it does not lend itself to the
Yining Luo, Baobao Li, Anoop Singhal, Pei-Yu Tseng, Lan Zhang, Qingtian Zou, Xiaoyan Sun, Peng Liu
Large Language Models (LLMs) have shown promise in automating code vulnerability repair, but their effectiveness in handling real-world code remains limited. This paper investigates the capability of LLMs, in repairing vulnerabilities and proposes a
The 2024 NIST Generative AI (GenAI) Pilot Study focuses on evaluating text-to-text (T2T) generation and discrimination tasks to assess the capabilities and limitations of generative AI models. The study aims to measure the effectiveness of AI-generated
This plan has been developed by the Department of Commerce in coordination with the Department of State and agencies across the U.S. Government. It reflects more than 65 comments received in response to a December 2023 Request for Information
This NIST Trustworthy and Responsible AI report provides a taxonomy of concepts and defines terminology in the field of adversarial machine learning (AML). The taxonomy is arranged in a conceptual hierarchy that includes key types of ML methods, life cycle
Peter Bajcsy, Brycie Wiseman, Michael Paul Majurski, Andras Vladar
The speed of in-line scanning electron microscope (SEM) measurements of linewidth, contact hole, and overlay is critically important for identifying the measurement area and generating indispensable process control information. Sample charging and damage
Eugene Song, Thomas Roth, David A. Wollman, Eoin Jordan, Martin Serrano, Amelie Gyrard
Modern edge artificial intelligence (AI) chipsets and edge-intelligence-enabled smart sensors frameworks support real-time data processing and event detection at the signal source. Beyond just measuring local conditions and transmitting corresponding
The rapid advancement of artificial intelligence (AI) has led to the emergence of several technologies, including Generative Adversarial Networks (GANs), deepfakes, generative AI, CGI, and anti-forensics techniques. These technologies pose a significant
This paper presents gFlashNet, a generic flashover prediction model, designed to address the limitations of existing models that are restricted to specific residential building layouts. The aim of this research is to improve the scalability and
Precisely modeling radio propagation in complex environments has been a significant challenge, especially with the advent of 5G and beyond networks, where managing massive antenna arrays demands more detailed information. Traditional methods, such as
This report examines the existing standards, tools, methods, and practices, as well as the potential development of further science-backed standards and techniques, for: authenticating content and tracking its provenance; labeling synthetic content, such
Wai Cheong Tam, Jian Chen, Wei Tang, Qi Tong, Hongqiang Fang, Anthony Putorti
This paper presents the development of a fast-responding and accurate detection model for early-stage thermal runaway of a lithium-ion battery utilizing acoustics and a deep learning paradigm. A series of single-cell lithium-ion battery tests is conducted
Theodore Jensen, Mary Frances Theofanos, Kristen K. Greene, Olivia Williams, Kurtis Goad, Janet Bih Fofang
The increasing prevalence of artificial intelligence (AI) will likely lead to new interactions and impacts for the general public. An understanding of people's perceptions of AI can be leveraged to design and deploy AI systems toward human needs and values