NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Nestor: A Tool for Natural Language Annotation of Short Texts
Published
Author(s)
Michael Brundage, Rachael Sexton
Abstract
Nestor is a software tool that annotates natural language CSV (comma-separated variable) files, with a UTF-8 encoding, using a process called tagging [1]. The outputted annotated datasets (as either a CSV or .h5 file) can be used for different analysis techniques, such as failure prediction, problem hot spot identification, and maintenance technician expertise assessment, as shown in [2-7]. Currently, the majority of use cases involve maintenance in the engineering domain (manufacturing, mining, heating ventilation and air conditioning (HVAC)), however Nestor can input any natural language CSV file with UTF-8 encoding. The objective is to help analysts make their natural language data, which is often unstructured, filled with technical content, jargon, mispellings, and abbreviations, computable to improve analysis.
Citation
Journal of Research of the National Institute of Standards and Technology
Brundage, M.
and Sexton, R.
(2019),
Nestor: A Tool for Natural Language Annotation of Short Texts, Journal of Research of the National Institute of Standards and Technology, [online], https://doi.org/10.6028/jres.124.029, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=928708
(Accessed October 9, 2025)