The National Institute of Standards and Technology (NIST) has developed and maintains an Electron Ionization (EI) Library as part of the NIST/EPA/NIH Mass Spectral Library, and the NIST GC Retention Index (RI) Database. These databases contain evaluated reference data used for the identification of chemicals.
The speed and accuracy of chemical analyses are vital issues in many areas. Monitoring for explosives, drugs (both legal and illegal), chemical weapons, pollutants, petrochemical processing, food contamination, and health/disease status biomarkers is a challenge with a high reward in terms of quality of life and security for the nation.
The first step in chemical analysis is usually identifying the chemical or chemicals present and the single most general technique for identifying molecules is mass spectrometry. Although there are numerous methods for ionizing materials to generate a mass spectrum, one of the oldest and the best established is electron ionization (EI). Although a mass spectrum reflects the underlying properties of the molecules, the process is sufficiently complex that identifications often require the comparison of the spectrum of an unknown with a measured spectrum of the compound.
The purposes of both the Electron Ionization (EI) Library, which is part of the NIST/EPA/NIH Mass Spectral Library, and the Retention Index (RI) Database is to aid in the identification of chemicals. In a substantial number of cases the EI Library is part of the only feasible identification process and both make the process faster and more reliable.
The Standard Reference Data Act (Pub. L. 90-396, Sec. 1, July 11, 1968) states: "The Congress hereby finds and declares that reliable standardized scientific and technical reference data are of vital importance to the progress of the Nation's science and technology. It is therefore the policy of the Congress to make critically evaluated reference data readily available to scientists, engineers, and the general public. It is the purpose of this Act to strengthen and enhance this policy."
The objective of the EI Library is to provide the largest possible collection of reliable reference spectra with an emphasis on compounds that are environmentally or biologically relevant. The Retention Index (RI) Database provides a comprehensive collection of reported retention indices.
Research Activities and Technical Approach
The size and scope of the EI Library is increased by the purchase of specialized collections, by both solicited and spontaneous contributions, by adding spectra from the literature, and, when not otherwise available, by obtaining the spectra at NIST of compounds both purchased and prepared.
Quality control is essential since the reliability of identifications depends on the spectra being of the correct compounds and of high quality. Although some aspects of the quality control can be and are automated, much of it must be done by experts.
An essential element in the NIST/EPA/NIH Mass Spectral Library is the software necessary to use the various libraries effectively. Past developments include the addition of synonyms for chemical names, the addition of chemical structures and Chemical Abstracts registry numbers, the development and implementation of spectral matching algorithms, the development of a retention index database with values linked to the appropriate spectra, and the development of AMDIS, a program that extracts clean spectra from complex gas chromatography/mass spectrometry data for comparison to library spectra. This software is available on a wide variety of commercial mass spectrometry instrumentation. Planning is underway to incorporate IUPAC International Chemical Identifiers (InChI) (mainly developed at NIST) into the system.
The RI Database is the largest available collection of retention indices. Most of the data was collected from the published literature although some was obtained at NIST at the same time as spectra for the EI Library. As with the NIST/EPA/NIH Mass Spectral Library the software provides chemical names, synonyms, Chemical Abstracts registry numbers, and structures.