Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

AIomics: Exploring More of the Proteome Using Mass Spectral Libraries Extended by Artificial Intelligence

Published

Author(s)

Lewis Geer, Joel Lapin, Douglas Slotta, Tytus Mak, Stephen E. Stein

Abstract

The unbounded permutations of biological molecules, including proteins and their constituent peptides, present a dilemma in identifying the components of complex biosamples. Sequence search algorithms used to identify peptide spectra can be expanded to cover larger classes of molecules, including more modifications, isoforms, and atypical cleavage, but at the cost of false positives or false negatives due to the simplified spectra they compute from sequence records. Spectral library searching can help solve this issue by precisely matching experimental spectra to library spectra with excellent sensitivity and specificity. However, compiling spectral libraries that span entire proteomes is pragmatically difficult. Neural networks that predict complete spectra containing a full range of annotated and unannotated ions can be used to replace these simplified spectra with libraries of fully predicted spectra, including modified peptides. Using such a network, we created predicted spectral libraries that were used to rescore matches from a sequence search done over a large search space, including a large number of modifications. Rescoring improved the separation of true and false hits by 82%, yielding an 8% increase in peptide identifications, including a 21% increase in nonspecifically cleaved peptides and a 17% increase in phosphopeptides.
Citation
ACS Journal of Proteome Research
Volume
22
Issue
7

Keywords

machine learning, search engine methods, tandem mass spectrometry, proteome analysis, peptides, algorithms

Citation

Geer, L. , Lapin, J. , Slotta, D. , Mak, T. and Stein, S. (2023), AIomics: Exploring More of the Proteome Using Mass Spectral Libraries Extended by Artificial Intelligence, ACS Journal of Proteome Research, [online], https://doi.org/10.1021/acs.jproteome.2c00807, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=935823 (Accessed May 26, 2024)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created May 26, 2023, Updated September 10, 2023