Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

International Interlaboratory Study Comparing Single Organism 16S rRNA Sequencing Data: Going Beyond Consensus Sequence Comparisons

Published

Author(s)

Nathanael D. Olson, Justin M. Zook, Steven P. Lund, Fabiola Rojas-Cornejo, Jim Huggett, Jayne B. Morrow

Abstract

The Consultative Committee for Amount of Substance established a working group to advance the metrology of microbial identification and quantification. This study represents the initial step towards international comparability and measurement assurance for microbial identity. Six laboratories sequenced the 16S rRNA gene from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes strain 4b NCTC 19944. Participants used either Sanger sequencing, Roche 454 pyrosequencing® (“454”), or Ion Torrent PGM®. The 16S rRNA gene is a multi-copy gene with both conserved and variable bases. To rigorously challenge comparability, the sequencing data was evaluated on three levels: 1) identity of conserved bases, 2) ratio of bases at the variable positions, and 3) the full-length sequences of the individual gene copies. Regardless of sequencing method the biologically conserved positions were correctly identified. To determine the base copy ratio and the full-length gene copy sequences, analytical methods were developed using Bayesian and Maximum Likelihood statistics. Accurate base copy ratios were only determined using datasets with higher coverage, or a greater number of sequencing reads covering the reference gene sequence. However, the base copy ratio was predicted accurately more frequently for lower coverage Sanger clone libraries compared to higher coverage Ion Torrent data. The full-length gene copy sequences were only accurately predicted with “454” datasets, indicating that longer reads and higher throughput are required to determine the full-length gene copy sequences. This study presents novel methods for comparing sequencing data from different sequencing platforms for a multi-copy gene, paving the way towards measurement assurance for DNA sequence based microbial identification.
Citation
PLoS One

Keywords

16S rRNA, microbial identification, DNA sequencing, interlaboratory study

Citation

Olson, N. , Zook, J. , Lund, S. , Rojas-Cornejo, F. , Huggett, J. and Morrow, J. (2015), International Interlaboratory Study Comparing Single Organism 16S rRNA Sequencing Data: Going Beyond Consensus Sequence Comparisons, PLoS One, [online], https://doi.org/10.1016/j.bdq.2015.01.004 (Accessed July 5, 2022)
Created March 5, 2015, Updated November 10, 2018