NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
International Interlaboratory Study Comparing Single Organism 16S rRNA Sequencing Data: Going Beyond Consensus Sequence Comparisons
Published
Author(s)
Nathanael D. Olson, Justin M. Zook, Steven P. Lund, Fabiola Rojas-Cornejo, Jim Huggett, Jayne B. Morrow
Abstract
The Consultative Committee for Amount of Substance established a working group to advance the metrology of microbial identification and quantification. This study represents the initial step towards international comparability and measurement assurance for microbial identity. Six laboratories sequenced the 16S rRNA gene from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes strain 4b NCTC 19944. Participants used either Sanger sequencing, Roche 454 pyrosequencing® (454), or Ion Torrent PGM®. The 16S rRNA gene is a multi-copy gene with both conserved and variable bases. To rigorously challenge comparability, the sequencing data was evaluated on three levels: 1) identity of conserved bases, 2) ratio of bases at the variable positions, and 3) the full-length sequences of the individual gene copies. Regardless of sequencing method the biologically conserved positions were correctly identified. To determine the base copy ratio and the full-length gene copy sequences, analytical methods were developed using Bayesian and Maximum Likelihood statistics. Accurate base copy ratios were only determined using datasets with higher coverage, or a greater number of sequencing reads covering the reference gene sequence. However, the base copy ratio was predicted accurately more frequently for lower coverage Sanger clone libraries compared to higher coverage Ion Torrent data. The full-length gene copy sequences were only accurately predicted with 454 datasets, indicating that longer reads and higher throughput are required to determine the full-length gene copy sequences. This study presents novel methods for comparing sequencing data from different sequencing platforms for a multi-copy gene, paving the way towards measurement assurance for DNA sequence based microbial identification.
Olson, N.
, Zook, J.
, Lund, S.
, Rojas-Cornejo, F.
, Huggett, J.
and Morrow, J.
(2015),
International Interlaboratory Study Comparing Single Organism 16S rRNA Sequencing Data: Going Beyond Consensus Sequence Comparisons, PLoS One, [online], https://doi.org/10.1016/j.bdq.2015.01.004
(Accessed October 18, 2025)