Understanding the characteristics of sequence-based single-source DNA profiles

Sarah Riman; Hariharan K. Iyer; Lisa A. Borsuk; Peter M. Vallone

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Understanding the characteristics of sequence-based single-source DNA profiles

Published

November 9, 2019

Author(s)

Sarah Riman, Hariharan K. Iyer, Lisa A. Borsuk, Peter M. Vallone

Abstract

The sequencing of STR markers provides additional information present in the underlying sequence variation that is typically masked by traditional fragment-based genotyping. However, the interpretation of STR profiles generated by targeted sequencing methods are susceptible to the same factors encountered in profiles processed through capillary gel electrophoresis. These factors include stochastic variation, noise, stutter artifacts, heterozygote imbalance, and allelic drop-out/in. Our goal is to characterize and understand how these behave in targeted sequence datasets. Here, we developed a framework using statistical tools to systematically interpret the characteristics of single-source DNA profiles generated by targeted sequencing. Sensitivity studies were performed using known single-source samples amplified with the PowerSeq 46GY System Prototype with varying DNA target masses ranging from 15 pg to 500 pg. The STR loci were subjected to DNA library preparation using two commercially available library kits and sequenced on the Illumina MiSeq platform. Raw FASTQ data files were analyzed in STRait Razor v2.0 without applying any thresholds (at a coverage ≥ 1). We investigated the effect of library normalization on average sequence coverage and studied methods for setting analytical and zygosity thresholds. All the data were analyzed per DNA quantity as well as investigated per method. Analyses presented can be applied to sequence data generated by similar targeted sequencing panels and/or NGS platforms.

Citation

FSI: Genetics

Volume

Pub Type

Journals

Download Paper

DOI Link

Keywords

Next Generation Sequencing (NGS), Short Tandem Repeat (STR), Receiver Operating Characteristic (ROC) plots, Noise, Zygosity, Analytical Threshold

Health and Forensic genetics

Citation

Riman, S. , Iyer, H. , Borsuk, L. and Vallone, P. (2019), Understanding the characteristics of sequence-based single-source DNA profiles, FSI: Genetics, [online], https://doi.org/10.1016/j.fsigen.2019.102192 (Accessed July 25, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created November 9, 2019, Updated February 13, 2020

Was this page helpful?