Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method

Published

Author(s)

Maja Sidstedt, Arvid Gynna, Kevin Kiesler, Linda Jansson, Becky Steffen, Joakim Hakansson, Gustav Johansson, Yalda Bogestal, Andreas Tillmar, Peter Radstrom, Anders Stahlberg, Peter Vallone, Johannes Hedman

Abstract

Massively parallel sequencing (MPS) is increasingly applied in forensic short tandem repeat (STR) analysis. The presence of stutter artefacts and other PCR or sequencing errors in the MPS-STR data partly limits the detection of low DNA amounts, e.g., in complex mixtures. Unique molecular identifiers (UMIs) have been applied in several scientific fields to reduce noise in sequencing. UMIs consist of a stretch of random nucleotides, a unique barcode for each starting DNA molecule, that is incorporated in the DNA template using either ligation or PCR. The barcode is used to generate consensus reads, thus removing errors. The SiMSen-Seq (Simple, multiplexed, PCR-based barcoding of DNA for sensitive mutation detection using sequencing) method relies on PCR-based introduction of UMIs and includes a sophisticated hairpin design to reduce unspecific primer binding as well as PCR protocol adjustments to further optimize the reaction. In this study, SiMSen-Seq is applied to develop a proof-of-concept seven STR multiplex for MPS library preparation and an associated bioinformatics pipeline. Additionally, machine learning (ML) models were evaluated to further improve UMI allele calling. Overall, the seven STR multiplex resulted in complete detection and concordant alleles for 47 single-source samples at 1 ng input DNA as well as for low-template samples at 62.5 pg input DNA. For twelve challenging mixtures with minor contributions of 10 pg to 150 pg and ratios of 1-15% relative to the major donor, 99.2% of the expected alleles were detected by applying the UMIs in combination with an ML filter. The main impact of UMIs was a substantially lowered number of artefacts as well as reduced stutter ratios, which were generally below 5% of the parental allele. In conclusion, UMI-based STR sequencing opens new means for improved analysis of challenging crime scene samples including complex mixtures.
Citation
Forensic Science International: Genetics
Volume
71

Keywords

forensic DNA, machine learning, massively parallel sequencing, short tandem repeats, targeted PCR, UMI

Citation

Sidstedt, M. , Gynna, A. , Kiesler, K. , Jansson, L. , Steffen, B. , Hakansson, J. , Johansson, G. , Bogestal, Y. , Tillmar, A. , Radstrom, P. , Stahlberg, A. , Vallone, P. and Hedman, J. (2024), Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method, Forensic Science International: Genetics, [online], https://doi.org/10.1016/j.fsigen.2024.103047, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=956772 (Accessed December 14, 2024)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created April 3, 2024, Updated July 30, 2024