STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci

Katherine Gettings; Lisa Borsuk; D Ballard; Martin Bodner; Bruce Budowle; Laurence Devesse; Jonathan King; W. Parson; Christopher Phillips; Peter Vallone

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci

Published

November 1, 2017

Author(s)

Katherine Gettings, Lisa Borsuk, D Ballard, Martin Bodner, Bruce Budowle, Laurence Devesse, Jonathan King, W. Parson, Christopher Phillips, Peter Vallone

Abstract

The STR Sequencing Project (STRSeq) was initiated to facilitate the description of sequence- based alleles at the Short Tandem Repeat (STR) loci targeted in human identification assays. This international collaborative effort, which has been endorsed by the ISFG DNA Commission, provides a framework for communication among laboratories. The initial data used to populate the project are the aggregate alleles observed in targeted sequencing studies across four laboratories: National Institute of Standards and Technology (N=1786), Kings College London (N=1043), University of North Texas Health Sciences Center (N=839), and University of Santiago de Compostela (N=944), for a total of 4612 individuals. STRSeq data are maintained as GenBank records at the U.S. National Center for Biotechnology Information (NCBI), which participates in a daily data exchange with the DNA DataBank of Japan (DDBJ) and the European Nucleotide Archive (ENA). Each GenBank record contains: (a) the observed sequence of a STR region, (b) annotation ("bracketing") of the repeat region and flanking region polymorphisms, (c) information regarding the sequencing assay and data quality, and (d) backward compatible length-based allele designation. STRSeq GenBank records are organized within a BioProject at NCBI (https://www.ncbi.nlm.nih.gov/bioproject/380127), which is sub-divided into: commonly used autosomal STRs, alternate autosomal STRs, Y-chromosomal STRs, and X-chromosomal STRs. Each of these categories is further divided into locus-specific BioProjects. The BioProject hierarchy facilitates access to the GenBank records by browsing, BLAST searching, or ftp download. Future plans include user interface tools at strseq.nist.gov, a pathway for submission of additional allele records by laboratories performing population sample sequencing and interaction with the STRidER web portal for quality control (https://strider.online).

Citation

Forensic Science International: Genetics

Volume

Pub Type

Journals

Download Paper

DOI Link

Keywords

Forensic, DNA, STR, Sequencing, NGS

Forensic science and Forensic genetics

Citation

Gettings, K. , Borsuk, L. , Ballard, D. , Bodner, M. , Budowle, B. , Devesse, L. , King, J. , Parson, W. , Phillips, C. and Vallone, P. (2017), STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci, Forensic Science International: Genetics, [online], https://doi.org/10.1016/j.fsigen.2017.08.017 (Accessed July 14, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created October 31, 2017, Updated June 24, 2021

Was this page helpful?