Skip to main content

NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.

Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.

U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci

Published

Author(s)

Katherine Gettings, Lisa Borsuk, D Ballard, Martin Bodner, Bruce Budowle, Laurence Devesse, Jonathan King, W. Parson, Christopher Phillips, Peter Vallone

Abstract

The STR Sequencing Project (STRSeq) was initiated to facilitate the description of sequence- based alleles at the Short Tandem Repeat (STR) loci targeted in human identification assays. This international collaborative effort, which has been endorsed by the ISFG DNA Commission, provides a framework for communication among laboratories. The initial data used to populate the project are the aggregate alleles observed in targeted sequencing studies across four laboratories: National Institute of Standards and Technology (N=1786), Kings College London (N=1043), University of North Texas Health Sciences Center (N=839), and University of Santiago de Compostela (N=944), for a total of 4612 individuals. STRSeq data are maintained as GenBank records at the U.S. National Center for Biotechnology Information (NCBI), which participates in a daily data exchange with the DNA DataBank of Japan (DDBJ) and the European Nucleotide Archive (ENA). Each GenBank record contains: (a) the observed sequence of a STR region, (b) annotation ("bracketing") of the repeat region and flanking region polymorphisms, (c) information regarding the sequencing assay and data quality, and (d) backward compatible length-based allele designation. STRSeq GenBank records are organized within a BioProject at NCBI (https://www.ncbi.nlm.nih.gov/bioproject/380127), which is sub-divided into: commonly used autosomal STRs, alternate autosomal STRs, Y-chromosomal STRs, and X-chromosomal STRs. Each of these categories is further divided into locus-specific BioProjects. The BioProject hierarchy facilitates access to the GenBank records by browsing, BLAST searching, or ftp download. Future plans include user interface tools at strseq.nist.gov, a pathway for submission of additional allele records by laboratories performing population sample sequencing and interaction with the STRidER web portal for quality control (https://strider.online).
Citation
Forensic Science International: Genetics
Volume
31

Keywords

Forensic, DNA, STR, Sequencing, NGS

Citation

Gettings, K. , Borsuk, L. , Ballard, D. , Bodner, M. , Budowle, B. , Devesse, L. , King, J. , Parson, W. , Phillips, C. and Vallone, P. (2017), STRSeq: A catalog of sequence diversity at human identification Short Tandem Repeat loci, Forensic Science International: Genetics, [online], https://doi.org/10.1016/j.fsigen.2017.08.017 (Accessed October 9, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created October 31, 2017, Updated June 24, 2021
Was this page helpful?