Sequence-based allelic variations and frequencies for 22 autosomal STR loci in the Lebanese population

Published

Author(s)

Sarah Riman, Mirna Ghemrawi, Lisa Borsuk, Rami Mahfouz, Susan Walsh, Peter Vallone

Abstract

This is the first study that characterizes the sequence-based allelic variations of 22 autosomal Short Tandem Repeat (aSTR) loci in a population dataset collected from Lebanon. Genomic DNA extracts from 195 unrelated Lebanese individuals were amplified with the PowerSeq 46GY System Prototype. Targeted amplicons were subjected to DNA library preparation and sequenced on the Verogen MiSeq FGx Sequencing System. Raw FASTQ data files were processed by STRait Razor v3. Sequence strings were annotated according to the considerations of the DNA Commission of the International Society for Forensic Genetics (ISFG) and tabulated herein with their respective allelic frequencies and GenBank accession and version numbers. Sequencing the Lebanese dataset yielded 429 distinct allelic sequences, compared with the 236 alleles identified by length only. The increase in the number of alleles was observed at 18 of the 22 aSTR loci and was attributed to sequence variations residing in both the STR repeat motifs and the flanking regions. The study uncovered 25 novel aSTR allelic sequences across 12 loci for which GenBank records did not previously exist in the STRSeq BioProject, PRJNA380127. For a concordance check, the length-based allelic calls derived from the full sequences were compared to those genotyped using capillary electrophoresis (CE) methods. Population genetic parameters relevant to the evaluation of forensic DNA evidence were assessed for the sequence-based data and compared to the parameters generated from the length-based information. Using the sequence-based data, Analysis of MOlecular VAriance (AMOVA), genetic distances, and population genetic structure were evaluated for 1231 individuals sampled from the Lebanese and four U.S. populations (African American, Asian, Caucasian, and Hispanic). The results were tabulated and visualized in a population tree, multidimensional scaling scatter plots, and bar plots.
This newly established sequence-based database for the Lebanese population can be beneficial for extending NGS applicability to casework or paternity testing and assessing the strength of evidence for NGS-STR profiles. The described novel sequence variants at certain loci can further help in the effort to characterize the sequence diversity of STR markers from different populations around the world.
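
The abstract's central comparison, that alleles sharing one length can differ in sequence, can be illustrated with a minimal Python sketch. The sketch is not the authors' pipeline. The observations, the bracketed sequence strings, and the function names are hypothetical and chosen only for illustration. Expected heterozygosity is used as one example of a population genetic parameter, which is an assumption, since the abstract does not list the parameters that were assessed.

```python
from collections import Counter

# Hypothetical toy observations for a single locus, one entry per observed allele:
# (length-based allele call, repeat-region sequence string).
# These values are invented for illustration and are NOT data from the study.
observations = [
    ("10", "[TCTA]10"),
    ("10", "[TCTA]10"),
    ("11", "[TCTA]11"),
    ("11", "[TCTA]10 TCTG"),  # same length as [TCTA]11, different sequence
    ("11", "[TCTA]11"),
    ("12", "[TCTA]12"),
]


def allele_frequencies(alleles):
    """Return {allele: relative frequency} for an iterable of allele labels."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def expected_heterozygosity(freqs):
    """Simple (uncorrected) expected heterozygosity: 1 - sum of squared frequencies."""
    return 1.0 - sum(p * p for p in freqs.values())


# Length-based view: alleles are distinguished by length only.
length_freqs = allele_frequencies(length for length, _ in observations)

# Sequence-based view: alleles are distinguished by length and sequence together.
sequence_freqs = allele_frequencies(observations)

print("distinct alleles by length:  ", len(length_freqs))
print("distinct alleles by sequence:", len(sequence_freqs))
print("expected heterozygosity (length):  ", round(expected_heterozygosity(length_freqs), 4))
print("expected heterozygosity (sequence):", round(expected_heterozygosity(sequence_freqs), 4))
```

In this toy example the length-based allele 11 splits into two sequence-based alleles, so the allele count and the expected heterozygosity both rise. This mirrors, at small scale, the kind of increase the abstract reports (429 sequence-based versus 236 length-based alleles).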
Citation
Forensic Science International: Genetics
Volume
65

Keywords

Next Generation Sequencing, PowerSeq 46GY, Autosomal STR loci, Sequence variations, Lebanese population

Citation

Riman, S. , Ghemrawi, M. , Borsuk, L. , Mahfouz, R. , Walsh, S. and Vallone, P. (2023), Sequence-based allelic variations and frequencies for 22 autosomal STR loci in the Lebanese population, Forensic Science International: Genetics, [online], https://doi.org/10.1016/j.fsigen.2023.102872, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=935008 (Accessed December 4, 2024)

Created April 10, 2023, Updated July 30, 2024