Best practices for benchmarking germline small-variant calls in human genomes

Justin M. Zook; Marc L. Salit

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Best practices for benchmarking germline small-variant calls in human genomes

Published

March 11, 2019

Author(s)

Justin M. Zook, Marc L. Salit

Abstract

Standardized benchmarking approaches are required to assess the accuracy of variants called from sequence data. Although variant- calling tools and the metrics used to assess their performance continue to improve, important challenges remain. Here, as part of the Global Alliance for Genomics and Health (GA4GH), we present a benchmarking framework for variant calling. We provide guidance on how to match variant calls with different representations, define standard performance metrics, and stratify performance by variant type and genome context. We describe limitations of high-confidence calls and regions that can be used as truth sets (for example, single- nucleotide variant concordance of two methods is 99.7% inside versus 76.5% outside high-confidence regions). Our web-based app enables comparison of variant calls against truth sets to obtain a standardized performance report. Our approach has been piloted in the PrecisionFDA variant-calling challenges to identify the best-in-class variant-calling methods within high-confidence regions. Finally, we recommend a set of best practices for using our tools and evaluating the results.

Citation

Nature Biotechnology

Volume

Pub Type

Journals

Download Paper

https://doi.org/10.1038/s41587-019-0054-x

Keywords

genomics, DNA sequencing, benchmarking, reference materials, reference data

Bioscience, Genomics, Reference data and Reference materials

Citation

Zook, J. and Salit, M. (2019), Best practices for benchmarking germline small-variant calls in human genomes, Nature Biotechnology, [online], https://doi.org/10.1038/s41587-019-0054-x (Accessed August 24, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created March 11, 2019, Updated January 27, 2020

Was this page helpful?

Best practices for benchmarking germline small-variant calls in human genomes

Author(s)

Abstract

Download Paper

Keywords

Citation

Additional citation formats

Issues