Skip to main content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Summary

Consortium hosted by NIST and JIMB dedicated to authoritative characterization of benchmark human genomes. Sign up for General GIAB and Analysis Team email lists. Public workshops held annually - next workshop planned for April 1-2, 2020 at Stanford University, CA. Click here for the GIAB FAQ

Description

GIAB Logo
Credit: NIST

Consortium goals:
The Genome in a Bottle Consortium is a public-private-academic consortium hosted by NIST to develop the technical infrastructure (reference standards, reference methods, and reference data) to enable translation of whole human genome sequencing to clinical practice. The priority of GIAB is authoritative characterization of human genomes for use in analytical validation and technology development, optimization, and demonstration.

Reference samples:
GIAB has currently characterized a pilot genome (NA12878/HG001) from the HapMap project,  and two son/father/mother trios of Ashkenazi Jewish and Chinese ancestry from the Personal Genome Project (selected because, unlike the pilot genome, they are consented for commercial redistribution and reidentification).  These samples and their IDs from NIST, Coriell, and PGP are in this table.

Benchmark (or "High-confidence") variant calls and regions:
We developed an integration pipeline to utilize sequencing data generated by multiple technologies to generate variant calls and regions for use in benchmarking and validating variant calling pipelines. Currently, benchmark VCF and BED files for small variants are available for GRCh37 and GRCh38 under each genome at ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/release/

Draft benchmarks for difficult variants and regions:
Structural variants: Currently available for HG002 on GRCh37
Small variants in more difficult regions: Currently available for HG002 on GRCh37 and GRCh38 
MHC: Currently available for HG002 on GRCh37 and GRCh38 

Benchmarking best practices:
To establish best practices for using GIAB genomes for benchmarking, we have worked with the Global Alliance for Genomics and Health Benchmarking Team:
Benchmarking tools, Manuscript, GitHub

Sequencing Data:
Data and analyses from most short, linked, and long read sequencing methods are publicly available without publication embargo (data indexed in GIAB GitHub and FTP).
GIAB 2016 Scientific Data publication
NCBI GIAB Bioproject
NCBI SRA Run Selector
Amazon AWS S3 bucket: s3://giab 

Ongoing and Future work:
Current work in the GIAB Analysis Team is focused on establishing benchmark large indel and structural variant calls, as well as calls in difficult genomic contexts (e.g., homopolymers/tandem repeats and difficult-to-map regions like pseudogenes and segmental duplications).  GIAB is also exploring expanding to additional samples consented for release of WGS and redistribution of commercial products: increasing the diversity of germline reference samples and developing paired tumor-normal cell lines.

Workshops:
The consortium was initiated in a set of meetings in 2011 and 2012, and the consortium  holds open, public workshops typically annually at Stanford University in Palo Alto, CA or at NIST in Gaithersburg, MD. Slides from workshops and conferences are available here. The next workshop is planned for April 1-2, 2020 at Stanford University. The consortium and workshops are open and new participants are welcome.

Publications by GIAB:
Benchmark small variants for 7 GIAB genomes
Older benchmark small variants for pilot genome
GIAB data collected through 2016
GA4GH Best practices for benchmarking germline small variants
Posters and Presentations by GIAB

Publications using GIAB:
All google scholar publications mentioning GIAB and NIST

GIAB Email Lists:
General announcements
Analysis Team

Blog about GIAB Work

Created July 25, 2012, Updated November 22, 2019