Skip to main content

NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.

Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.

U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

A Diploid Assembly-based Benchmark for Variants in The Major Histocompatibility Complex

Published

Author(s)

Justin M. Zook, Justin M. Wagner, Chen-Shan Chin, Qiandong Zeng, Alexander Dilthey, Tobias Marschall, Mikko Rautiainen, Erik Garrison, Shilpa Garg

Abstract

Most human genomes are characterized by aligning individual reads to the reference genome, but accurate long reads and linked reads now enable us to construct accurate, phased de novo assemblies. We focus on a medically important, highly variable, 5 million base-pair (bp) region where diploid assembly is particularly useful - the Major Histocompatibility Complex (MHC). Here, we develop a human genome benchmark derived from a diploid assembly for the openly-consented Genome in a Bottle sample HG002. We assemble a single contig for each haplotype, align them to the reference, call phased small and structural variants, and define a small variant benchmark for the MHC, covering 94% of the MHC and 22368 variants smaller than 50 bp, 49% more variants than a mapping-based benchmark. This benchmark reliably identifies errors in mapping-based callsets, and enables performance assessment in regions with much denser, complex variation than regions covered by previous benchmarks.
Citation
Nature Communications
Volume
11

Keywords

genomics, DNA sequencing, major histocompatibility complex, reference materials

Citation

Zook, J. , Wagner, J. , Chin, C. , Zeng, Q. , Dilthey, A. , Marschall, T. , Rautiainen, M. , Garrison, E. and Garg, S. (2020), A Diploid Assembly-based Benchmark for Variants in The Major Histocompatibility Complex, Nature Communications, [online], https://doi.org/10.1038/s41467-020-18564-9 (Accessed October 9, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created September 21, 2020, Updated March 1, 2021
Was this page helpful?