Back to Search Start Over

Curated variation benchmarks for challenging medically relevant autosomal genes.

Authors :
Wagner J
Olson ND
Harris L
McDaniel J
Cheng H
Fungtammasan A
Hwang YC
Gupta R
Wenger AM
Rowell WJ
Khan ZM
Farek J
Zhu Y
Pisupati A
Mahmoud M
Xiao C
Yoo B
Sahraeian SME
Miller DE
Jáspez D
Lorenzo-Salazar JM
Muñoz-Barrera A
Rubio-Rodríguez LA
Flores C
Narzisi G
Evani US
Clarke WE
Lee J
Mason CE
Lincoln SE
Miga KH
Ebbert MTW
Shumate A
Li H
Chin CS
Zook JM
Sedlazeck FJ
Source :
Nature biotechnology [Nat Biotechnol] 2022 May; Vol. 40 (5), pp. 672-680. Date of Electronic Publication: 2022 Feb 07.
Publication Year :
2022

Abstract

The repetitive nature and complexity of some medically relevant genes poses a challenge for their accurate analysis in a clinical setting. The Genome in a Bottle Consortium has provided variant benchmark sets, but these exclude nearly 400 medically relevant genes due to their repetitiveness or polymorphic complexity. Here, we characterize 273 of these 395 challenging autosomal genes using a haplotype-resolved whole-genome assembly. This curated benchmark reports over 17,000 single-nucleotide variations, 3,600 insertions and deletions and 200 structural variations each for human genome reference GRCh37 and GRCh38 across HG002. We show that false duplications in either GRCh37 or GRCh38 result in reference-specific, missed variants for short- and long-read technologies in medically relevant genes, including CBS, CRYAA and KCNE1. When masking these false duplications, variant recall can improve from 8% to 100%. Forming benchmarks from a haplotype-resolved whole-genome assembly may become a prototype for future benchmarks covering the whole genome.<br /> (© 2022. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.)

Details

Language :
English
ISSN :
1546-1696
Volume :
40
Issue :
5
Database :
MEDLINE
Journal :
Nature biotechnology
Publication Type :
Academic Journal
Accession number :
35132260
Full Text :
https://doi.org/10.1038/s41587-021-01158-1