Back to Search Start Over

Pangenome graphs improve the analysis of structural variants in rare genetic diseases.

Authors :
Groza C
Schwendinger-Schreck C
Cheung WA
Farrow EG
Thiffault I
Lake J
Rizzo WB
Evrony G
Curran T
Bourque G
Pastinen T
Source :
Nature communications [Nat Commun] 2024 Jan 22; Vol. 15 (1), pp. 657. Date of Electronic Publication: 2024 Jan 22.
Publication Year :
2024

Abstract

Rare DNA alterations that cause heritable diseases are only partially resolvable by clinical next-generation sequencing due to the difficulty of detecting structural variation (SV) in all genomic contexts. Long-read, high fidelity genome sequencing (HiFi-GS) detects SVs with increased sensitivity and enables assembling personal and graph genomes. We leverage standard reference genomes, public assemblies (n = 94) and a large collection of HiFi-GS data from a rare disease program (Genomic Answers for Kids, GA4K, n = 574 assemblies) to build a graph genome representing a unified SV callset in GA4K, identify common variation and prioritize SVs that are more likely to cause genetic disease (MAF < 0.01). Using graphs, we obtain a higher level of reproducibility than the standard reference approach. We observe over 200,000 SV alleles unique to GA4K, including nearly 1000 rare variants that impact coding sequence. With improved specificity for rare SVs, we isolate 30 candidate SVs in phenotypically prioritized genes, including known disease SVs. We isolate a novel diagnostic SV in KMT2E, demonstrating use of personal assemblies coupled with pangenome graphs for rare disease genomics. The community may interrogate our pangenome with additional assemblies to discover new SVs within the allele frequency spectrum relevant to genetic diseases.<br /> (© 2024. The Author(s).)

Details

Language :
English
ISSN :
2041-1723
Volume :
15
Issue :
1
Database :
MEDLINE
Journal :
Nature communications
Publication Type :
Academic Journal
Accession number :
38253606
Full Text :
https://doi.org/10.1038/s41467-024-44980-2