62 results on '"Weisburd B"'
Search Results
2. VP.81 In vivo modulation of novel genetic modifiers for LAMA2-RD
- Author
-
Pini, V., primary, Weisburd, B., additional, Merteroglu, M., additional, Sealy, I., additional, White, R., additional, Busch-Nentwich, E., additional, and Muntoni, F., additional
- Published
- 2022
- Full Text
- View/download PDF
3. CONGENITAL MUSCULAR DYSTROPHIES
- Author
-
Pini, V., primary, Weisburd, B., additional, Ganesh, V., additional, Troia, S. Di, additional, Catapano, F., additional, Aguti, S., additional, Busch-Nentwich, E., additional, and Muntoni, F., additional
- Published
- 2021
- Full Text
- View/download PDF
4. The mutational constraint spectrum quantified from variation in 141,456 humans
- Author
-
Karczewski, KJ, Francioli, LC, Tiao, G, Cummings, BB, Alföldi, J, Wang, Q, Collins, RL, Laricchia, KM, Ganna, A, Birnbaum, DP, Gauthier, LD, Brand, H, Solomonson, M, Watts, NA, Rhodes, D, Singer-Berk, M, England, EM, Seaby, EG, Kosmicki, JA, Walters, RK, Tashman, K, Farjoun, Y, Banks, E, Poterba, T, Wang, A, Seed, C, Whiffin, N, Chong, JX, Samocha, KE, Pierce-Hoffman, E, Zappala, Z, O’Donnell-Luria, AH, Vallabh Minikel, E, Weisburd, B, Lek, M, Ware, JS, Vittal, C, Armean, IM, Bergelson, L, Cibulskis, K, Connolly, JM, Covarrubias, M, Donnelly, S, Ferriera, S, Gabriel, S, Gentry, J, Gupta, N, Jeandet, T, Kaplan, D, Llanwarne, C, Munshi, J, Novod, S, Petrillo, N, Roazen, D, Ruano-Rubio, V, Saltzman, A, Schleicher, M, Soto, J, Tibbetts, K, Tolonen, C, Wade, G, Talkowski, ME, Genome Aggregation Database (gnomAD) Consortium, Neale, BM, Daly, MJ, MacArthur, DG, Tampere University, Clinical Medicine, Department of Clinical Chemistry, Wellcome Trust, Rosetrees Trust, Institute for Molecular Medicine Finland, Data Science Genetic Epidemiology Lab, Centre of Excellence in Complex Disease Genetics, Department of Medicine, Clinicum, Gastroenterologian yksikkö, HUS Abdominal Center, University Management, HUS Psychiatry, Department of Psychiatry, HUS Neurocenter, Department of Neurosciences, Neurologian yksikkö, Department of Public Health, Aarno Palotie / Principal Investigator, Genomics of Neurological and Neuropsychiatric Disorders, Samuli Olli Ripatti / Principal Investigator, Complex Disease Genetics, Biostatistics Helsinki, Doctoral Programme in Clinical Research, and Biosciences
- Subjects
Male ,Mutation rate ,ved/biology.organism_classification_rank.species ,VARIANTS ,Genome ,Whole Exome Sequencing ,Cohort Studies ,0302 clinical medicine ,Mutation Rate ,Loss of Function Mutation ,Databases, Genetic ,Exome ,Organism ,Exome sequencing ,0303 health sciences ,Genes, Essential ,Multidisciplinary ,1184 Genetics, developmental biology, physiology ,Brain ,Phenotype ,Multidisciplinary Sciences ,Cardiovascular Diseases ,Science & Technology - Other Topics ,Female ,Proprotein Convertase 9 ,BURDEN ,Medical genomics ,Adult ,General Science & Technology ,Computational biology ,Biology ,3121 Internal medicine ,Article ,03 medical and health sciences ,Humans ,Genetic Predisposition to Disease ,RNA, Messenger ,Model organism ,Gene ,030304 developmental biology ,Science & Technology ,Whole Genome Sequencing ,Genome, Human ,ved/biology ,Genetic Variation ,Reproducibility of Results ,Rare variants ,MODEL ,DE-NOVO MUTATIONS ,Genome Aggregation Database Consortium ,3111 Biomedicine ,030217 neurology & neurosurgery ,Function (biology) ,Genome-Wide Association Study - Abstract
Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes1. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases., A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.
- Published
- 2020
5. The mutational constraint spectrum quantified from variation in 141,456 humans (vol 581, pg 434, 2020)
- Author
-
Gudmundsson, S, Karczewski, KJ, Francioli, LC, Tiao, G, Cummings, BB, Alfoldi, J, Wang, Q, Collins, RL, Laricchia, KM, Ganna, A, Birnbaum, DP, Gauthier, LD, Brand, H, Solomonson, M, Watts, NA, Rhodes, D, Singer-Berk, M, England, EM, Seaby, EG, Kosmicki, JA, Walters, RK, Tashman, K, Farjoun, Y, Banks, E, Poterba, T, Wang, A, Seed, C, Whiffin, N, Chong, JX, Samocha, KE, Pierce-Hoffman, E, Zappala, Z, O'Donnell-Luria, AH, Minikel, EV, Weisburd, B, Lek, M, Ware, JS, Vittal, C, Armean, IM, Bergelson, L, Cibulskis, K, Connolly, KM, Covarrubias, M, Donnelly, S, Ferriera, S, Gabriel, S, Gentry, J, Gupta, N, Jeandet, T, Kaplan, D, Llanwarne, C, Munshi, R, Novod, S, Petrillo, N, Roazen, D, Ruano-Rubio, V, Saltzman, A, Schleicher, M, Soto, J, Tibbetts, K, Tolonen, C, Wade, G, Talkowski, ME, Neale, BM, Daly, MJ, and MacArthur, DG
- Subjects
Multidisciplinary Sciences ,Science & Technology ,General Science & Technology ,Genome Aggregation Database Consortium ,Science & Technology - Other Topics ,OF-FUNCTION VARIANTS - Published
- 2020
6. CONGENITAL MUSCULAR DYSTROPHIES: EP.70 Exploring the role of genetic modifiers in a mild LAMA2-RD case associated with a LAMA2 loss-of-function mutation
- Author
-
Pini, V., Weisburd, B., Ganesh, V., Troia, S. Di, Catapano, F., Aguti, S., Busch-Nentwich, E., and Muntoni, F.
- Published
- 2021
- Full Text
- View/download PDF
7. Decoding Human Cytomegalovirus
- Author
-
Stern-Ginossar, N., primary, Weisburd, B., additional, Michalski, A., additional, Le, V. T. K., additional, Hein, M. Y., additional, Huang, S.-X., additional, Ma, M., additional, Shen, B., additional, Qian, S.-B., additional, Hengel, H., additional, Mann, M., additional, Ingolia, N. T., additional, and Weissman, J. S., additional
- Published
- 2012
- Full Text
- View/download PDF
8. Genome Sequencing for Diagnosing Rare Diseases.
- Author
-
Wojcik, M. H., Lemire, G., Berger, E., Zaki, M. S., Wissmann, M., Win, W., White, S. M., Weisburd, B., Wieczorek, D., Waddell, L. B., Verboon, J. M., VanNoy, G. E., Töpf, A., Tan, T. Y., Syrbe, S., Strehlow, V., Straub, V., Stenton, S. L., Snow, H., and Singer-Berk, M.
- Subjects
- *
NUCLEOTIDE sequencing , *RARE diseases , *GENETIC testing , *GENOMICS , *TANDEM repeats - Abstract
BACKGROUND: Genetic variants that cause rare disorders may remain elusive even after expansive testing, such as exome sequencing. The diagnostic yield of genome sequencing, particularly after a negative evaluation, remains poorly defined. METHODS: We sequenced and analyzed the genomes of families with diverse phenotypes who were suspected to have a rare monogenic disease and for whom genetic testing had not revealed a diagnosis, as well as the genomes of a replication cohort at an independent clinical center. RESULTS: We sequenced the genomes of 822 families (744 in the initial cohort and 78 in the replication cohort) and made a molecular diagnosis in 218 of 744 families (29.3%). Of the 218 families, 61 (28.0%) -- 8.2% of families in the initial cohort -- had variants that required genome sequencing for identification, including coding variants, intronic variants, small structural variants, copy-neutral inversions, complex rearrangements, and tandem repeat expansions. Most families in which a molecular diagnosis was made after previous nondiagnostic exome sequencing (63.5%) had variants that could be detected by reanalysis of the exome-sequence data (53.4%) or by additional analytic methods, such as copy-number variant calling, to exome-sequence data (10.8%). We obtained similar results in the replication cohort: in 33% of the families in which a molecular diagnosis was made, or 8% of the cohort, genome sequencing was required, which showed the applicability of these findings to both research and clinical environments. CONCLUSIONS: The diagnostic yield of genome sequencing in a large, diverse research cohort and in a small clinical cohort of persons who had previously undergone genetic testing was approximately 8% and included several types of pathogenic variation that had not previously been detected by means of exome sequencing or other techniques. (Funded by the National Human Genome Research Institute and others.). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity
- Author
-
Saleheen, D, Natarajan, P, Armean, IM, Zhao, W, Rasheed, A, Khetarpal, SA, Won, H-H, Karczewski, KJ, O'Donnell-Luria, AH, Samocha, KE, Weisburd, B, Gupta, N, Zaidi, M, Samuel, M, Imran, A, Abbas, S, Majeed, F, Ishaq, M, Akhtar, S, Trindade, K, Mucksavage, M, Qamar, N, Zaman, KS, Yaqoob, Z, Saghir, T, Rizvi, SNH, Memon, A, Hayyat Mallick, N, Rasheed, SZ, Memon, F-U-R, Mahmood, K, Ahmed, N, Do, R, Krauss, RM, MacArthur, DG, Gabriel, S, Lander, ES, Daly, MJ, Frossard, P, Danesh, J, Rader, DJ, and Kathiresan, S
- Subjects
fasting ,genetic association studies ,phenotype ,postprandial period ,DNA Mutational Analysis ,Sodium-Hydrogen Antiporter ,gene frequency ,coronary disease ,reverse genetics ,consanguinity ,cohort studies ,male ,middle aged ,Pakistan ,Cytochrome P450 Family 2 ,genes ,humans ,triglycerides ,Apolipoprotein C-III ,RNA splice sites ,gene deletion ,neuregulins ,Interleukin-8 ,pedigree ,dietary fats ,3. Good health ,homozygote ,female ,myocardial infarction ,1-Alkyl-2-acetylglycerophosphocholine Esterase ,exome ,phosphoproteins - Abstract
A major goal of biomedicine is to understand the function of every gene in the human genome. Loss-of-function mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such 'human knockouts' can provide insight into gene function. Consanguineous unions are more likely to result in offspring carrying homozygous loss-of-function mutations. In Pakistan, consanguinity rates are notably high. Here we sequence the protein-coding regions of 10,503 adult participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS), designed to understand the determinants of cardiometabolic diseases in individuals from South Asia. We identified individuals carrying homozygous predicted loss-of-function (pLoF) mutations, and performed phenotypic analysis involving more than 200 biochemical and disease traits. We enumerated 49,138 rare (
10. Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes
- Author
-
Grarup, N., Pollin, T.I., Han, S., Ma, R.C.W., Baxter, S., Cho, Y.S., DeFronzo, R.A., Centeno-Cruz, F., England, E., Hwang, M.Y., Palmer, C.N.A., Tomlinson, B., Watts, N., Linneberg, A., Garc��a-Ortiz, H., Florez, J.C., Barzilai, N., Goodrich, J.K., Dornbos, P., Chambers, J.C., Gieger, C., So, W.Y., Morris, A.D., Dupuis, J., Mohlke, K.L., Kuusisto, J., Spector, T.D., Haiman, C., Nilsson, P.M., Wong, T.-Y., Cole, J.B., van Dam, R.M., Weisburd, B., Meigs, J.B., Tracy, R.P., Sveden, A., Teo, Y.Y., Psaty, B.M., Cheng, C.-Y., Lehman, D.M., Koesterer, R., Strauch, K., Mart��nez-Hern��ndez, A., Orozco, L., Bonnycastle, L.L., J��rgensen, M.E., MacArthur, D.G., Kwak, S.-H., Saleheen, D., O'Donnell-Luria, A., Morrison, A.C., Koistinen, H.A., Ng, M.C.Y., Udler, M.S., Garay-Sevilla, M.E., Boehnke, M., Revilla-Monsalve, C., Chan, E., Heard-Costa, N.L., Maloney, K.A., Post, W.S., Pedersen, O., Contreras-Cubas, C., Sladek, R., AMP-T2D-GENES Consortia, Park, K.S., Tuomilehto, J., Strom, T.M., Preuss, M., Gross, M., Burtt, N.P., Flannick, J., Correa, A., Lee, J.-Y., Mendoza-Caamal, E., Hanis, C.L., Schurmann, C., Dahl, A., Loos, R.J.F., Islas-Andrade, S., Sim, X., Zappala, Z., Vasan, R.S., Groop, L., Liu, J., Thameem, F., Zhang, H., Henderson, B.E., Gonz��lez-Villalpando, C., Meitinger, T., Gonzalez, M.E., Laakso, M., Tusi��-Luna, T., Lee, J., Barajas-Olmos, F., Glaser, B., Wilson, J.G., Reiner, A.P.s, Rotter, J.I., McCarthy, M.I., Kooner, J.S., Duggirala, R., Lange, L., Small, K.S., Tuomi, T., Boerwinkle, E., Bowden, D.W., Kim, B.-J., Rich, S.S., Atzmon, G., C��rdova, E., Caulkins, L., Wood, J., Singer-Berk, M., Chami, N., Lyssenko, V., Mercader, J.M., Kim, Y.J., Aguilar-Salinas, C.A., Tam, C.H.T., O'Donnell, C.J., Hernandez, J.M.M., Son, R., Zaitlen, N., Tai, E.S., Hansen, T., Kang, H.M., Witte, D.R., Bottinger, E., Chan, J., and Blangero, J.
- Subjects
3. Good health - Abstract
Hundreds of thousands of genetic variants have been reported to cause severe monogenic diseases, but the probability that a variant carrier develops the disease (termed penetrance) is unknown for virtually all of them. Additionally, the clinical utility of common polygenetic variation remains uncertain. Using exome sequencing from 77,184 adult individuals (38,618 multi-ancestral individuals from a type 2 diabetes case-control study and 38,566 participants from the UK Biobank, for whom genotype array data were also available), we apply clinical standard-of-care gene variant curation for eight monogenic metabolic conditions. Rare variants causing monogenic diabetes and dyslipidemias display effect sizes significantly larger than the top 1% of the corresponding polygenic scores. Nevertheless, penetrance estimates for monogenic variant carriers average 60% or lower for most conditions. We assess epidemiologic and genetic factors contributing to risk prediction in monogenic variant carriers, demonstrating that inclusion of polygenic variation significantly improves biomarker estimation for two monogenic dyslipidemias.
11. A CCG expansion in ABCD3 causes oculopharyngodistal myopathy in individuals of European ancestry.
- Author
-
Cortese A, Beecroft SJ, Facchini S, Curro R, Cabrera-Serrano M, Stevanovski I, Chintalaphani SR, Gamaarachchi H, Weisburd B, Folland C, Monahan G, Scriba CK, Dofash L, Johari M, Grosz BR, Ellis M, Fearnley LG, Tankard R, Read J, Merve A, Dominik N, Vegezzi E, Schnekenberg RP, Fernandez-Eulate G, Masingue M, Giovannini D, Delatycki MB, Storey E, Gardner M, Amor DJ, Nicholson G, Vucic S, Henderson RD, Robertson T, Dyke J, Fabian V, Mastaglia F, Davis MR, Kennerson M, Quinlivan R, Hammans S, Tucci A, Bahlo M, McLean CA, Laing NG, Stojkovic T, Houlden H, Hanna MG, Deveson IW, Lockhart PJ, Lamont PJ, Fahey MC, Bugiardini E, and Ravenscroft G
- Subjects
- Humans, Male, Female, Adult, Middle Aged, ATP-Binding Cassette Transporters genetics, Myopathies, Structural, Congenital genetics, Myopathies, Structural, Congenital pathology, Pedigree, Aged, Young Adult, Fibroblasts metabolism, Fibroblasts pathology, Muscle Weakness genetics, Muscle Weakness pathology, Adolescent, Muscular Dystrophies, Trinucleotide Repeat Expansion genetics, White People genetics, Muscle, Skeletal pathology
- Abstract
Oculopharyngodistal myopathy (OPDM) is an inherited myopathy manifesting with ptosis, dysphagia and distal weakness. Pathologically it is characterised by rimmed vacuoles and intranuclear inclusions on muscle biopsy. In recent years CGG • CCG repeat expansion in four different genes were identified in OPDM individuals in Asian populations. None of these have been found in affected individuals of non-Asian ancestry. In this study we describe the identification of CCG expansions in ABCD3, ranging from 118 to 694 repeats, in 35 affected individuals across eight unrelated OPDM families of European ancestry. ABCD3 transcript appears upregulated in fibroblasts and skeletal muscle from OPDM individuals, suggesting a potential role of over-expression of CCG repeat containing ABCD3 transcript in progressive skeletal muscle degeneration. The study provides further evidence of the role of non-coding repeat expansions in unsolved neuromuscular diseases and strengthens the association between the CGG • CCG repeat motif and a specific pattern of muscle weakness., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
12. Expanding the genetics and phenotypes of ocular congenital cranial dysinnervation disorders.
- Author
-
Jurgens JA, Barry BJ, Chan WM, MacKinnon S, Whitman MC, Matos Ruiz PM, Pratt BM, England EM, Pais L, Lemire G, Groopman E, Glaze C, Russell KA, Singer-Berk M, Di Gioia SA, Lee AS, Andrews C, Shaaban S, Wirth MM, Bekele S, Toffoloni M, Bradford VR, Foster EE, Berube L, Rivera-Quiles C, Mensching FM, Sanchis-Juan A, Fu JM, Wong I, Zhao X, Wilson MW, Weisburd B, Lek M, Brand H, Talkowski ME, MacArthur DG, O'Donnell-Luria A, Robson CD, Hunter DG, and Engle EC
- Abstract
Purpose: To identify genetic etiologies and genotype/phenotype associations for unsolved ocular congenital cranial dysinnervation disorders (oCCDDs)., Methods: We coupled phenotyping with exome or genome sequencing of 467 probands (550 affected and 1108 total individuals) with genetically unsolved oCCDDs, integrating analyses of pedigrees, human and animal model phenotypes, and de novo variants to identify rare candidate single nucleotide variants, insertion/deletions, and structural variants disrupting protein-coding regions. Prioritized variants were classified for pathogenicity and evaluated for genotype/phenotype correlations., Results: Analyses elucidated phenotypic subgroups, identified pathogenic/likely pathogenic variant(s) in 43/467 probands (9.2%), and prioritized variants of uncertain significance in 70/467 additional probands (15.0%). These included known and novel variants in established oCCDD genes, genes associated with syndromes that sometimes include oCCDDs (e.g., MYH10, KIF21B, TGFBR2, TUBB6), genes that fit the syndromic component of the phenotype but had no prior oCCDD association (e.g., CDK13, TGFB2), genes with no reported association with oCCDDs or the syndromic phenotypes (e.g., TUBA4A, KIF5C, CTNNA1, KLB, FGF21), and genes associated with oCCDD phenocopies that had resulted in misdiagnoses., Conclusion: This study suggests that unsolved oCCDDs are clinically and genetically heterogeneous disorders often overlapping other Mendelian conditions and nominates many candidates for future replication and functional studies., (Copyright © 2024. Published by Elsevier Inc.)
- Published
- 2024
- Full Text
- View/download PDF
13. Diagnosing missed cases of spinal muscular atrophy in genome, exome, and panel sequencing datasets.
- Author
-
Weisburd B, Sharma R, Pata V, Reimand T, Ganesh VS, Austin-Tse C, Osei-Owusu I, O'Heir E, O'Leary M, Pais L, Stafki SA, Daugherty AL, Folland C, Perić S, Fahmy N, Udd B, Horakova M, Łusakowska A, Manoj R, Nalini A, Karcagi V, Polavarapu K, Lochmüller H, Horvath R, Bönnemann CG, Donkervoort S, Haliloğlu G, Herguner O, Kang PB, Ravenscroft G, Laing N, Scott HS, Töpf A, Straub V, Pajusalu S, Õunap K, Tiao G, Rehm HL, and O'Donnell-Luria A
- Abstract
Spinal muscular atrophy (SMA) is a genetic disorder that causes progressive degeneration of lower motor neurons and the subsequent loss of muscle function throughout the body. It is the second most common recessive disorder in individuals of European descent and is present in all populations. Accurate tools exist for diagnosing SMA from genome sequencing data. However, there are no publicly available tools for GRCh38-aligned data from panel or exome sequencing assays which continue to be used as first line tests for neuromuscular disorders. This deficiency creates a critical gap in our ability to diagnose SMA in large existing rare disease cohorts, as well as newly sequenced exome and panel datasets. We therefore developed and extensively validated a new tool - SMA Finder - that can diagnose SMA not only in genome, but also exome and panel sequencing samples aligned to GRCh37, GRCh38, or T2T-CHM13. It works by evaluating aligned reads that overlap the c.840 position of SMN1 and SMN2 in order to detect the most common molecular causes of SMA. We applied SMA Finder to 16,626 exomes and 3,911 genomes from heterogeneous rare disease cohorts sequenced at the Broad Institute Center for Mendelian Genomics as well as 1,157 exomes and 8,762 panel sequencing samples from Tartu University Hospital. SMA Finder correctly identified all 16 known SMA cases and reported nine novel diagnoses which have since been confirmed by clinical testing, with another four novel diagnoses undergoing validation. Notably, out of the 29 total SMA positive cases, 23 had an initial clinical diagnosis of muscular dystrophy, congenital myasthenic syndrome, or myopathy. This underscored the frequency with which SMA can be misdiagnosed as other neuromuscular disorders and confirmed the utility of using SMA Finder to reanalyze phenotypically diverse neuromuscular disease cohorts. Finally, we evaluated SMA Finder on 198,868 individuals that had both exome and genome sequencing data within the UK Biobank (UKBB) and found that SMA Finder's overall false positive rate was less than 1 / 200,000 exome samples, and its positive predictive value (PPV) was 97%. We also observed 100% concordance between UKBB exome and genome calls. This analysis showed that, even though it is located within a segmental duplication, the most common causal variant for SMA can be detected with comparable accuracy to monogenic disease variants in non-repetitive regions. Additionally, the high PPV demonstrated by SMA Finder, the existence of treatment options for SMA in which early diagnosis is imperative for therapeutic benefit, as well as widespread availability of clinical confirmatory testing for SMA, warrants the addition of SMN1 to the ACMG list of genes with reportable secondary findings after genome and exome sequencing., Competing Interests: HLR receives research funding from Microsoft and previously received funding from Illumina to support rare disease gene discovery and diagnosis. AODL has consulted for Tome Biosciences, Ono Pharma USA Inc, and Addition Therapeutics, and is member of the scientific advisory board for Congenica Inc and the Simons Foundation SPARK for Autism study. AL received honoraria for speaking at educational events for Biogen, PTC and Roche, is a subinvestigator in clinical trials by Roche and PTC, and is involved in a project supported by Biogen (POL-SMA-17-11166). PBK has received research support from ML Bio and Sarepta Therapeutics, and has consulted for Lupin, Neurogene, NS Pharma, and Teneofour.
- Published
- 2024
- Full Text
- View/download PDF
14. STRchive: a dynamic resource detailing population-level and locus-specific insights at tandem repeat disease loci.
- Author
-
Hiatt L, Weisburd B, Dolzhenko E, VanNoy GE, Kurtas EN, Rehm HL, Quinlan A, and Dashnow H
- Abstract
Approximately 3% of the human genome consists of repetitive elements called tandem repeats (TRs), which include short tandem repeats (STRs) of 1-6bp motifs and variable number tandem repeats (VNTRs) of 7+bp motifs. TR variants contribute to several dozen mono- and polygenic diseases but remain understudied and "enigmatic," particularly relative to single nucleotide variants. It remains comparatively challenging to interpret the clinical significance of TR variants. Although existing resources provide portions of necessary data for interpretation at disease-associated loci, it is currently difficult or impossible to efficiently invoke the additional details critical to proper interpretation, such as motif pathogenicity, disease penetrance, and age of onset distributions. It is also often unclear how to apply population information to analyses. We present STRchive (S-T-archive, http://strchive.org/ ), a dynamic resource consolidating information on TR disease loci in humans from research literature, up-to-date clinical resources, and large-scale genomic databases, with the goal of streamlining TR variant interpretation at disease-associated loci. We apply STRchive -including pathogenic thresholds, motif classification, and clinical phenotypes-to a gnomAD cohort of ∼18.5k individuals genotyped at 60 disease-associated loci. Through detailed literature curation, we demonstrate that the majority of TR diseases affect children despite being thought of as adult diseases. Additionally, we show that pathogenic genotypes can be found within gnomAD which do not necessarily overlap with known disease prevalence, and leverage STRchive to interpret locus-specific findings therein. We apply a diagnostic blueprint empowered by STRchive to relevant clinical vignettes, highlighting possible pitfalls in TR variant interpretation. As a living resource, STRchive is maintained by experts, takes community contributions, and will evolve as understanding of TR diseases progresses.
- Published
- 2024
- Full Text
- View/download PDF
15. The landscape of regional missense mutational intolerance quantified from 125,748 exomes.
- Author
-
Chao KR, Wang L, Panchal R, Liao C, Abderrazzaq H, Ye R, Schultz P, Compitello J, Grant RH, Kosmicki JA, Weisburd B, Phu W, Wilson MW, Laricchia KM, Goodrich JK, Goldstein D, Goldstein JI, Vittal C, Poterba T, Baxter S, Watts NA, Solomonson M, Tiao G, Rehm HL, Neale BM, Talkowski ME, MacArthur DG, O'Donnell-Luria A, Karczewski KJ, Radivojac P, Daly MJ, and Samocha KE
- Abstract
Missense variants can have a range of functional impacts depending on factors such as the specific amino acid substitution and location within the gene. To interpret their deleteriousness, studies have sought to identify regions within genes that are specifically intolerant of missense variation
1-12 . Here, we leverage the patterns of rare missense variation in 125,748 individuals in the Genome Aggregation Database (gnomAD)13 against a null mutational model to identify transcripts that display regional differences in missense constraint. Missense-depleted regions are enriched for ClinVar14 pathogenic variants, de novo missense variants from individuals with neurodevelopmental disorders (NDDs)15,16 , and complex trait heritability. Following ClinGen calibration recommendations for the ACMG/AMP guidelines, we establish that regions with less than 20% of their expected missense variation achieve moderate support for pathogenicity. We create a missense deleteriousness metric (MPC) that incorporates regional constraint and outperforms other deleteriousness scores at stratifying case and control de novo missense variation, with a strong enrichment in NDDs. These results provide additional tools to aid in missense variant interpretation.- Published
- 2024
- Full Text
- View/download PDF
16. Genome and RNA sequencing boost neuromuscular diagnoses to 62% from 34% with exome sequencing alone.
- Author
-
Marchant RG, Bryen SJ, Bahlo M, Cairns A, Chao KR, Corbett A, Davis MR, Ganesh VS, Ghaoui R, Jones KJ, Kornberg AJ, Lek M, Liang C, MacArthur DG, Oates EC, O'Donnell-Luria A, O'Grady GL, Osei-Owusu IA, Rafehi H, Reddel SW, Roxburgh RH, Ryan MM, Sandaradura SA, Scott LW, Valkanas E, Weisburd B, Young H, Evesson FJ, Waddell LB, and Cooper ST
- Subjects
- Humans, Male, Female, Adult, Sequence Analysis, RNA methods, Child, Adolescent, Exome genetics, Middle Aged, Young Adult, Child, Preschool, High-Throughput Nucleotide Sequencing, Infant, Genetic Testing methods, Neuromuscular Diseases genetics, Neuromuscular Diseases diagnosis, Exome Sequencing
- Abstract
Objective: Most families with heritable neuromuscular disorders do not receive a molecular diagnosis. Here we evaluate diagnostic utility of exome, genome, RNA sequencing, and protein studies and provide evidence-based recommendations for their integration into practice., Methods: In total, 247 families with suspected monogenic neuromuscular disorders who remained without a genetic diagnosis after standard diagnostic investigations underwent research-led massively parallel sequencing: neuromuscular disorder gene panel, exome, genome, and/or RNA sequencing to identify causal variants. Protein and RNA studies were also deployed when required., Results: Integration of exome sequencing and auxiliary genome, RNA and/or protein studies identified causal or likely causal variants in 62% (152 out of 247) of families. Exome sequencing alone informed 55% (83 out of 152) of diagnoses, with remaining diagnoses (45%; 69 out of 152) requiring genome sequencing, RNA and/or protein studies to identify variants and/or support pathogenicity. Arrestingly, novel disease genes accounted for <4% (6 out of 152) of diagnoses while 36.2% of solved families (55 out of 152) harbored at least one splice-altering or structural variant in a known neuromuscular disorder gene. We posit that contemporary neuromuscular disorder gene-panel sequencing could likely provide 66% (100 out of 152) of our diagnoses today., Interpretation: Our results emphasize thorough clinical phenotyping to enable deep scrutiny of all rare genetic variation in phenotypically consistent genes. Post-exome auxiliary investigations extended our diagnostic yield by 81% overall (34-62%). We present a diagnostic algorithm that details deployment of genomic and auxiliary investigations to obtain these diagnoses today most effectively. We hope this provides a practical guide for clinicians as they gain greater access to clinical genome and transcriptome sequencing., (© 2024 The Authors. Annals of Clinical and Translational Neurology published by Wiley Periodicals LLC on behalf of American Neurological Association.)
- Published
- 2024
- Full Text
- View/download PDF
17. Narrowing the diagnostic gap: Genomes, episignatures, long-read sequencing, and health economic analyses in an exome-negative intellectual disability cohort.
- Author
-
Dias KR, Shrestha R, Schofield D, Evans CA, O'Heir E, Zhu Y, Zhang F, Standen K, Weisburd B, Stenton SL, Sanchis-Juan A, Brand H, Talkowski ME, Ma A, Ghedia S, Wilson M, Sandaradura SA, Smith J, Kamien B, Turner A, Bakshi M, Adès LC, Mowat D, Regan M, McGillivray G, Savarirayan R, White SM, Tan TY, Stark Z, Brown NJ, Pérez-Jurado LA, Krzesinski E, Hunter MF, Akesson L, Fennell AP, Yeung A, Boughtwood T, Ewans LJ, Kerkhof J, Lucas C, Carey L, French H, Rapadas M, Stevanovski I, Deveson IW, Cliffe C, Elakis G, Kirk EP, Dudding-Byth T, Fletcher J, Walsh R, Corbett MA, Kroes T, Gecz J, Meldrum C, Cliffe S, Wall M, Lunke S, North K, Amor DJ, Field M, Sadikovic B, Buckley MF, O'Donnell-Luria A, and Roscioli T
- Subjects
- Humans, Male, Female, Cohort Studies, Genetic Testing economics, Genetic Testing methods, Whole Genome Sequencing economics, Child, Genome, Human genetics, DNA Copy Number Variations genetics, Polymorphism, Single Nucleotide genetics, Child, Preschool, Intellectual Disability genetics, Intellectual Disability diagnosis, Exome genetics, Exome Sequencing economics
- Abstract
Purpose: Genome sequencing (GS)-specific diagnostic rates in prospective tightly ascertained exome sequencing (ES)-negative intellectual disability (ID) cohorts have not been reported extensively., Methods: ES, GS, epigenetic signatures, and long-read sequencing diagnoses were assessed in 74 trios with at least moderate ID., Results: The ES diagnostic yield was 42 of 74 (57%). GS diagnoses were made in 9 of 32 (28%) ES-unresolved families. Repeated ES with a contemporary pipeline on the GS-diagnosed families identified 8 of 9 single-nucleotide variations/copy-number variations undetected in older ES, confirming a GS-unique diagnostic rate of 1 in 32 (3%). Episignatures contributed diagnostic information in 9% with GS corroboration in 1 of 32 (3%) and diagnostic clues in 2 of 32 (6%). A genetic etiology for ID was detected in 51 of 74 (69%) families. Twelve candidate disease genes were identified. Contemporary ES followed by GS cost US$4976 (95% CI: $3704; $6969) per diagnosis and first-line GS at a cost of $7062 (95% CI: $6210; $8475) per diagnosis., Conclusion: Performing GS only in ID trios would be cost equivalent to ES if GS were available at $2435, about a 60% reduction from current prices. This study demonstrates that first-line GS achieves higher diagnostic rate than contemporary ES but at a higher cost., Competing Interests: Conflict of Interest The authors declare no conflicts of interest., (Copyright © 2024 American College of Medical Genetics and Genomics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
18. Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project.
- Author
-
Stenton SL, O'Leary MC, Lemire G, VanNoy GE, DiTroia S, Ganesh VS, Groopman E, O'Heir E, Mangilog B, Osei-Owusu I, Pais LS, Serrano J, Singer-Berk M, Weisburd B, Wilson MW, Austin-Tse C, Abdelhakim M, Althagafi A, Babbi G, Bellazzi R, Bovo S, Carta MG, Casadio R, Coenen PJ, De Paoli F, Floris M, Gajapathy M, Hoehndorf R, Jacobsen JOB, Joseph T, Kamandula A, Katsonis P, Kint C, Lichtarge O, Limongelli I, Lu Y, Magni P, Mamidi TKK, Martelli PL, Mulargia M, Nicora G, Nykamp K, Pejaver V, Peng Y, Pham THC, Podda MS, Rao A, Rizzo E, Saipradeep VG, Savojardo C, Schols P, Shen Y, Sivadasan N, Smedley D, Soru D, Srinivasan R, Sun Y, Sunderam U, Tan W, Tiwari N, Wang X, Wang Y, Williams A, Worthey EA, Yin R, You Y, Zeiberg D, Zucca S, Bakolitsa C, Brenner SE, Fullerton SM, Radivojac P, Rehm HL, and O'Donnell-Luria A
- Subjects
- Humans, Genome, Human genetics, Genetic Variation genetics, Computational Biology methods, Phenotype, Rare Diseases genetics, Rare Diseases diagnosis
- Abstract
Background: A major obstacle faced by families with rare diseases is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years and causal variants are identified in under 50%, even when capturing variants genome-wide. To aid in the interpretation and prioritization of the vast number of variants detected, computational methods are proliferating. Knowing which tools are most effective remains unclear. To evaluate the performance of computational methods, and to encourage innovation in method development, we designed a Critical Assessment of Genome Interpretation (CAGI) community challenge to place variant prioritization models head-to-head in a real-life clinical diagnostic setting., Methods: We utilized genome sequencing (GS) data from families sequenced in the Rare Genomes Project (RGP), a direct-to-participant research study on the utility of GS for rare disease diagnosis and gene discovery. Challenge predictors were provided with a dataset of variant calls and phenotype terms from 175 RGP individuals (65 families), including 35 solved training set families with causal variants specified, and 30 unlabeled test set families (14 solved, 16 unsolved). We tasked teams to identify causal variants in as many families as possible. Predictors submitted variant predictions with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on the rank position of causal variants, and the maximum F-measure, based on precision and recall of causal variants across all EPCR values., Results: Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performers recalled causal variants in up to 13 of 14 solved families within the top 5 ranked variants. Newly discovered diagnostic variants were returned to two previously unsolved families following confirmatory RNA sequencing, and two novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS, identified in trans with a frameshift variant in an unsolved proband with phenotypes consistent with asparagine synthetase deficiency., Conclusions: Model methodology and performance was highly variable. Models weighing call quality, allele frequency, predicted deleteriousness, segregation, and phenotype were effective in identifying causal variants, and models open to phenotype expansion and non-coding variants were able to capture more difficult diagnoses and discover new diagnoses. Overall, computational models can significantly aid variant prioritization. For use in diagnostics, detailed review and conservative assessment of prioritized variants against established criteria is needed., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
19. Novel syndromic neurodevelopmental disorder caused by de novo deletion of CHASERR , a long noncoding RNA.
- Author
-
Ganesh VS, Riquin K, Chatron N, Lamar KM, Aziz MC, Monin P, O'Leary M, Goodrich JK, Garimella KV, England E, Yoon E, Weisburd B, Aguet F, Bacino CA, Murdock DR, Dai H, Rosenfeld JA, Emrick LT, Ketkar S, Sarusi Y, Sanlaville D, Kayani S, Broadbent B, Isidor B, Pengam A, Cogné B, MacArthur DG, Ulitsky I, Carvill GL, and O'Donnell-Luria A
- Abstract
Genes encoding long non-coding RNAs (lncRNAs) comprise a large fraction of the human genome, yet haploinsufficiency of a lncRNA has not been shown to cause a Mendelian disease. CHASERR is a highly conserved human lncRNA adjacent to CHD2- a coding gene in which de novo loss-of-function variants cause developmental and epileptic encephalopathy. Here we report three unrelated individuals each harboring an ultra-rare heterozygous de novo deletion in the CHASERR locus. We report similarities in severe developmental delay, facial dysmorphisms, and cerebral dysmyelination in these individuals, distinguishing them from the phenotypic spectrum of CHD2 haploinsufficiency. We demonstrate reduced CHASERR mRNA expression and corresponding increased CHD2 mRNA and protein in whole blood and patient-derived cell lines-specifically increased expression of the CHD2 allele in cis with the CHASERR deletion, as predicted from a prior mouse model of Chaserr haploinsufficiency. We show for the first time that de novo structural variants facilitated by Alu-mediated non-allelic homologous recombination led to deletion of a non-coding element (the lncRNA CHASERR ) to cause a rare syndromic neurodevelopmental disorder. We also demonstrate that CHD2 has bidirectional dosage sensitivity in human disease. This work highlights the need to carefully evaluate other lncRNAs, particularly those upstream of genes associated with Mendelian disorders.
- Published
- 2024
- Full Text
- View/download PDF
20. Systematic evaluation of genome sequencing for the diagnostic assessment of autism spectrum disorder and fetal structural anomalies.
- Author
-
Lowther C, Valkanas E, Giordano JL, Wang HZ, Currall BB, O'Keefe K, Pierce-Hoffman E, Kurtas NE, Whelan CW, Hao SP, Weisburd B, Jalili V, Fu J, Wong I, Collins RL, Zhao X, Austin-Tse CA, Evangelista E, Lemire G, Aggarwal VS, Lucente D, Gauthier LD, Tolonen C, Sahakian N, Stevens C, An JY, Dong S, Norton ME, MacKenzie TC, Devlin B, Gilmore K, Powell BC, Brandt A, Vetrini F, DiVito M, Sanders SJ, MacArthur DG, Hodge JC, O'Donnell-Luria A, Rehm HL, Vora NL, Levy B, Brand H, Wapner RJ, and Talkowski ME
- Subjects
- Female, Pregnancy, Humans, Pregnancy Trimester, First, Ultrasonography, Prenatal, Chromosome Mapping, Exome, Autism Spectrum Disorder diagnosis, Autism Spectrum Disorder genetics
- Abstract
Short-read genome sequencing (GS) holds the promise of becoming the primary diagnostic approach for the assessment of autism spectrum disorder (ASD) and fetal structural anomalies (FSAs). However, few studies have comprehensively evaluated its performance against current standard-of-care diagnostic tests: karyotype, chromosomal microarray (CMA), and exome sequencing (ES). To assess the clinical utility of GS, we compared its diagnostic yield against these three tests in 1,612 quartet families including an individual with ASD and in 295 prenatal families. Our GS analytic framework identified a diagnostic variant in 7.8% of ASD probands, almost 2-fold more than CMA (4.3%) and 3-fold more than ES (2.7%). However, when we systematically captured copy-number variants (CNVs) from the exome data, the diagnostic yield of ES (7.4%) was brought much closer to, but did not surpass, GS. Similarly, we estimated that GS could achieve an overall diagnostic yield of 46.1% in unselected FSAs, representing a 17.2% increased yield over karyotype, 14.1% over CMA, and 4.1% over ES with CNV calling or 36.1% increase without CNV discovery. Overall, GS provided an added diagnostic yield of 0.4% and 0.8% beyond the combination of all three standard-of-care tests in ASD and FSAs, respectively. This corresponded to nine GS unique diagnostic variants, including sequence variants in exons not captured by ES, structural variants (SVs) inaccessible to existing standard-of-care tests, and SVs where the resolution of GS changed variant classification. Overall, this large-scale evaluation demonstrated that GS significantly outperforms each individual standard-of-care test while also outperforming the combination of all three tests, thus warranting consideration as the first-tier diagnostic approach for the assessment of ASD and FSAs., Competing Interests: Declaration of interests M.E.T. and H.R. receive research funding from Microsoft Inc and/or research reagents from Illumina Inc. M.E.T. also received research funding from Levo Therapeutics and research reagents from Ionis Therapeutics for unrelated research projects., (Copyright © 2023 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
21. Unique Capabilities of Genome Sequencing for Rare Disease Diagnosis.
- Author
-
Wojcik MH, Lemire G, Zaki MS, Wissman M, Win W, White S, Weisburd B, Waddell LB, Verboon JM, VanNoy GE, Töpf A, Tan TY, Straub V, Stenton SL, Snow H, Singer-Berk M, Silver J, Shril S, Seaby EG, Schneider R, Sankaran VG, Sanchis-Juan A, Russell KA, Reinson K, Ravenscroft G, Pierce EA, Place EM, Pajusalu S, Pais L, Õunap K, Osei-Owusu I, Okur V, Oja KT, O'Leary M, O'Heir E, Morel C, Marchant RG, Mangilog BE, Madden JA, MacArthur D, Lovgren A, Lerner-Ellis JP, Lin J, Laing N, Hildebrandt F, Groopman E, Goodrich J, Gleeson JG, Ghaoui R, Genetti CA, Gazda HT, Ganesh VS, Ganapathy M, Gallacher L, Fu J, Evangelista E, England E, Donkervoort S, DiTroia S, Cooper ST, Chung WK, Christodoulou J, Chao KR, Cato LD, Bujakowska KM, Bryen SJ, Brand H, Bonnemann C, Beggs AH, Baxter SM, Agrawal PB, Talkowski M, Austin-Tse C, Rehm HL, and O'Donnell-Luria A
- Abstract
Background: Causal variants underlying rare disorders may remain elusive even after expansive gene panels or exome sequencing (ES). Clinicians and researchers may then turn to genome sequencing (GS), though the added value of this technique and its optimal use remain poorly defined. We therefore investigated the advantages of GS within a phenotypically diverse cohort., Methods: GS was performed for 744 individuals with rare disease who were genetically undiagnosed. Analysis included review of single nucleotide, indel, structural, and mitochondrial variants., Results: We successfully solved 218/744 (29.3%) cases using GS, with most solves involving established disease genes (157/218, 72.0%). Of all solved cases, 148 (67.9%) had previously had non-diagnostic ES. We systematically evaluated the 218 causal variants for features requiring GS to identify and 61/218 (28.0%) met these criteria, representing 8.2% of the entire cohort. These included small structural variants (13), copy neutral inversions and complex rearrangements (8), tandem repeat expansions (6), deep intronic variants (15), and coding variants that may be more easily found using GS related to uniformity of coverage (19)., Conclusion: We describe the diagnostic yield of GS in a large and diverse cohort, illustrating several types of pathogenic variation eluding ES or other techniques. Our results reveal a higher diagnostic yield of GS, supporting the utility of a genome-first approach, with consideration of GS as a secondary or tertiary test when higher-resolution structural variant analysis is needed or there is a strong clinical suspicion for a condition and prior targeted genetic testing has been negative.
- Published
- 2023
- Full Text
- View/download PDF
22. Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project.
- Author
-
Stenton SL, O'Leary M, Lemire G, VanNoy GE, DiTroia S, Ganesh VS, Groopman E, O'Heir E, Mangilog B, Osei-Owusu I, Pais LS, Serrano J, Singer-Berk M, Weisburd B, Wilson M, Austin-Tse C, Abdelhakim M, Althagafi A, Babbi G, Bellazzi R, Bovo S, Carta MG, Casadio R, Coenen PJ, De Paoli F, Floris M, Gajapathy M, Hoehndorf R, Jacobsen JOB, Joseph T, Kamandula A, Katsonis P, Kint C, Lichtarge O, Limongelli I, Lu Y, Magni P, Mamidi TKK, Martelli PL, Mulargia M, Nicora G, Nykamp K, Pejaver V, Peng Y, Pham THC, Podda MS, Rao A, Rizzo E, Saipradeep VG, Savojardo C, Schols P, Shen Y, Sivadasan N, Smedley D, Soru D, Srinivasan R, Sun Y, Sunderam U, Tan W, Tiwari N, Wang X, Wang Y, Williams A, Worthey EA, Yin R, You Y, Zeiberg D, Zucca S, Bakolitsa C, Brenner SE, Fullerton SM, Radivojac P, Rehm HL, and O'Donnell-Luria A
- Abstract
Background: A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery. Families are consented for sharing of sequence and phenotype data with researchers, allowing development of a Critical Assessment of Genome Interpretation (CAGI) community challenge, placing variant prioritization models head-to-head in a real-life clinical diagnostic setting., Methods: Predictors were provided a dataset of phenotype terms and variant calls from GS of 175 RGP individuals (65 families), including 35 solved training set families, with causal variants specified, and 30 test set families (14 solved, 16 unsolved). The challenge tasked teams with identifying the causal variants in as many test set families as possible. Ranked variant predictions were submitted with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on rank position of true positive causal variants and maximum F-measure, based on precision and recall of causal variants across EPCR thresholds., Results: Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performing teams recalled the causal variants in up to 13 of 14 solved families by prioritizing high quality variant calls that were rare, predicted deleterious, segregating correctly, and consistent with reported phenotype. In unsolved families, newly discovered diagnostic variants were returned to two families following confirmatory RNA sequencing, and two prioritized novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS , identified in trans with a frameshift variant, in an unsolved proband with phenotype overlap with asparagine synthetase deficiency., Conclusions: By objective assessment of variant predictions, we provide insights into current state-of-the-art algorithms and platforms for genome sequencing analysis for rare disease diagnosis and explore areas for future optimization. Identification of diagnostic variants in unsolved families promotes synergy between researchers with clinical and computational expertise as a means of advancing the field of clinical genome interpretation., Competing Interests: Competing interests. Authors S.Z., I.L., E.R., P.M., and R.B., own shares of enGenome srl. Authors F.D.P. and G.N. are employees of enGenome srl. Authors T.J., R.S., S.G.V., N.S., A.R., U.S., N.T., are employees of TCS Ltd. Authors P.J.C., C.K., K.N., and P.S. are employees of Invitae Ltd. H.L.R. receives support from Illumina and Microsoft for rare disease gene discovery and diagnosis. A.O’D-L. is a member of the scientific advisory board for Congenica Inc and the Simons Foundation SPARK for Autism study and co-chairs the clinical advisory board for CAGI. S.E.B receives support at UC Berkeley from a research agreement from TCS. All other authors report no competing interests.
- Published
- 2023
- Full Text
- View/download PDF
23. RFC1 in an Australasian neurological disease cohort: extending the genetic heterogeneity and implications for diagnostics.
- Author
-
Scriba CK, Stevanovski I, Chintalaphani SR, Gamaarachchi H, Ghaoui R, Ghia D, Henderson RD, Jordan N, Winkel A, Lamont PJ, Rodrigues MJ, Roxburgh RH, Weisburd B, Laing NG, Deveson IW, Davis MR, and Ravenscroft G
- Abstract
Cerebellar ataxia, neuropathy and vestibular areflexia syndrome is a progressive, generally late-onset, neurological disorder associated with biallelic pentanucleotide expansions in Intron 2 of the RFC1 gene. The locus exhibits substantial genetic variability, with multiple pathogenic and benign pentanucleotide repeat alleles previously identified. To determine the contribution of pathogenic RFC1 expansions to neurological disease within an Australasian cohort and further investigate the heterogeneity exhibited at the locus, a combination of flanking and repeat-primed PCR was used to screen a cohort of 242 Australasian patients with neurological disease. Patients whose data indicated large gaps within expanded alleles following repeat-primed PCR, underwent targeted long-read sequencing to identify novel repeat motifs at the locus. To increase diagnostic yield, additional probes at the RFC1 repeat region were incorporated into the PathWest diagnostic laboratory targeted neurological disease gene panel to enable first-pass screening of the locus for all samples tested on the panel. Within the Australasian cohort, we detected known pathogenic biallelic expansions in 15.3% ( n = 37) of patients. Thirty indicated biallelic AAGGG expansions, two had biallelic 'Māori alleles' [(AAAGG)
exp (AAGGG)exp ], two samples were compound heterozygous for the Māori allele and an AAGGG expansion, two samples had biallelic ACAGG expansions and one sample was compound heterozygous for the ACAGG and AAGGG expansions. Forty-five samples tested indicated the presence of biallelic expansions not known to be pathogenic. A large proportion (84%) showed complex interrupted patterns following repeat-primed PCR, suggesting that these expansions are likely to be comprised of more than one repeat motif, including previously unknown repeats. Using targeted long-read sequencing, we identified three novel repeat motifs in expanded alleles. Here, we also show that short-read sequencing can be used to reliably screen for the presence or absence of biallelic RFC1 expansions in all samples tested using the PathWest targeted neurological disease gene panel. Our results show that RFC1 pathogenic expansions make a substantial contribution to neurological disease in the Australasian population and further extend the heterogeneity of the locus. To accommodate the increased complexity, we outline a multi-step workflow utilizing both targeted short- and long-read sequencing to achieve a definitive genotype and provide accurate diagnoses for patients., Competing Interests: H.G. has previously received travel and accommodation expenses from ONT to speak at conferences. H.G. and I.W.D. have paid consultant roles with Sequin Pty Ltd., (© The Author(s) 2023. Published by Oxford University Press on behalf of the Guarantors of Brain.)- Published
- 2023
- Full Text
- View/download PDF
24. Unexpected frequency of the pathogenic AR CAG repeat expansion in the general population.
- Author
-
Zanovello M, Ibáñez K, Brown AL, Sivakumar P, Bombaci A, Santos L, van Vugt JJFA, Narzisi G, Karra R, Scholz SW, Ding J, Gibbs JR, Chiò A, Dalgard C, Weisburd B, Hanna MG, Greensmith L, Phatnani H, Veldink JH, Traynor BJ, Polke J, Houlden H, Fratta P, and Tucci A
- Subjects
- Humans, Male, Muscular Atrophy, Polymerase Chain Reaction, Trinucleotide Repeat Expansion genetics, Receptors, Androgen genetics, Muscular Atrophy, Spinal genetics
- Abstract
CAG repeat expansions in exon 1 of the AR gene on the X chromosome cause spinal and bulbar muscular atrophy, a male-specific progressive neuromuscular disorder associated with a variety of extra-neurological symptoms. The disease has a reported male prevalence of approximately 1:30 000 or less, but the AR repeat expansion frequency is unknown. We established a pipeline, which combines the use of the ExpansionHunter tool and visual validation, to detect AR CAG expansion on whole-genome sequencing data, benchmarked it to fragment PCR sizing, and applied it to 74 277 unrelated individuals from four large cohorts. Our pipeline showed sensitivity of 100% [95% confidence interval (CI) 90.8-100%], specificity of 99% (95% CI 94.2-99.7%), and a positive predictive value of 97.4% (95% CI 84.4-99.6%). We found the mutation frequency to be 1:3182 (95% CI 1:2309-1:4386, n = 117 734) X chromosomes-10 times more frequent than the reported disease prevalence. Modelling using the novel mutation frequency led to estimate disease prevalence of 1:6887 males, more than four times more frequent than the reported disease prevalence. This discrepancy is possibly due to underdiagnosis of this neuromuscular condition, reduced penetrance, and/or pleomorphic clinical manifestations., (© The Author(s) 2023. Published by Oxford University Press on behalf of the Guarantors of Brain.)
- Published
- 2023
- Full Text
- View/download PDF
25. Insights from a genome-wide truth set of tandem repeat variation.
- Author
-
Weisburd B, Tiao G, and Rehm HL
- Abstract
Tools for genotyping tandem repeats (TRs) from short read sequencing data have improved significantly over the past decade. Extensive comparisons of these tools to gold standard diagnostic methods like RP-PCR have confirmed their accuracy for tens to hundreds of well-studied loci. However, a scarcity of high-quality orthogonal truth data limited our ability to measure tool accuracy for the millions of other loci throughout the genome. To address this, we developed a TR truth set based on the Synthetic Diploid Benchmark (SynDip). By identifying the subset of insertions and deletions that represent TR expansions or contractions with motifs between 2 and 50 base pairs, we obtained accurate genotypes for 139,795 pure and 6,845 interrupted repeats in a single diploid sample. Our approach did not require running existing genotyping tools on short read or long read sequencing data and provided an alternative, more accurate view of tandem repeat variation. We applied this truth set to compare the strengths and weaknesses of widely-used tools for genotyping TRs, evaluated the completeness of existing genome-wide TR catalogs, and explored the properties of tandem repeat variation throughout the genome. We found that, without filtering, ExpansionHunter had higher accuracy than GangSTR and HipSTR over a wide range of motifs and allele sizes. Also, when errors in allele size occurred, ExpansionHunter tended to overestimate expansion sizes, while GangSTR tended to underestimate them. Additionally, we saw that widely-used TR catalogs miss between 16% and 41% of variant loci in the truth set. These results suggest that genome-wide analyses would benefit from genotyping a larger set of loci as well as further tool development that builds on the strengths of current algorithms. To that end, we developed a new catalog of 2.8 million loci that captures 95% of variant loci in the truth set, and created a modified version of ExpansionHunter that runs 2 to 3x faster than the original while producing the same output.
- Published
- 2023
- Full Text
- View/download PDF
26. Transcriptome and Genome Analysis Uncovers a DMD Structural Variant: A Case Report.
- Author
-
Folland C, Ganesh V, Weisburd B, McLean C, Kornberg AJ, O'Donnell-Luria A, Rehm HL, Stevanovski I, Chintalaphani SR, Kennedy P, Deveson IW, and Ravenscroft G
- Abstract
Objective: Duchenne muscular dystrophy (DMD) is caused by pathogenic variants in the dystrophin gene ( DMD ). Hypermethylated CGG expansions within DIP2B 5' UTR are associated with an intellectual development disorder. Here, we demonstrate the diagnostic utility of genomic short-read sequencing (SRS) and transcriptome sequencing to identify a novel DMD structural variant (SV) and a DIP2B CGG expansion in a patient with DMD for whom conventional diagnostic testing failed to yield a genetic diagnosis., Methods: We performed genomic SRS, skeletal muscle transcriptome sequencing, and targeted programmable long-read sequencing (LRS)., Results: The proband had a typical DMD clinical presentation, autism spectrum disorder (ASD), and dystrophinopathy on muscle biopsy. Transcriptome analysis identified 6 aberrantly expressed genes; DMD and DIP2B were the strongest underexpression and overexpression outliers, respectively. Genomic SRS identified a 216 kb paracentric inversion (NC_000023.11: g.33162217-33378800) overlapping 2 DMD promoters. ExpansionHunter indicated an expansion of 109 CGG repeats within the 5' UTR of DIP2B . Targeted genomic LRS confirmed the SV and genotyped the DIP2B repeat expansion as 270 CGG repeats., Discussion: Here, transcriptome data heavily guided genomic analysis to resolve a complex DMD inversion and a DIP2B repeat expansion. Longitudinal follow-up will be important for clarifying the clinical significance of the DIP2B genotype., (Copyright © 2023 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.)
- Published
- 2023
- Full Text
- View/download PDF
27. REViewer: haplotype-resolved visualization of read alignments in and around tandem repeats.
- Author
-
Dolzhenko E, Weisburd B, Ibañez K, Rajan-Babu IS, Anyansi C, Bennett MF, Billingsley K, Carroll A, Clamons S, Danzi MC, Deshpande V, Ding J, Fazal S, Halman A, Jadhav B, Qiu Y, Richmond PA, Saunders CT, Scheffler K, van Vugt JJFA, Zwamborn RRAJ, Chong SS, Friedman JM, Tucci A, Rehm HL, and Eberle MA
- Subjects
- Alleles, Exome, Fragile X Mental Retardation Protein genetics, Haplotypes, High-Throughput Nucleotide Sequencing methods, Humans, Amyotrophic Lateral Sclerosis genetics, Tandem Repeat Sequences
- Abstract
Background: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads., Results: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR., Conclusions: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook respectively., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
28. Questioning the Association of the STMN2 Dinucleotide Repeat With Amyotrophic Lateral Sclerosis.
- Author
-
Ross JP, Akçimen F, Liao C, Spiegelman D, Weisburd B, Dupré N, Dion PA, Rouleau GA, and Farhan SMK
- Abstract
Objectives: Recently, the number of dinucleotide CA repeats in an intron of the STMN2 gene was reported to be associated with an increased risk for amyotrophic lateral sclerosis (ALS). Therefore, we sought to replicate this observation in an independent group of ALS patients and a much larger control group., Methods: Here, we used whole-genome sequencing and tested the STMN2 CA repeat in a case-control cohort of the European genetic background and in genomes from various populations in the gnomAD cohort to attempt to replicate this proposed association., Results: We find that repeats well above the previously reported pathogenic threshold of 19 are commonly observed in unaffected individuals across different populations. Furthermore, we did not observe an association between longer STMN2 CA repeats and ALS phenotype., Discussion: In summary, our results do not support a role of STMN2 CA repeats toward ALS risk. As TDP-43 aggregation is central to ALS pathogenesis, lowered expression of STMN2 could be used as a biomarker for ALS. Therefore, a variant associated both with the risk for ALS and the level of STMN2 expression would be clinically useful. However, for a variant to be actionable, it must be strongly replicated in independent cohorts and exceed the rigorous statistical thresholds applied., (Copyright © 2022 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.)
- Published
- 2022
- Full Text
- View/download PDF
29. seqr: A web-based analysis and collaboration tool for rare disease genomics.
- Author
-
Pais LS, Snow H, Weisburd B, Zhang S, Baxter SM, DiTroia S, O'Heir E, England E, Chao KR, Lemire G, Osei-Owusu I, VanNoy GE, Wilson M, Nguyen K, Arachchi H, Phu W, Solomonson M, Mano S, O'Leary M, Lovgren A, Babb L, Austin-Tse CA, Rehm HL, MacArthur DG, and O'Donnell-Luria A
- Subjects
- Exome, Humans, Internet, Software, Genomics, Rare Diseases diagnosis, Rare Diseases genetics
- Abstract
Exome and genome sequencing have become the tools of choice for rare disease diagnosis, leading to large amounts of data available for analyses. To identify causal variants in these datasets, powerful filtering and decision support tools that can be efficiently used by clinicians and researchers are required. To address this need, we developed seqr - an open-source, web-based tool for family-based monogenic disease analysis that allows researchers to work collaboratively to search and annotate genomic callsets. To date, seqr is being used in several research pipelines and one clinical diagnostic lab. In our own experience through the Broad Institute Center for Mendelian Genomics, seqr has enabled analyses of over 10,000 families, supporting the diagnosis of more than 3,800 individuals with rare disease and discovery of over 300 novel disease genes. Here, we describe a framework for genomic analysis in rare disease that leverages seqr's capabilities for variant filtration, annotation, and causal variant identification, as well as support for research collaboration and data sharing. The seqr platform is available as open source software, allowing low-cost participation in rare disease research, and a community effort to support diagnosis and gene discovery in rare disease., (© 2022 Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
30. Addendum: The mutational constraint spectrum quantified from variation in 141,456 humans.
- Author
-
Gudmundsson S, Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, England EM, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong JX, Samocha KE, Pierce-Hoffman E, Zappala Z, O'Donnell-Luria AH, Minikel EV, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, Neale BM, Daly MJ, and MacArthur DG
- Published
- 2021
- Full Text
- View/download PDF
31. Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes.
- Author
-
Goodrich JK, Singer-Berk M, Son R, Sveden A, Wood J, England E, Cole JB, Weisburd B, Watts N, Caulkins L, Dornbos P, Koesterer R, Zappala Z, Zhang H, Maloney KA, Dahl A, Aguilar-Salinas CA, Atzmon G, Barajas-Olmos F, Barzilai N, Blangero J, Boerwinkle E, Bonnycastle LL, Bottinger E, Bowden DW, Centeno-Cruz F, Chambers JC, Chami N, Chan E, Chan J, Cheng CY, Cho YS, Contreras-Cubas C, Córdova E, Correa A, DeFronzo RA, Duggirala R, Dupuis J, Garay-Sevilla ME, García-Ortiz H, Gieger C, Glaser B, González-Villalpando C, Gonzalez ME, Grarup N, Groop L, Gross M, Haiman C, Han S, Hanis CL, Hansen T, Heard-Costa NL, Henderson BE, Hernandez JMM, Hwang MY, Islas-Andrade S, Jørgensen ME, Kang HM, Kim BJ, Kim YJ, Koistinen HA, Kooner JS, Kuusisto J, Kwak SH, Laakso M, Lange L, Lee JY, Lee J, Lehman DM, Linneberg A, Liu J, Loos RJF, Lyssenko V, Ma RCW, Martínez-Hernández A, Meigs JB, Meitinger T, Mendoza-Caamal E, Mohlke KL, Morris AD, Morrison AC, Ng MCY, Nilsson PM, O'Donnell CJ, Orozco L, Palmer CNA, Park KS, Post WS, Pedersen O, Preuss M, Psaty BM, Reiner AP, Revilla-Monsalve C, Rich SS, Rotter JI, Saleheen D, Schurmann C, Sim X, Sladek R, Small KS, So WY, Spector TD, Strauch K, Strom TM, Tai ES, Tam CHT, Teo YY, Thameem F, Tomlinson B, Tracy RP, Tuomi T, Tuomilehto J, Tusié-Luna T, van Dam RM, Vasan RS, Wilson JG, Witte DR, Wong TY, Burtt NP, Zaitlen N, McCarthy MI, Boehnke M, Pollin TI, Flannick J, Mercader JM, O'Donnell-Luria A, Baxter S, Florez JC, MacArthur DG, and Udler MS
- Subjects
- Adult, Biological Variation, Population, Biomarkers metabolism, Diabetes Mellitus, Type 2 metabolism, Dyslipidemias metabolism, Exome genetics, Genotype, Humans, Multifactorial Inheritance, Penetrance, Risk Assessment, Diabetes Mellitus, Type 2 genetics, Dyslipidemias genetics, Genetic Predisposition to Disease genetics
- Abstract
Hundreds of thousands of genetic variants have been reported to cause severe monogenic diseases, but the probability that a variant carrier develops the disease (termed penetrance) is unknown for virtually all of them. Additionally, the clinical utility of common polygenetic variation remains uncertain. Using exome sequencing from 77,184 adult individuals (38,618 multi-ancestral individuals from a type 2 diabetes case-control study and 38,566 participants from the UK Biobank, for whom genotype array data were also available), we apply clinical standard-of-care gene variant curation for eight monogenic metabolic conditions. Rare variants causing monogenic diabetes and dyslipidemias display effect sizes significantly larger than the top 1% of the corresponding polygenic scores. Nevertheless, penetrance estimates for monogenic variant carriers average 60% or lower for most conditions. We assess epidemiologic and genetic factors contributing to risk prediction in monogenic variant carriers, demonstrating that inclusion of polygenic variation significantly improves biomarker estimation for two monogenic dyslipidemias.
- Published
- 2021
- Full Text
- View/download PDF
32. A form of muscular dystrophy associated with pathogenic variants in JAG2.
- Author
-
Coppens S, Barnard AM, Puusepp S, Pajusalu S, Õunap K, Vargas-Franco D, Bruels CC, Donkervoort S, Pais L, Chao KR, Goodrich JK, England EM, Weisburd B, Ganesh VS, Gudmundsson S, O'Donnell-Luria A, Nigul M, Ilves P, Mohassel P, Siddique T, Milone M, Nicolau S, Maroofian R, Houlden H, Hanna MG, Quinlivan R, Toosi MB, Karimiani EG, Costagliola S, Deconinck N, Kadhim H, Macke E, Lanpher BC, Klee EW, Łusakowska A, Kostera-Pruszczyk A, Hahn A, Schrank B, Nishino I, Ogasawara M, El Sherif R, Stojkovic T, Nelson I, Bonne G, Cohen E, Boland-Augé A, Deleuze JF, Meng Y, Töpf A, Vilain C, Pacak CA, Rivera-Zengotita ML, Bönnemann CG, Straub V, Handford PA, Draper I, Walter GA, and Kang PB
- Published
- 2021
- Full Text
- View/download PDF
33. Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.
- Author
-
Zhao X, Collins RL, Lee WP, Weber AM, Jun Y, Zhu Q, Weisburd B, Huang Y, Audano PA, Wang H, Walker M, Lowther C, Fu J, Gerstein MB, Devine SE, Marschall T, Korbel JO, Eichler EE, Chaisson MJP, Lee C, Mills RE, Brand H, and Talkowski ME
- Subjects
- DNA Copy Number Variations, Exons genetics, Humans, Research Design, Segmental Duplications, Genomic, Sequence Alignment, Genome, Human genetics, Genomic Structural Variation, Genomics methods, Goals, Whole Genome Sequencing methods, Whole Genome Sequencing standards
- Abstract
Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment., (Copyright © 2021. Published by Elsevier Inc.)
- Published
- 2021
- Full Text
- View/download PDF
34. Author Correction: The mutational constraint spectrum quantified from variation in 141,456 humans.
- Author
-
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, England EM, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong JX, Samocha KE, Pierce-Hoffman E, Zappala Z, O'Donnell-Luria AH, Minikel EV, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, Neale BM, Daly MJ, and MacArthur DG
- Published
- 2021
- Full Text
- View/download PDF
35. WGS and RNA Studies Diagnose Noncoding DMD Variants in Males With High Creatine Kinase.
- Author
-
Waddell LB, Bryen SJ, Cummings BB, Bournazos A, Evesson FJ, Joshi H, Marshall JL, Tukiainen T, Valkanas E, Weisburd B, Sadedin S, Davis MR, Faiz F, Gooding R, Sandaradura SA, O'Grady GL, Tchan MC, Mowat DR, Oates EC, Farrar MA, Sampaio H, Ma A, Neas K, Wang MX, Charlton A, Chan C, Kenwright DN, Graf N, Arbuckle S, Clarke NF, MacArthur DG, Jones KJ, Lek M, and Cooper ST
- Abstract
Objective: To describe the diagnostic utility of whole-genome sequencing and RNA studies in boys with suspected dystrophinopathy, for whom multiplex ligation-dependent probe amplification and exomic parallel sequencing failed to yield a genetic diagnosis, and to use remnant normal DMD splicing in 3 families to define critical levels of wild-type dystrophin bridging clinical spectrums of Duchenne to myalgia., Methods: Exome, genome, and/or muscle RNA sequencing was performed for 7 males with elevated creatine kinase. PCR of muscle-derived complementary DNA (cDNA) studied consequences for DMD premessenger RNA (pre-mRNA) splicing. Quantitative Western blot was used to determine levels of dystrophin, relative to control muscle., Results: Splice-altering intronic single nucleotide variants or structural rearrangements in DMD were identified in all 7 families. Four individuals, with abnormal splicing causing a premature stop codon and nonsense-mediated decay, expressed remnant levels of normally spliced DMD mRNA. Quantitative Western blot enabled correlation of wild-type dystrophin and clinical severity, with 0%-5% dystrophin conferring a Duchenne phenotype, 10% ± 2% a Becker phenotype, and 15% ± 2% dystrophin associated with myalgia without manifesting weakness., Conclusions: Whole-genome sequencing relied heavily on RNA studies to identify DMD splice-altering variants. Short-read RNA sequencing was regularly confounded by the effectiveness of nonsense-mediated mRNA decay and low read depth of the giant DMD mRNA. PCR of muscle cDNA provided a simple, yet informative approach. Highly relevant to genetic therapies for dystrophinopathies, our data align strongly with previous studies of mutant dystrophin in Becker muscular dystrophy, with the collective conclusion that a fractional increase in levels of normal dystrophin between 5% and 20% is clinically significant., (Copyright © 2021 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.)
- Published
- 2021
- Full Text
- View/download PDF
36. A novel RFC1 repeat motif (ACAGG) in two Asia-Pacific CANVAS families.
- Author
-
Scriba CK, Beecroft SJ, Clayton JS, Cortese A, Sullivan R, Yau WY, Dominik N, Rodrigues M, Walker E, Dyer Z, Wu TY, Davis MR, Chandler DC, Weisburd B, Houlden H, Reilly MM, Laing NG, Lamont PJ, Roxburgh RH, and Ravenscroft G
- Subjects
- Aged, Bilateral Vestibulopathy complications, Bilateral Vestibulopathy diagnosis, Cerebellar Ataxia complications, Cerebellar Ataxia diagnosis, Cohort Studies, Female, Humans, Indonesia, Male, Middle Aged, Pedigree, Asian People genetics, Bilateral Vestibulopathy genetics, Cerebellar Ataxia genetics, DNA Repeat Expansion genetics, Replication Protein C genetics
- Abstract
Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS) is a progressive late-onset, neurological disease. Recently, a pentanucleotide expansion in intron 2 of RFC1 was identified as the genetic cause of CANVAS. We screened an Asian-Pacific cohort for CANVAS and identified a novel RFC1 repeat expansion motif, (ACAGG)exp, in three affected individuals. This motif was associated with additional clinical features including fasciculations and elevated serum creatine kinase. These features have not previously been described in individuals with genetically-confirmed CANVAS. Haplotype analysis showed our patients shared the same core haplotype as previously published, supporting the possibility of a single origin of the RFC1 disease allele. We analysed data from >26 000 genetically diverse individuals in gnomAD to show enrichment of (ACAGG) in non-European populations., (© The Author(s) (2020). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
- Published
- 2020
- Full Text
- View/download PDF
37. Physician-Machine Interaction in the Decision Making Process.
- Author
-
Goldberg S, Temkin A, and Weisburd B
- Subjects
- Expert Systems, Machine Learning, Decision Making
- Abstract
We propose an approach to decision support systems (DSS) that starts with the user first making their own unassisted decision αU and providing this decision as an input to the algorithm. Then, if the decision based of machine learning (ML) disagrees with the user's initial decision, it iteratively works with the user to converge to a common decision or at least make the user reconsider input values that are inconsistent with αU. We provide a detailed description of this approach along with examples, and then discuss potential benefits and limitations of this approach.
- Published
- 2020
- Full Text
- View/download PDF
38. The mutational constraint spectrum quantified from variation in 141,456 humans.
- Author
-
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, England EM, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong JX, Samocha KE, Pierce-Hoffman E, Zappala Z, O'Donnell-Luria AH, Minikel EV, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, Neale BM, Daly MJ, and MacArthur DG
- Subjects
- Adult, Brain metabolism, Cardiovascular Diseases genetics, Cohort Studies, Databases, Genetic, Female, Genetic Predisposition to Disease genetics, Genome-Wide Association Study, Humans, Loss of Function Mutation genetics, Male, Mutation Rate, Proprotein Convertase 9 genetics, RNA, Messenger genetics, Reproducibility of Results, Exome Sequencing, Whole Genome Sequencing, Exome genetics, Genes, Essential genetics, Genetic Variation genetics, Genome, Human genetics
- Abstract
Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes
1 . Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.- Published
- 2020
- Full Text
- View/download PDF
39. Recurrent TTN metatranscript-only c.39974-11T>G splice variant associated with autosomal recessive arthrogryposis multiplex congenita and myopathy.
- Author
-
Bryen SJ, Ewans LJ, Pinner J, MacLennan SC, Donkervoort S, Castro D, Töpf A, O'Grady G, Cummings B, Chao KR, Weisburd B, Francioli L, Faiz F, Bournazos AM, Hu Y, Grosmann C, Malicki DM, Doyle H, Witting N, Vissing J, Claeys KG, Urankar K, Beleza-Meireles A, Baptista J, Ellard S, Savarese M, Johari M, Vihola A, Udd B, Majumdar A, Straub V, Bönnemann CG, MacArthur DG, Davis MR, and Cooper ST
- Subjects
- Child, Child, Preschool, Female, Genetic Association Studies, Humans, Infant, Male, Mutation, Pedigree, Phenotype, Radiography, Alternative Splicing, Arthrogryposis diagnosis, Arthrogryposis genetics, Connectin genetics, Genes, Recessive, Genetic Predisposition to Disease, Muscular Diseases diagnosis, Muscular Diseases genetics
- Abstract
We present eight families with arthrogryposis multiplex congenita and myopathy bearing a TTN intron 213 extended splice-site variant (NM_001267550.1:c.39974-11T>G), inherited in trans with a second pathogenic TTN variant. Muscle-derived RNA studies of three individuals confirmed mis-splicing induced by the c.39974-11T>G variant; in-frame exon 214 skipping or use of a cryptic 3' splice-site effecting a frameshift. Confounding interpretation of pathogenicity is the absence of exons 213-217 within the described skeletal muscle TTN N2A isoform. However, RNA-sequencing from 365 adult human gastrocnemius samples revealed that 56% specimens predominantly include exons 213-217 in TTN transcripts (inclusion rate ≥66%). Further, RNA-sequencing of five fetal muscle samples confirmed that 4/5 specimens predominantly include exons 213-217 (fifth sample inclusion rate 57%). Contractures improved significantly with age for four individuals, which may be linked to decreased expression of pathogenic fetal transcripts. Our study extends emerging evidence supporting a vital developmental role for TTN isoforms containing metatranscript-only exons., (© 2019 Wiley Periodicals, Inc.)
- Published
- 2020
- Full Text
- View/download PDF
40. Variant Score Ranker-a web application for intuitive missense variant prioritization.
- Author
-
Du J, Sudarsanam M, Pérez-Palma E, Ganna A, Francioli L, Iqbal S, Niestroj LM, Leu C, Weisburd B, Poterba T, Nürnberg P, Daly MJ, Palotie A, May P, and Lal D
- Subjects
- Mutation, Missense, Software
- Abstract
Motivation: The correct classification of missense variants as benign or pathogenic remains challenging. Pathogenic variants are expected to have higher deleterious prediction scores than benign variants in the same gene. However, most of the existing variant annotation tools do not reference the score range of benign population variants on gene level., Results: We present a web-application, Variant Score Ranker, which enables users to rapidly annotate variants and perform gene-specific variant score ranking on the population level. We also provide an intuitive example of how gene- and population-calibrated variant ranking scores can improve epilepsy variant prioritization., Availability and Implementation: http://vsranker.broadinstitute.org., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2019
- Full Text
- View/download PDF
41. Correction: Insights into the genetic epidemiology of Crohn's and rare diseases in the Ashkenazi Jewish population.
- Author
-
Rivas MA, Avila BE, Koskela J, Huang H, Stevens C, Pirinen M, Haritunians T, Neale BM, Kurki M, Ganna A, Graham D, Glaser B, Peter I, Atzmon G, Barzilai N, Levine AP, Schiff E, Pontikos N, Weisburd B, Lek M, Karczewski KJ, Bloom J, Minikel EV, Petersen BS, Beaugerie L, Seksik P, Cosnes J, Schreiber S, Bokemeyer B, Bethge J, Heap G, Ahmad T, Plagnol V, Segal AW, Targan S, Turner D, Saavalainen P, Farkkila M, Kontula K, Palotie A, Brant SR, Duerr RH, Silverberg MS, Rioux JD, Weersma RK, Franke A, Jostins L, Anderson CA, Barrett JC, MacArthur DG, Jalas C, Sokol H, Xavier RJ, Pulver A, Cho JH, McGovern DPB, and Daly MJ
- Abstract
[This corrects the article DOI: 10.1371/journal.pgen.1007329.].
- Published
- 2019
- Full Text
- View/download PDF
42. matchbox: An open-source tool for patient matching via the Matchmaker Exchange.
- Author
-
Arachchi H, Wojcik MH, Weisburd B, Jacobsen JOB, Valkanas E, Baxter S, Byrne AB, O'Donnell-Luria AH, Haendel M, Smedley D, MacArthur DG, Philippakis AA, and Rehm HL
- Subjects
- Access to Information, Genetic Association Studies, Genetic Predisposition to Disease, Humans, Information Dissemination methods, Phenotype, Software, Web Browser, Information Storage and Retrieval methods, Patient Selection, Rare Diseases genetics
- Abstract
Rare disease investigators constantly face challenges in identifying additional cases to build evidence for gene-disease causality. The Matchmaker Exchange (MME) addresses this limitation by providing a mechanism for matching patients across genomic centers via a federated network. The MME has revolutionized searching for additional cases by making it possible to query across institutional boundaries, so that what was once a laborious and manual process of contacting researchers is now automated and computable. However, while the MME network is beginning to scale, the growth of additional nodes is limited by the lack of easy-to-use solutions that can be implemented by any rare disease database owner, even one without significant software engineering resources. Here, we describe matchbox, which is an open-source, platform-independent, portable bridge between any given rare disease genomic center and the MME network, which has already led to novel gene discoveries. We also describe how matchbox greatly reduces the barrier to participation by overcoming challenges for new databases to join the MME., (© 2018 Wiley Periodicals, Inc.)
- Published
- 2018
- Full Text
- View/download PDF
43. Insights into the genetic epidemiology of Crohn's and rare diseases in the Ashkenazi Jewish population.
- Author
-
Rivas MA, Avila BE, Koskela J, Huang H, Stevens C, Pirinen M, Haritunians T, Neale BM, Kurki M, Ganna A, Graham D, Glaser B, Peter I, Atzmon G, Barzilai N, Levine AP, Schiff E, Pontikos N, Weisburd B, Lek M, Karczewski KJ, Bloom J, Minikel EV, Petersen BS, Beaugerie L, Seksik P, Cosnes J, Schreiber S, Bokemeyer B, Bethge J, Heap G, Ahmad T, Plagnol V, Segal AW, Targan S, Turner D, Saavalainen P, Farkkila M, Kontula K, Palotie A, Brant SR, Duerr RH, Silverberg MS, Rioux JD, Weersma RK, Franke A, Jostins L, Anderson CA, Barrett JC, MacArthur DG, Jalas C, Sokol H, Xavier RJ, Pulver A, Cho JH, McGovern DPB, and Daly MJ
- Subjects
- Algorithms, Crohn Disease epidemiology, Genetics, Population, Genome-Wide Association Study, Haplotypes, Humans, Models, Genetic, Molecular Epidemiology, Polymorphism, Single Nucleotide, Rare Diseases epidemiology, Crohn Disease genetics, Genetic Predisposition to Disease genetics, Jews genetics, Rare Diseases genetics
- Abstract
As part of a broader collaborative network of exome sequencing studies, we developed a jointly called data set of 5,685 Ashkenazi Jewish exomes. We make publicly available a resource of site and allele frequencies, which should serve as a reference for medical genetics in the Ashkenazim (hosted in part at https://ibd.broadinstitute.org, also available in gnomAD at http://gnomad.broadinstitute.org). We estimate that 34% of protein-coding alleles present in the Ashkenazi Jewish population at frequencies greater than 0.2% are significantly more frequent (mean 15-fold) than their maximum frequency observed in other reference populations. Arising via a well-described founder effect approximately 30 generations ago, this catalog of enriched alleles can contribute to differences in genetic risk and overall prevalence of diseases between populations. As validation we document 148 AJ enriched protein-altering alleles that overlap with "pathogenic" ClinVar alleles (table available at https://github.com/macarthur-lab/clinvar/blob/master/output/clinvar.tsv), including those that account for 10-100 fold differences in prevalence between AJ and non-AJ populations of some rare diseases, especially recessive conditions, including Gaucher disease (GBA, p.Asn409Ser, 8-fold enrichment); Canavan disease (ASPA, p.Glu285Ala, 12-fold enrichment); and Tay-Sachs disease (HEXA, c.1421+1G>C, 27-fold enrichment; p.Tyr427IlefsTer5, 12-fold enrichment). We next sought to use this catalog, of well-established relevance to Mendelian disease, to explore Crohn's disease, a common disease with an estimated two to four-fold excess prevalence in AJ. We specifically attempt to evaluate whether strong acting rare alleles, particularly protein-truncating or otherwise large effect-size alleles, enriched by the same founder-effect, contribute excess genetic risk to Crohn's disease in AJ, and find that ten rare genetic risk factors in NOD2 and LRRK2 are enriched in AJ (p < 0.005), including several novel contributing alleles, show evidence of association to CD. Independently, we find that genomewide common variant risk defined by GWAS shows a strong difference between AJ and non-AJ European control population samples (0.97 s.d. higher, p<10-16). Taken together, the results suggest coordinated selection in AJ population for higher CD risk alleles in general. The results and approach illustrate the value of exome sequencing data in case-control studies along with reference data sets like ExAC (sites VCF available via FTP at ftp.broadinstitute.org/pub/ExAC_release/release0.3/) to pinpoint genetic variation that contributes to variable disease predisposition across populations., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2018
- Full Text
- View/download PDF
44. ClinVar data parsing.
- Author
-
Zhang X, Minikel EV, O'Donnell-Luria AH, MacArthur DG, Ware JS, and Weisburd B
- Abstract
This software repository provides a pipeline for converting raw ClinVar data files into analysis-friendly tab-delimited tables, and also provides these tables for the most recent ClinVar release. Separate tables are generated for genome builds GRCh37 and GRCh38 as well as for mono-allelic variants and complex multi-allelic variants. Additionally, the tables are augmented with allele frequencies from the ExAC and gnomAD datasets as these are often consulted when analyzing ClinVar variants. Overall, this work provides ClinVar data in a format that is easier to work with and can be directly loaded into a variety of popular analysis tools such as R, python pandas, and SQL databases., Competing Interests: Competing interests: No competing interests were disclosed.
- Published
- 2017
- Full Text
- View/download PDF
45. Pathogenic ASXL1 somatic variants in reference databases complicate germline variant interpretation for Bohring-Opitz Syndrome.
- Author
-
Carlston CM, O'Donnell-Luria AH, Underhill HR, Cummings BB, Weisburd B, Minikel EV, Birnbaum DP, Tvrdik T, MacArthur DG, and Mao R
- Subjects
- Aged, Aged, 80 and over, Alleles, Amino Acid Substitution, Child, Preschool, Databases, Genetic, Facies, Female, Humans, Infant, Male, Middle Aged, Phenotype, Craniosynostoses diagnosis, Craniosynostoses genetics, Genetic Association Studies methods, Germ-Line Mutation, Intellectual Disability diagnosis, Intellectual Disability genetics, Mutation, Repressor Proteins genetics
- Abstract
The clinical interpretation of genetic variants has come to rely heavily on reference population databases such as the Exome Aggregation Consortium (ExAC) database. Pathogenic variants in genes associated with severe, pediatric-onset, highly penetrant, autosomal dominant conditions are assumed to be absent or rare in these databases. Exome sequencing of a 6-year-old female patient with seizures, developmental delay, dysmorphic features, and failure to thrive identified an ASXL1 variant previously reported as causative of Bohring-Opitz syndrome (BOS). Surprisingly, the variant was observed seven times in the ExAC database, presumably in individuals without BOS. Although the BOS phenotype fit, the presence of the variant in reference population databases introduced ambiguity in result interpretation. Review of the literature revealed that acquired somatic mosaicism of ASXL1 variants (including pathogenic variants) during hematopoietic clonal expansion can occur with aging in healthy individuals. We examined all ASXL1 truncating variants in the ExAC database and determined most are likely somatic. Failure to consider somatic mosaicism may lead to the inaccurate assumption that conditions like BOS have reduced penetrance, or the misclassification of potentially pathogenic variants., (© 2017 Wiley Periodicals, Inc.)
- Published
- 2017
- Full Text
- View/download PDF
46. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing.
- Author
-
Cummings BB, Marshall JL, Tukiainen T, Lek M, Donkervoort S, Foley AR, Bolduc V, Waddell LB, Sandaradura SA, O'Grady GL, Estrella E, Reddy HM, Zhao F, Weisburd B, Karczewski KJ, O'Donnell-Luria AH, Birnbaum D, Sarkozy A, Hu Y, Gonorazky H, Claeys K, Joshi H, Bournazos A, Oates EC, Ghaoui R, Davis MR, Laing NG, Topf A, Kang PB, Beggs AH, North KN, Straub V, Dowling JJ, Muntoni F, Clarke NF, Cooper ST, Bönnemann CG, and MacArthur DG
- Subjects
- Collagen Type VI genetics, Collagen Type VI metabolism, Humans, Muscular Diseases genetics, Muscular Diseases metabolism, Mutation, High-Throughput Nucleotide Sequencing methods, Transcriptome genetics
- Abstract
Exome and whole-genome sequencing are becoming increasingly routine approaches in Mendelian disease diagnosis. Despite their success, the current diagnostic rate for genomic analyses across a variety of rare diseases is approximately 25 to 50%. We explore the utility of transcriptome sequencing [RNA sequencing (RNA-seq)] as a complementary diagnostic tool in a cohort of 50 patients with genetically undiagnosed rare muscle disorders. We describe an integrated approach to analyze patient muscle RNA-seq, leveraging an analysis framework focused on the detection of transcript-level changes that are unique to the patient compared to more than 180 control skeletal muscle samples. We demonstrate the power of RNA-seq to validate candidate splice-disrupting mutations and to identify splice-altering variants in both exonic and deep intronic regions, yielding an overall diagnosis rate of 35%. We also report the discovery of a highly recurrent de novo intronic mutation in COL6A1 that results in a dominantly acting splice-gain event, disrupting the critical glycine repeat motif of the triple helical domain. We identify this pathogenic variant in a total of 27 genetically unsolved patients in an external collagen VI-like dystrophy cohort, thus explaining approximately 25% of patients clinically suggestive of having collagen VI dystrophy in whom prior genetic analysis is negative. Overall, this study represents a large systematic application of transcriptome sequencing to rare disease diagnosis and highlights its utility for the detection and interpretation of variants missed by current standard diagnostic approaches., (Copyright © 2017, American Association for the Advancement of Science.)
- Published
- 2017
- Full Text
- View/download PDF
47. Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity.
- Author
-
Saleheen D, Natarajan P, Armean IM, Zhao W, Rasheed A, Khetarpal SA, Won HH, Karczewski KJ, O'Donnell-Luria AH, Samocha KE, Weisburd B, Gupta N, Zaidi M, Samuel M, Imran A, Abbas S, Majeed F, Ishaq M, Akhtar S, Trindade K, Mucksavage M, Qamar N, Zaman KS, Yaqoob Z, Saghir T, Rizvi SNH, Memon A, Hayyat Mallick N, Ishaq M, Rasheed SZ, Memon FU, Mahmood K, Ahmed N, Do R, Krauss RM, MacArthur DG, Gabriel S, Lander ES, Daly MJ, Frossard P, Danesh J, Rader DJ, and Kathiresan S
- Subjects
- 1-Alkyl-2-acetylglycerophosphocholine Esterase deficiency, 1-Alkyl-2-acetylglycerophosphocholine Esterase genetics, Apolipoprotein C-III deficiency, Apolipoprotein C-III genetics, Cohort Studies, Coronary Disease blood, Coronary Disease genetics, Cytochrome P450 Family 2 genetics, Dietary Fats pharmacology, Exome genetics, Fasting blood, Female, Gene Frequency, Humans, Interleukin-8 blood, Male, Middle Aged, Myocardial Infarction blood, Myocardial Infarction genetics, Neuregulins genetics, Pakistan, Pedigree, Phosphoproteins genetics, Postprandial Period, RNA Splice Sites genetics, Reverse Genetics methods, Sodium-Hydrogen Exchangers genetics, Triglycerides blood, Consanguinity, DNA Mutational Analysis, Gene Deletion, Genes genetics, Genetic Association Studies methods, Homozygote, Phenotype
- Abstract
A major goal of biomedicine is to understand the function of every gene in the human genome. Loss-of-function mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such 'human knockouts' can provide insight into gene function. Consanguineous unions are more likely to result in offspring carrying homozygous loss-of-function mutations. In Pakistan, consanguinity rates are notably high. Here we sequence the protein-coding regions of 10,503 adult participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS), designed to understand the determinants of cardiometabolic diseases in individuals from South Asia. We identified individuals carrying homozygous predicted loss-of-function (pLoF) mutations, and performed phenotypic analysis involving more than 200 biochemical and disease traits. We enumerated 49,138 rare (<1% minor allele frequency) pLoF mutations. These pLoF mutations are estimated to knock out 1,317 genes, each in at least one participant. Homozygosity for pLoF mutations at PLA2G7 was associated with absent enzymatic activity of soluble lipoprotein-associated phospholipase A2; at CYP2F1, with higher plasma interleukin-8 concentrations; at TREH, with lower concentrations of apoB-containing lipoprotein subfractions; at either A3GALT2 or NRG4, with markedly reduced plasma insulin C-peptide concentrations; and at SLC9A3R1, with mediators of calcium and phosphate signalling. Heterozygous deficiency of APOC3 has been shown to protect against coronary heart disease; we identified APOC3 homozygous pLoF carriers in our cohort. We recruited these human knockouts and challenged them with an oral fat load. Compared with family members lacking the mutation, individuals with APOC3 knocked out displayed marked blunting of the usual post-prandial rise in plasma triglycerides. Overall, these observations provide a roadmap for a 'human knockout project', a systematic effort to understand the phenotypic consequences of complete disruption of genes in humans.
- Published
- 2017
- Full Text
- View/download PDF
48. The ExAC browser: displaying reference data information from over 60 000 exomes.
- Author
-
Karczewski KJ, Weisburd B, Thomas B, Solomonson M, Ruderfer DM, Kavanagh D, Hamamsy T, Lek M, Samocha KE, Cummings BB, Birnbaum D, Daly MJ, and MacArthur DG
- Subjects
- Genome-Wide Association Study methods, Humans, Software, User-Computer Interface, Computational Biology methods, Databases, Genetic, Exome, Genomics methods, Web Browser
- Abstract
Worldwide, hundreds of thousands of humans have had their genomes or exomes sequenced, and access to the resulting data sets can provide valuable information for variant interpretation and understanding gene function. Here, we present a lightweight, flexible browser framework to display large population datasets of genetic variation. We demonstrate its use for exome sequence data from 60 706 individuals in the Exome Aggregation Consortium (ExAC). The ExAC browser provides gene- and transcript-centric displays of variation, a critical view for clinical applications. Additionally, we provide a variant display, which includes population frequency and functional annotation data as well as short read support for the called variant. This browser is open-source, freely available at http://exac.broadinstitute.org, and has already been used extensively by clinical laboratories worldwide., (© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2017
- Full Text
- View/download PDF
49. Analysis of protein-coding genetic variation in 60,706 humans.
- Author
-
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, O'Donnell-Luria AH, Ware JS, Hill AJ, Cummings BB, Tukiainen T, Birnbaum DP, Kosmicki JA, Duncan LE, Estrada K, Zhao F, Zou J, Pierce-Hoffman E, Berghout J, Cooper DN, Deflaux N, DePristo M, Do R, Flannick J, Fromer M, Gauthier L, Goldstein J, Gupta N, Howrigan D, Kiezun A, Kurki MI, Moonshine AL, Natarajan P, Orozco L, Peloso GM, Poplin R, Rivas MA, Ruano-Rubio V, Rose SA, Ruderfer DM, Shakir K, Stenson PD, Stevens C, Thomas BP, Tiao G, Tusie-Luna MT, Weisburd B, Won HH, Yu D, Altshuler DM, Ardissino D, Boehnke M, Danesh J, Donnelly S, Elosua R, Florez JC, Gabriel SB, Getz G, Glatt SJ, Hultman CM, Kathiresan S, Laakso M, McCarroll S, McCarthy MI, McGovern D, McPherson R, Neale BM, Palotie A, Purcell SM, Saleheen D, Scharf JM, Sklar P, Sullivan PF, Tuomilehto J, Tsuang MT, Watkins HC, Wilson JG, Daly MJ, and MacArthur DG
- Subjects
- DNA Mutational Analysis, Datasets as Topic, Humans, Phenotype, Proteome genetics, Rare Diseases genetics, Sample Size, Exome genetics, Genetic Variation genetics
- Abstract
Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
- Published
- 2016
- Full Text
- View/download PDF
50. Correction: Compensatory induction of MYC expression by sustained CDK9 inhibition via a BRD4-dependent mechanism.
- Author
-
Lu H, Xue Y, Yu GK, Arias C, Lin J, Fong S, Faure M, Weisburd B, Ji X, Mercier A, Sutton J, Luo K, Gao Z, and Zhou Q
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.