44 results on '"Knowles DA"'
Search Results
2. Development of Fatigue Testing System for in-situ Observation by AFM & SEM
- Author
-
Farokh Payam Amir, Payton Oliver, Mostafavi Mahmoud, Picco Loren, Moore Stacy, Martin Tomas, Warren A.D., and Knowles David
- Subjects
Engineering (General). Civil engineering (General) ,TA1-2040 - Abstract
A three-point bend fatigue miniature stage for in-situ observation of fatigue microcrack initiation and growth behaviour by scanning electron microscopy (SEM) and atomic force microscopy (AFM) has been manufactured. Details of the stage design with finite element analysis of the stress profiles on loading are provided. The proposed stage facilitates study of the micro mechanisms of fatigue when used during SEM and AFM scanning of the sample surface. To demonstrate the applicability of the system, fatigue tests have been performed on annealed AISI Type 316 stainless steel. Surface topography images obtained by SEM and HS-AFM (High Speed AFM) are presented for comparison. The data can be used to validate crystal plasticity models which should then directly predict multiaxial behaviour without recourse to deformation rules such as equivalent stress or strain.
- Published
- 2019
- Full Text
- View/download PDF
3. Nonparametric identification of regulatory interactions from spatial and temporal gene expression data
- Author
-
Aswani Anil, Keränen Soile VE, Brown James, Fowlkes Charless C, Knowles David W, Biggin Mark D, Bickel Peter, and Tomlin Claire J
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background The correlation between the expression levels of transcription factors and their target genes can be used to infer interactions within animal regulatory networks, but current methods are limited in their ability to make correct predictions. Results Here we describe a novel approach which uses nonparametric statistics to generate ordinary differential equation (ODE) models from expression data. Compared to other dynamical methods, our approach requires minimal information about the mathematical structure of the ODE; it does not use qualitative descriptions of interactions within the network; and it employs new statistics to protect against over-fitting. It generates spatio-temporal maps of factor activity, highlighting the times and spatial locations at which different regulators might affect target gene expression levels. We identify an ODE model for eve mRNA pattern formation in the Drosophila melanogaster blastoderm and show that this reproduces the experimental patterns well. Compared to a non-dynamic, spatial-correlation model, our ODE gives 59% better agreement to the experimentally measured pattern. Our model suggests that protein factors frequently have the potential to behave as both an activator and inhibitor for the same cis-regulatory module depending on the factors' concentration, and implies different modes of activation and repression. Conclusions Our method provides an objective quantification of the regulatory potential of transcription factors in a network, is suitable for both low- and moderate-dimensional gene expression datasets, and includes improvements over existing dynamic and static models.
- Published
- 2010
- Full Text
- View/download PDF
4. Phenotypic complexities of rare heterozygous neurexin-1 deletions.
- Author
-
Fernando MB, Fan Y, Zhang Y, Tokolyi A, Murphy AN, Kammourh S, Michael Deans PJ, Ghorbani S, Onatzevitch R, Pero A, Padilla C, Williams S, Flaherty EK, Prytkova IA, Cao L, Knowles DA, Fang G, Slesinger PA, and Brennand KJ
- Abstract
Given the large number of genes significantly associated with risk for neuropsychiatric disorders, a critical unanswered question is the extent to which diverse mutations --sometimes impacting the same gene-- will require tailored therapeutic strategies. Here we consider this in the context of rare neuropsychiatric disorder-associated copy number variants (2p16.3) resulting in heterozygous deletions in NRXN1 , a pre-synaptic cell adhesion protein that serves as a critical synaptic organizer in the brain. Complex patterns of NRXN1 alternative splicing are fundamental to establishing diverse neurocircuitry, vary between the cell types of the brain, and are differentially impacted by unique (non-recurrent) deletions. We contrast the cell-type-specific impact of patient-specific mutations in NRXN1 using human induced pluripotent stem cells, finding that perturbations in NRXN1 splicing result in divergent cell-type-specific synaptic outcomes. Via distinct loss-of-function (LOF) and gain-of-function (GOF) mechanisms, NRXN1
+/- deletions cause decreased synaptic activity in glutamatergic neurons, yet increased synaptic activity in GABAergic neurons. Reciprocal isogenic manipulations causally demonstrate that aberrant splicing drives these changes in synaptic activity. For NRXN1 deletions, and perhaps more broadly, precision medicine will require stratifying patients based on whether their gene mutations act through LOF or GOF mechanisms, in order to achieve individualized restoration of NRXN1 isoform repertoires by increasing wildtype, or ablating mutant isoforms. Given the increasing number of mutations predicted to engender both LOF and GOF mechanisms in brain disorders, our findings add nuance to future considerations of precision medicine., Competing Interests: Ethics declarations / Competing interest statement. All authors have no competing interests to declare.- Published
- 2024
- Full Text
- View/download PDF
5. A Bayesian framework for inferring dynamic intercellular interactions from time-series single-cell data.
- Author
-
Park C, Mani S, Beltran-Velez N, Maurer K, Huang T, Li S, Gohil S, Livak KJ, Knowles DA, Wu CJ, and Azizi E
- Subjects
- Humans, T-Lymphocytes metabolism, Sequence Analysis, RNA methods, Bayes Theorem, Single-Cell Analysis methods, Cell Communication
- Abstract
Characterizing cell-cell communication and tracking its variability over time are crucial for understanding the coordination of biological processes mediating normal development, disease progression, and responses to perturbations such as therapies. Existing tools fail to capture time-dependent intercellular interactions and primarily rely on databases compiled from limited contexts. We introduce DIISCO, a Bayesian framework designed to characterize the temporal dynamics of cellular interactions using single-cell RNA-sequencing data from multiple time points. Our method utilizes structured Gaussian process regression to unveil time-resolved interactions among diverse cell types according to their coevolution and incorporates prior knowledge of receptor-ligand complexes. We show the interpretability of DIISCO in simulated data and new data collected from T cells cocultured with lymphoma cells, demonstrating its potential to uncover dynamic cell-cell cross talk., (© 2024 Park et al.; Published by Cold Spring Harbor Laboratory Press.)
- Published
- 2024
- Full Text
- View/download PDF
6. Prediction of on-target and off-target activity of CRISPR-Cas13d guide RNAs using deep learning.
- Author
-
Wessels HH, Stirn A, Méndez-Mancilla A, Kim EJ, Hart SK, Knowles DA, and Sanjana NE
- Subjects
- Humans, CRISPR-Cas Systems genetics, Clustered Regularly Interspaced Short Palindromic Repeats, RNA, Gene Editing, RNA, Guide, CRISPR-Cas Systems, Deep Learning
- Abstract
Transcriptome engineering applications in living cells with RNA-targeting CRISPR effectors depend on accurate prediction of on-target activity and off-target avoidance. Here we design and test ~200,000 RfxCas13d guide RNAs targeting essential genes in human cells with systematically designed mismatches and insertions and deletions (indels). We find that mismatches and indels have a position- and context-dependent impact on Cas13d activity, and mismatches that result in G-U wobble pairings are better tolerated than other single-base mismatches. Using this large-scale dataset, we train a convolutional neural network that we term targeted inhibition of gene expression via gRNA design (TIGER) to predict efficacy from guide sequence and context. TIGER outperforms the existing models at predicting on-target and off-target activity on our dataset and published datasets. We show that TIGER scoring combined with specific mismatches yields the first general framework to modulate transcript expression, enabling the use of RNA-targeting CRISPRs to precisely control gene dosage., (© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2024
- Full Text
- View/download PDF
7. Smoother: a unified and modular framework for incorporating structural dependency in spatial omics data.
- Author
-
Su J, Reynier JB, Fu X, Zhong G, Jiang J, Escalante RS, Wang Y, Aparicio L, Izar B, Knowles DA, and Rabadan R
- Subjects
- Humans, Male, Tumor Microenvironment, Fibroblasts, Prostate
- Abstract
Spatial omics technologies can help identify spatially organized biological processes, but existing computational approaches often overlook structural dependencies in the data. Here, we introduce Smoother, a unified framework that integrates positional information into non-spatial models via modular priors and losses. In simulated and real datasets, Smoother enables accurate data imputation, cell-type deconvolution, and dimensionality reduction with remarkable efficiency. In colorectal cancer, Smoother-guided deconvolution reveals plasma cell and fibroblast subtype localizations linked to tumor microenvironment restructuring. Additionally, joint modeling of spatial and single-cell human prostate data with Smoother allows for spatial mapping of reference populations with significantly reduced ambiguity., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
8. DIISCO: A Bayesian framework for inferring dynamic intercellular interactions from time-series single-cell data.
- Author
-
Park C, Mani S, Beltran-Velez N, Maurer K, Gohil S, Li S, Huang T, Knowles DA, Wu CJ, and Azizi E
- Abstract
Characterizing cell-cell communication and tracking its variability over time is essential for understanding the coordination of biological processes mediating normal development, progression of disease, or responses to perturbations such as therapies. Existing tools lack the ability to capture time-dependent intercellular interactions, such as those influenced by therapy, and primarily rely on existing databases compiled from limited contexts. We present DIISCO, a Bayesian framework for characterizing the temporal dynamics of cellular interactions using single-cell RNA-sequencing data from multiple time points. Our method uses structured Gaussian process regression to unveil time-resolved interactions among diverse cell types according to their co-evolution and incorporates prior knowledge of receptor-ligand complexes. We show the interpretability of DIISCO in simulated data and new data collected from CAR-T cells co-cultured with lymphoma cells, demonstrating its potential to uncover dynamic cell-cell crosstalk., Competing Interests: 6Conflict of Interest C.J.W. is an equity holder of BioNTech and receives research funding from Pharmacyclics.
- Published
- 2023
- Full Text
- View/download PDF
9. Large-scale causal discovery using interventional data sheds light on the regulatory network architecture of blood traits.
- Author
-
Brown BC, Morris JA, Lappalainen T, and Knowles DA
- Abstract
Inference of directed biological networks is an important but notoriously challenging problem. We introduce inverse sparse regression (inspre) , an approach to learning causal networks that leverages large-scale intervention-response data. Applied to 788 genes from the genome-wide perturb-seq dataset, inspre helps elucidate the network architecture of blood traits., Competing Interests: Competing Interests Tuuli Lappalainen is a paid adviser or consultant of Variant Bio and GSK.
- Published
- 2023
- Full Text
- View/download PDF
10. Cas13d-mediated isoform-specific RNA knockdown with a unified computational and experimental toolbox.
- Author
-
Schertzer MD, Stirn A, Isaev K, Pereira L, Das A, Harbison C, Park SH, Wessels HH, Sanjana NE, and Knowles DA
- Abstract
Alternative splicing is an essential mechanism for diversifying proteins, in which mature RNA isoforms produce proteins with potentially distinct functions. Two major challenges in characterizing the cellular function of isoforms are the lack of experimental methods to specifically and efficiently modulate isoform expression and computational tools for complex experimental design. To address these gaps, we developed and methodically tested a strategy which pairs the RNA-targeting CRISPR/Cas13d system with guide RNAs that span exon-exon junctions in the mature RNA. We performed a high-throughput essentiality screen, quantitative RT-PCR assays, and PacBio long read sequencing to affirm our ability to specifically target and robustly knockdown individual RNA isoforms. In parallel, we provide computational tools for experimental design and screen analysis. Considering all possible splice junctions annotated in GENCODE for multi-isoform genes and our gRNA efficacy predictions, we estimate that our junction-centric strategy can uniquely target up to 89% of human RNA isoforms, including 50,066 protein-coding and 11,415 lncRNA isoforms. Importantly, this specificity spans all splicing and transcriptional events, including exon skipping and inclusion, alternative 5' and 3' splice sites, and alternative starts and ends.
- Published
- 2023
- Full Text
- View/download PDF
11. Single-cell multi-omics defines the cell-type-specific impact of splicing aberrations in human hematopoietic clonal outgrowths.
- Author
-
Cortés-López M, Chamely P, Hawkins AG, Stanley RF, Swett AD, Ganesan S, Mouhieddine TH, Dai X, Kluegel L, Chen C, Batta K, Furer N, Vedula RS, Beaulaurier J, Drong AW, Hickey S, Dusaj N, Mullokandov G, Stasiw AM, Su J, Chaligné R, Juul S, Harrington E, Knowles DA, Potenski CJ, Wiseman DH, Tanay A, Shlush L, Lindsley RC, Ghobrial IM, Taylor J, Abdel-Wahab O, Gaiti F, and Landau DA
- Subjects
- Humans, Multiomics, RNA Splicing genetics, RNA Splicing Factors genetics, RNA Splicing Factors metabolism, Mutation genetics, Phosphoproteins genetics, Phosphoproteins metabolism, RNA Splice Sites, Myelodysplastic Syndromes genetics, Myelodysplastic Syndromes metabolism
- Abstract
RNA splicing factors are recurrently mutated in clonal blood disorders, but the impact of dysregulated splicing in hematopoiesis remains unclear. To overcome technical limitations, we integrated genotyping of transcriptomes (GoT) with long-read single-cell transcriptomics and proteogenomics for single-cell profiling of transcriptomes, surface proteins, somatic mutations, and RNA splicing (GoT-Splice). We applied GoT-Splice to hematopoietic progenitors from myelodysplastic syndrome (MDS) patients with mutations in the core splicing factor SF3B1. SF3B1
mut cells were enriched in the megakaryocytic-erythroid lineage, with expansion of SF3B1mut erythroid progenitor cells. We uncovered distinct cryptic 3' splice site usage in different progenitor populations and stage-specific aberrant splicing during erythroid differentiation. Profiling SF3B1-mutated clonal hematopoiesis samples revealed that erythroid bias and cell-type-specific cryptic 3' splice site usage in SF3B1mut cells precede overt MDS. Collectively, GoT-Splice defines the cell-type-specific impact of somatic mutations on RNA splicing, from early clonal outgrowths to overt neoplasia, directly in human samples., Competing Interests: Declaration of interests F.G. serves as a consultant for S2 Genomics Inc. X.D., J.B., A.W.D., S.H., S.J., and E.H. are employees of Oxford Nanopore Technologies Inc. and are shareholders and/or share option holders. I.M.G. serves on the advisory or consulting board of Bristol Myers Squibb, Takeda, Janssen, Sanofi, Novartis, Amgen, Celgene, Cellectar, Pfizer, Menarini Silicon Biosystems, Oncopeptides, The Binding Site, GlazoSmithKlein, AbbVie, Adaptive, and 10x Genomics. O.A.-W. has served as a consultant for H3B Biomedicine, Foundation Medicine Inc., Merck, Pfizer, and Janssen, and O.A.-W. is on the Scientific Advisory Board of Envisagenics Inc. and AIChemy. O.A.-W. has received prior research funding from H3B Biomedicine and LOXO Oncology unrelated to the current manuscript. D.A.L. has served as a consultant for AbbVie, AstraZeneca, and Illumina and is on the Scientific Advisory Board of Mission Bio, Pangea, Alethiomics, and C2i Genomics; D.A.L. has received prior research funding from BMS, 10x Genomics, Ultima Genomics, and Illumina unrelated to the current manuscript., (Copyright © 2023 Elsevier Inc. All rights reserved.)- Published
- 2023
- Full Text
- View/download PDF
12. Multiset correlation and factor analysis enables exploration of multi-omics data.
- Author
-
Brown BC, Wang C, Kasela S, Aguet F, Nachun DC, Taylor KD, Tracy RP, Durda P, Liu Y, Johnson WC, Van Den Berg D, Gupta N, Gabriel S, Smith JD, Gerzsten R, Clish C, Wong Q, Papanicolau G, Blackwell TW, Rotter JI, Rich SS, Barr RG, Ardlie KG, Knowles DA, and Lappalainen T
- Abstract
Multi-omics datasets are becoming more common, necessitating better integration methods to realize their revolutionary potential. Here, we introduce multi-set correlation and factor analysis (MCFA), an unsupervised integration method tailored to the unique challenges of high-dimensional genomics data that enables fast inference of shared and private factors. We used MCFA to integrate methylation markers, protein expression, RNA expression, and metabolite levels in 614 diverse samples from the Trans-Omics for Precision Medicine/Multi-Ethnic Study of Atherosclerosis multi-omics pilot. Samples cluster strongly by ancestry in the shared space, even in the absence of genetic information, while private spaces frequently capture dataset-specific technical variation. Finally, we integrated genetic data by conducting a genome-wide association study (GWAS) of our inferred factors, observing that several factors are enriched for GWAS hits and trans -expression quantitative trait loci. Two of these factors appear to be related to metabolic disease. Our study provides a foundation and framework for further integrative analysis of ever larger multi-modal genomic datasets., Competing Interests: T.L. is a paid adviser or consultant of GSK, Pfizer, and Goldfinch Bio and has equity in Variant Bio. F.A. is an employee and shareholder of Illumina, Inc., (© 2023 The Authors.)
- Published
- 2023
- Full Text
- View/download PDF
13. Computational models of dopamine release measured by fast scan cyclic voltammetry in vivo.
- Author
-
Shashaank N, Somayaji M, Miotto M, Mosharov EV, Makowicz EA, Knowles DA, Ruocco G, and Sulzer DL
- Abstract
Dopamine neurotransmission in the striatum is central to many normal and disease functions. Ventral midbrain dopamine neurons exhibit ongoing tonic firing that produces low extrasynaptic levels of dopamine below the detection of conventional extrasynaptic cyclic voltammetry (∼10-20 nanomolar), with superimposed bursts that can saturate the dopamine uptake transporter and produce transient micromolar concentrations. The bursts are known to lead to marked presynaptic plasticity via multiple mechanisms, but analysis methods for these kinetic parameters are limited. To provide a deeper understanding of the mechanics of the modulation of dopamine neurotransmission by physiological, genetic, and pharmacological means, we present three computational models of dopamine release with different levels of spatiotemporal complexity to analyze in vivo fast-scan cyclic voltammetry recordings from the dorsal striatum of mice. The models accurately fit to cyclic voltammetry data and provide estimates of presynaptic dopamine facilitation/depression kinetics and dopamine transporter reuptake kinetics, and we used the models to analyze the role of synuclein proteins in neurotransmission. The models' results support recent findings linking the presynaptic protein α-synuclein to the short-term facilitation and long-term depression of dopamine release, as well as reveal a new role for β-synuclein and/or γ-synuclein in the long-term regulation of dopamine reuptake., (© The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences.)
- Published
- 2023
- Full Text
- View/download PDF
14. LDmat: efficiently queryable compression of linkage disequilibrium matrices.
- Author
-
Weiner RJ, Lakhani C, Knowles DA, and Gürsoy G
- Subjects
- Humans, Linkage Disequilibrium, Genome-Wide Association Study, Genome, Software, Data Compression
- Abstract
Motivation: Linkage disequilibrium (LD) matrices derived from large populations are widely used in population genetics in fine-mapping, LD score regression, and linear mixed models for Genome-wide Association Studies (GWAS). However, these matrices can reach large sizes when they are derived from millions of individuals; hence, moving, sharing and extracting granular information from this large amount of data can be cumbersome., Results: We sought to address the need for compressing and easily querying large LD matrices by developing LDmat. LDmat is a standalone tool to compress large LD matrices in an HDF5 file format and query these compressed matrices. It can extract submatrices corresponding to a sub-region of the genome, a list of select loci, and loci within a minor allele frequency range. LDmat can also rebuild the original file formats from the compressed files., Availability and Implementation: LDmat is implemented in python, and can be installed on Unix systems with the command 'pip install ldmat'. It can also be accessed through https://github.com/G2Lab/ldmat and https://pypi.org/project/ldmat/., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2023. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
15. Using epigenomics to understand cellular responses to environmental influences in diseases.
- Author
-
Wattacheril JJ, Raj S, Knowles DA, and Greally JM
- Subjects
- Humans, Epigenesis, Genetic, Epigenomics, Environmental Exposure adverse effects, Phenotype, Non-alcoholic Fatty Liver Disease genetics
- Abstract
It is a generally accepted model that environmental influences can exert their effects, at least in part, by changing the molecular regulators of transcription that are described as epigenetic. As there is biochemical evidence that some epigenetic regulators of transcription can maintain their states long term and through cell division, an epigenetic model encompasses the idea of maintenance of the effect of an exposure long after it is no longer present. The evidence supporting this model is mostly from the observation of alterations of molecular regulators of transcription following exposures. With the understanding that the interpretation of these associations is more complex than originally recognised, this model may be oversimplistic; therefore, adopting novel perspectives and experimental approaches when examining how environmental exposures are linked to phenotypes may prove worthwhile. In this review, we have chosen to use the example of nonalcoholic fatty liver disease (NAFLD), a common, complex human disease with strong environmental and genetic influences. We describe how epigenomic approaches combined with emerging functional genetic and single-cell genomic techniques are poised to generate new insights into the pathogenesis of environmentally influenced human disease phenotypes exemplified by NAFLD., Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: JJW receives grant and contract funding from Galectin, Intercept, Genfit, Janssen, Shire, Conatus, Zydus, and Perspectum; she has served on advisory boards for Astra Zeneca/ MedImmune and AMRA. SR, DAK and JMG declare no conflicts. JMG, SR and DAK declare no conflicts., (Copyright: © 2023 Wattacheril et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
16. Integrative transcriptomic analysis of the amyotrophic lateral sclerosis spinal cord implicates glial activation and suggests new risk genes.
- Author
-
Humphrey J, Venkatesh S, Hasan R, Herb JT, de Paiva Lopes K, Küçükali F, Byrska-Bishop M, Evani US, Narzisi G, Fagegaltier D, Sleegers K, Phatnani H, Knowles DA, Fratta P, and Raj T
- Subjects
- Humans, Retrospective Studies, Transcriptome, Spinal Cord metabolism, Amyotrophic Lateral Sclerosis genetics, Amyotrophic Lateral Sclerosis metabolism, Neurodegenerative Diseases metabolism
- Abstract
Amyotrophic lateral sclerosis (ALS) is a progressively fatal neurodegenerative disease affecting motor neurons in the brain and spinal cord. In this study, we investigated gene expression changes in ALS via RNA sequencing in 380 postmortem samples from cervical, thoracic and lumbar spinal cord segments from 154 individuals with ALS and 49 control individuals. We observed an increase in microglia and astrocyte gene expression, accompanied by a decrease in oligodendrocyte gene expression. By creating a gene co-expression network in the ALS samples, we identified several activated microglia modules that negatively correlate with retrospective disease duration. We mapped molecular quantitative trait loci and found several potential ALS risk loci that may act through gene expression or splicing in the spinal cord and assign putative cell types for FNBP1, ACSL5, SH3RF1 and NFASC. Finally, we outline how common genetic variants associated with splicing of C9orf72 act as proxies for the well-known repeat expansion, and we use the same mechanism to suggest ATXN3 as a putative risk gene., (© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2023
- Full Text
- View/download PDF
17. Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models.
- Author
-
Malina S, Cizin D, and Knowles DA
- Subjects
- Causality, Research Design, Genomics, Mendelian Randomization Analysis methods, Deep Learning
- Abstract
Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the 'true' global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR's causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation., Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: SM is employed by and holds equity in Dyno Therapeutics.
- Published
- 2022
- Full Text
- View/download PDF
18. Welch-weighted Egger regression reduces false positives due to correlated pleiotropy in Mendelian randomization.
- Author
-
Brown BC and Knowles DA
- Subjects
- Computer Simulation, Female, Humans, Inflammation blood, Inflammation genetics, Male, Mendelian Randomization Analysis standards, Phenotype, Polymorphism, Single Nucleotide, False Positive Reactions, Genetic Pleiotropy, Mendelian Randomization Analysis methods, Models, Genetic, Regression Analysis
- Abstract
Modern population-scale biobanks contain simultaneous measurements of many phenotypes, providing unprecedented opportunity to study the relationship between biomarkers and disease. However, inferring causal effects from observational data is notoriously challenging. Mendelian randomization (MR) has recently received increased attention as a class of methods for estimating causal effects using genetic associations. However, standard methods result in pervasive false positives when two traits share a heritable, unobserved common cause. This is the problem of correlated pleiotropy. Here, we introduce a flexible framework for simulating traits with a common genetic confounder that generalizes recently proposed models, as well as a simple approach we call Welch-weighted Egger regression (WWER) for estimating causal effects. We show in comprehensive simulations that our method substantially reduces false positives due to correlated pleiotropy while being fast enough to apply to hundreds of phenotypes. We apply our method first to a subset of the UK Biobank consisting of blood traits and inflammatory disease, and then to a broader set of 411 heritable phenotypes. We detect many effects with strong literature support, as well as numerous behavioral effects that appear to stem from physician advice given to people at high risk for disease. We conclude that WWER is a powerful tool for exploratory data analysis in ever-growing databases of genotypes and phenotypes., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
19. An integrated approach to identify environmental modulators of genetic risk factors for complex traits.
- Author
-
Balliu B, Carcamo-Orive I, Gloudemans MJ, Nachun DC, Durrant MG, Gazal S, Park CY, Knowles DA, Wabitsch M, Quertermous T, Knowles JW, and Montgomery SB
- Subjects
- Autoimmune Diseases etiology, Autoimmune Diseases pathology, Humans, Mental Disorders etiology, Mental Disorders pathology, Metabolic Diseases etiology, Metabolic Diseases pathology, Phenotype, Gene-Environment Interaction, Genetic Predisposition to Disease, Genome-Wide Association Study, Multifactorial Inheritance, Polymorphism, Single Nucleotide, Quantitative Trait Loci
- Abstract
Complex traits and diseases can be influenced by both genetics and environment. However, given the large number of environmental stimuli and power challenges for gene-by-environment testing, it remains a critical challenge to identify and prioritize specific disease-relevant environmental exposures. We propose a framework for leveraging signals from transcriptional responses to environmental perturbations to identify disease-relevant perturbations that can modulate genetic risk for complex traits and inform the functions of genetic variants associated with complex traits. We perturbed human skeletal-muscle-, fat-, and liver-relevant cell lines with 21 perturbations affecting insulin resistance, glucose homeostasis, and metabolic regulation in humans and identified thousands of environmentally responsive genes. By combining these data with GWASs from 31 distinct polygenic traits, we show that the heritability of multiple traits is enriched in regions surrounding genes responsive to specific perturbations and, further, that environmentally responsive genes are enriched for associations with specific diseases and phenotypes from the GWAS Catalog. Overall, we demonstrate the advantages of large-scale characterization of transcriptional changes in diversely stimulated and pathologically relevant cells to identify disease-relevant perturbations., Competing Interests: Declaration of interests S.B.M. is on the SAB for Myome., (Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
20. Distinct Classes of Complex Structural Variation Uncovered across Thousands of Cancer Genome Graphs.
- Author
-
Hadi K, Yao X, Behr JM, Deshpande A, Xanthopoulakis C, Tian H, Kudman S, Rosiene J, Darmofal M, DeRose J, Mortensen R, Adney EM, Shaiber A, Gajic Z, Sigouros M, Eng K, Wala JA, Wrzeszczyński KO, Arora K, Shah M, Emde AK, Felice V, Frank MO, Darnell RB, Ghandi M, Huang F, Dewhurst S, Maciejowski J, de Lange T, Setton J, Riaz N, Reis-Filho JS, Powell S, Knowles DA, Reznik E, Mishra B, Beroukhim R, Zody MC, Robine N, Oman KM, Sanchez CA, Kuhner MK, Smith LP, Galipeau PC, Paulson TG, Reid BJ, Li X, Wilkes D, Sboner A, Mosquera JM, Elemento O, and Imielinski M
- Subjects
- Chromosome Inversion genetics, Chromothripsis, DNA Copy Number Variations genetics, Gene Rearrangement genetics, Genome, Human genetics, Humans, Mutation genetics, Whole Genome Sequencing methods, Genomic Structural Variation genetics, Genomics methods, Neoplasms genetics
- Abstract
Cancer genomes often harbor hundreds of somatic DNA rearrangement junctions, many of which cannot be easily classified into simple (e.g., deletion) or complex (e.g., chromothripsis) structural variant classes. Applying a novel genome graph computational paradigm to analyze the topology of junction copy number (JCN) across 2,778 tumor whole-genome sequences, we uncovered three novel complex rearrangement phenomena: pyrgo, rigma, and tyfonas. Pyrgo are "towers" of low-JCN duplications associated with early-replicating regions, superenhancers, and breast or ovarian cancers. Rigma comprise "chasms" of low-JCN deletions enriched in late-replicating fragile sites and gastrointestinal carcinomas. Tyfonas are "typhoons" of high-JCN junctions and fold-back inversions associated with expressed protein-coding fusions, breakend hypermutation, and acral, but not cutaneous, melanomas. Clustering of tumors according to genome graph-derived features identified subgroups associated with DNA repair defects and poor prognosis., Competing Interests: Declaration of Interests J.S.R.-F. reports receiving personal/consultancy fees from VolitionRx, Paige.AI, Goldman Sachs, REPARE Therapeutics, GRAIL, Ventana Medical Systems, Roche, Genentech, and InviCRO outside of the scope of the submitted work., (Copyright © 2020 Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
21. Molecular Choreography of Acute Exercise.
- Author
-
Contrepois K, Wu S, Moneghetti KJ, Hornburg D, Ahadi S, Tsai MS, Metwally AA, Wei E, Lee-McMullen B, Quijada JV, Chen S, Christle JW, Ellenberger M, Balliu B, Taylor S, Durrant MG, Knowles DA, Choudhry H, Ashland M, Bahmani A, Enslen B, Amsallem M, Kobayashi Y, Avina M, Perelman D, Schüssler-Fiorenza Rose SM, Zhou W, Ashley EA, Montgomery SB, Chaib H, Haddad F, and Snyder MP
- Subjects
- Aged, Biomarkers metabolism, Female, Humans, Insulin metabolism, Insulin Resistance, Leukocytes, Mononuclear metabolism, Longitudinal Studies, Male, Metabolome, Middle Aged, Oxygen metabolism, Oxygen Consumption, Proteome, Transcriptome, Energy Metabolism physiology, Exercise physiology
- Abstract
Acute physical activity leads to several changes in metabolic, cardiovascular, and immune pathways. Although studies have examined selected changes in these pathways, the system-wide molecular response to an acute bout of exercise has not been fully characterized. We performed longitudinal multi-omic profiling of plasma and peripheral blood mononuclear cells including metabolome, lipidome, immunome, proteome, and transcriptome from 36 well-characterized volunteers, before and after a controlled bout of symptom-limited exercise. Time-series analysis revealed thousands of molecular changes and an orchestrated choreography of biological processes involving energy metabolism, oxidative stress, inflammation, tissue repair, and growth factor response, as well as regulatory pathways. Most of these processes were dampened and some were reversed in insulin-resistant participants. Finally, we discovered biological pathways involved in cardiopulmonary exercise response and developed prediction models revealing potential resting blood-based biomarkers of peak oxygen consumption., Competing Interests: Declaration of Interests M.P.S. is a cofounder and on the advisory board of Personalis, SensOmics, January, Filtricine, Qbio, Protos, and Mirive. M.P.S. is on the advisory board of Genapsys and Tailai. M.P.S. is an inventor on provisional patent number 62/897,908 “Surrogate of VO2 MAX Test”. K.C. and F.H. are also listed as inventors. E.A.A. is a cofounder of Personalis, Deepcell, and SVEXA and on the advisory board of Apple, SequenceBio, and Foresite Labs., (Copyright © 2020 Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
22. A human lung tumor microenvironment interactome identifies clinically relevant cell-type cross-talk.
- Author
-
Gentles AJ, Hui AB, Feng W, Azizi A, Nair RV, Bouchard G, Knowles DA, Yu A, Jeong Y, Bejnood A, Forgó E, Varma S, Xu Y, Kuong A, Nair VS, West R, van de Rijn M, Hoang CD, Diehn M, and Plevritis SK
- Subjects
- Adenocarcinoma metabolism, Cell Line, Tumor, Fibroblasts metabolism, Humans, Intercellular Signaling Peptides and Proteins metabolism, Primary Cell Culture, Carcinoma, Non-Small-Cell Lung metabolism, Cell Communication, Lung Neoplasms metabolism, Receptor Cross-Talk, Tumor Microenvironment
- Abstract
Background: Tumors comprise a complex microenvironment of interacting malignant and stromal cell types. Much of our understanding of the tumor microenvironment comes from in vitro studies isolating the interactions between malignant cells and a single stromal cell type, often along a single pathway., Result: To develop a deeper understanding of the interactions between cells within human lung tumors, we perform RNA-seq profiling of flow-sorted malignant cells, endothelial cells, immune cells, fibroblasts, and bulk cells from freshly resected human primary non-small-cell lung tumors. We map the cell-specific differential expression of prognostically associated secreted factors and cell surface genes, and computationally reconstruct cross-talk between these cell types to generate a novel resource called the Lung Tumor Microenvironment Interactome (LTMI). Using this resource, we identify and validate a prognostically unfavorable influence of Gremlin-1 production by fibroblasts on proliferation of malignant lung adenocarcinoma cells. We also find a prognostically favorable association between infiltration of mast cells and less aggressive tumor cell behavior., Conclusion: These results illustrate the utility of the LTMI as a resource for generating hypotheses concerning tumor-microenvironment interactions that may have prognostic and therapeutic relevance.
- Published
- 2020
- Full Text
- View/download PDF
23. Genetic regulation of gene expression and splicing during a 10-year period of human aging.
- Author
-
Balliu B, Durrant M, Goede O, Abell N, Li X, Liu B, Gloudemans MJ, Cook NL, Smith KS, Knowles DA, Pala M, Cucca F, Schlessinger D, Jaiswal S, Sabatti C, Lind L, Ingelsson E, and Montgomery SB
- Subjects
- Aged, Aged, 80 and over, Aging metabolism, Female, Humans, Male, Aging genetics, Alternative Splicing, Gene Expression Regulation
- Abstract
Background: Molecular and cellular changes are intrinsic to aging and age-related diseases. Prior cross-sectional studies have investigated the combined effects of age and genetics on gene expression and alternative splicing; however, there has been no long-term, longitudinal characterization of these molecular changes, especially in older age., Results: We perform RNA sequencing in whole blood from the same individuals at ages 70 and 80 to quantify how gene expression, alternative splicing, and their genetic regulation are altered during this 10-year period of advanced aging at a population and individual level. We observe that individuals are more similar to their own expression profiles later in life than profiles of other individuals their own age. We identify 1291 and 294 genes differentially expressed and alternatively spliced with age, as well as 529 genes with outlying individual trajectories. Further, we observe a strong correlation of genetic effects on expression and splicing between the two ages, with a small subset of tested genes showing a reduction in genetic associations with expression and splicing in older age., Conclusions: These findings demonstrate that, although the transcriptome and its genetic regulation is mostly stable late in life, a small subset of genes is dynamic and is characterized by a reduction in genetic regulation, most likely due to increasing environmental variance with age.
- Published
- 2019
- Full Text
- View/download PDF
24. Landscape of stimulation-responsive chromatin across diverse human immune cells.
- Author
-
Calderon D, Nguyen MLT, Mezger A, Kathiria A, Müller F, Nguyen V, Lescano N, Wu B, Trombetta J, Ribado JV, Knowles DA, Gao Z, Blaeschke F, Parent AV, Burt TD, Anderson MS, Criswell LA, Greenleaf WJ, Marson A, and Pritchard JK
- Subjects
- Allelic Imbalance, B-Lymphocytes drug effects, B-Lymphocytes metabolism, Cells, Cultured, Chromatin drug effects, Chromatin immunology, Epigenesis, Genetic, Gene Expression Regulation genetics, Gene Expression Regulation immunology, Humans, Interleukin-2 pharmacology, Interleukin-4 pharmacology, Killer Cells, Natural drug effects, Killer Cells, Natural metabolism, Polysaccharides pharmacology, T-Lymphocytes drug effects, T-Lymphocytes metabolism, Transcriptome, B-Lymphocytes immunology, Chromatin genetics, Gene Expression Regulation drug effects, Killer Cells, Natural immunology, Response Elements genetics, T-Lymphocytes immunology
- Abstract
A hallmark of the immune system is the interplay among specialized cell types transitioning between resting and stimulated states. The gene regulatory landscape of this dynamic system has not been fully characterized in human cells. Here we collected assay for transposase-accessible chromatin using sequencing (ATAC-seq) and RNA sequencing data under resting and stimulated conditions for up to 32 immune cell populations. Stimulation caused widespread chromatin remodeling, including response elements shared between stimulated B and T cells. Furthermore, several autoimmune traits showed significant heritability in stimulation-responsive elements from distinct cell types, highlighting the importance of these cell states in autoimmunity. Allele-specific read mapping identified variants that alter chromatin accessibility in particular conditions, allowing us to observe evidence of function for a candidate causal variant that is undetected by existing large-scale studies in resting cells. Our results provide a resource of chromatin dynamics and highlight the need to characterize the effects of genetic variation in stimulated cells.
- Published
- 2019
- Full Text
- View/download PDF
25. Sparse discriminative latent characteristics for predicting cancer drug sensitivity from genomic features.
- Author
-
Knowles DA, Bouchard G, and Plevritis S
- Subjects
- Antineoplastic Agents pharmacology, Bayes Theorem, Biomarkers, Pharmacological, CCAAT-Enhancer-Binding Protein-delta genetics, Cell Line, Tumor, DNA Copy Number Variations, Genome, Genomics, Histone Deacetylase Inhibitors pharmacology, Humans, Neoplasms drug therapy, Panobinostat pharmacology, Regression Analysis, Statistics, Nonparametric, Forecasting methods, Neoplasms genetics
- Abstract
Drug screening studies typically involve assaying the sensitivity of a range of cancer cell lines across an array of anti-cancer therapeutics. Alongside these sensitivity measurements high dimensional molecular characterizations of the cell lines are typically available, including gene expression, copy number variation and genomic mutations. We propose a sparse multitask regression model which learns discriminative latent characteristics that predict drug sensitivity and are associated with specific molecular features. We use ideas from Bayesian nonparametrics to automatically infer the appropriate number of these latent characteristics. The resulting analysis couples high predictive performance with interpretability since each latent characteristic involves a typically small set of drugs, cell lines and genomic features. Our model uncovers a number of drug-gene sensitivity associations missed by single gene analyses. We functionally validate one such novel association: that increased expression of the cell-cycle regulator C/EBPδ decreases sensitivity to the histone deacetylase (HDAC) inhibitor panobinostat., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2019
- Full Text
- View/download PDF
26. Opportunities and challenges for transcriptome-wide association studies.
- Author
-
Wainberg M, Sinnott-Armstrong N, Mancuso N, Barbeira AN, Knowles DA, Golan D, Ermel R, Ruusalepp A, Quertermous T, Hao K, Björkegren JLM, Im HK, Pasaniuc B, Rivas MA, and Kundaje A
- Subjects
- Crohn Disease genetics, Genetic Variation genetics, Genome-Wide Association Study methods, Humans, Lipoproteins, LDL genetics, Quantitative Trait Loci genetics, Schizophrenia genetics, Genetic Predisposition to Disease genetics, Transcriptome genetics
- Abstract
Transcriptome-wide association studies (TWAS) integrate genome-wide association studies (GWAS) and gene expression datasets to identify gene-trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes at GWAS loci, by using simulations and case studies of literature-curated candidate causal genes for schizophrenia, low-density-lipoprotein cholesterol and Crohn's disease. We explore risk loci where TWAS accurately prioritizes the likely causal gene as well as loci where TWAS prioritizes multiple genes, some likely to be non-causal, owing to sharing of expression quantitative trait loci (eQTL). TWAS is especially prone to spurious prioritization with expression data from non-trait-related tissues or cell types, owing to substantial cross-cell-type variation in expression levels and eQTL strengths. Nonetheless, TWAS prioritizes candidate causal genes more accurately than simple baselines. We suggest best practices for causal-gene prioritization with TWAS and discuss future opportunities for improvement. Our results showcase the strengths and limitations of using eQTL datasets to determine causal genes at GWAS loci.
- Published
- 2019
- Full Text
- View/download PDF
27. Determining the genetic basis of anthracycline-cardiotoxicity by molecular response QTL mapping in induced cardiomyocytes.
- Author
-
Knowles DA, Burrows CK, Blischak JD, Patterson KM, Serie DJ, Norton N, Ober C, Pritchard JK, and Gilad Y
- Subjects
- Cells, Cultured, Doxorubicin toxicity, Gene Expression Profiling, Genome-Wide Association Study, Humans, Quantitative Trait Loci, Sequence Analysis, RNA, Anthracyclines toxicity, Cardiotoxicity, Myocytes, Cardiac drug effects
- Abstract
Anthracycline-induced cardiotoxicity (ACT) is a key limiting factor in setting optimal chemotherapy regimes, with almost half of patients expected to develop congestive heart failure given high doses. However, the genetic basis of sensitivity to anthracyclines remains unclear. We created a panel of iPSC-derived cardiomyocytes from 45 individuals and performed RNA-seq after 24 hr exposure to varying doxorubicin dosages. The transcriptomic response is substantial: the majority of genes are differentially expressed and over 6000 genes show evidence of differential splicing, the later driven by reduced splicing fidelity in the presence of doxorubicin. We show that inter-individual variation in transcriptional response is predictive of in vitro cell damage, which in turn is associated with in vivo ACT risk. We detect 447 response-expression quantitative trait loci (QTLs) and 42 response-splicing QTLs, which are enriched in lower ACT GWAS [Formula: see text]-values, supporting the in vivo relevance of our map of genetic regulation of cellular response to anthracyclines., Competing Interests: DK, CB, JB, KP, DS, NN, CO, JP, YG No competing interests declared, (© 2018, Knowles et al.)
- Published
- 2018
- Full Text
- View/download PDF
28. Interactions between genetic variation and cellular environment in skeletal muscle gene expression.
- Author
-
Taylor DL, Knowles DA, Scott LJ, Ramirez AH, Casale FP, Wolford BN, Guan L, Varshney A, Albanus RD, Parker SCJ, Narisu N, Chines PS, Erdos MR, Welch RP, Kinnunen L, Saramies J, Sundvall J, Lakka TA, Laakso M, Tuomilehto J, Koistinen HA, Stegle O, Boehnke M, Birney E, and Collins FS
- Subjects
- Energy Metabolism, Genetic Association Studies, Genotype, Humans, Muscle, Skeletal cytology, Phenotype, Polymorphism, Single Nucleotide, Quantitative Trait Loci, Cellular Microenvironment, Gene Expression Regulation, Gene-Environment Interaction, Genetic Variation, Muscle Fibers, Skeletal metabolism, Muscle, Skeletal metabolism
- Abstract
From whole organisms to individual cells, responses to environmental conditions are influenced by genetic makeup, where the effect of genetic variation on a trait depends on the environmental context. RNA-sequencing quantifies gene expression as a molecular trait, and is capable of capturing both genetic and environmental effects. In this study, we explore opportunities of using allele-specific expression (ASE) to discover cis-acting genotype-environment interactions (GxE)-genetic effects on gene expression that depend on an environmental condition. Treating 17 common, clinical traits as approximations of the cellular environment of 267 skeletal muscle biopsies, we identify 10 candidate environmental response expression quantitative trait loci (reQTLs) across 6 traits (12 unique gene-environment trait pairs; 10% FDR per trait) including sex, systolic blood pressure, and low-density lipoprotein cholesterol. Although using ASE is in principle a promising approach to detect GxE effects, replication of such signals can be challenging as validation requires harmonization of environmental traits across cohorts and a sufficient sampling of heterozygotes for a transcribed SNP. Comprehensive discovery and replication will require large human transcriptome datasets, or the integration of multiple transcribed SNPs, coupled with standardized clinical phenotyping.
- Published
- 2018
- Full Text
- View/download PDF
29. Annotation-free quantification of RNA splicing using LeafCutter.
- Author
-
Li YI, Knowles DA, Humphrey J, Barbeira AN, Dickinson SP, Im HK, and Pritchard JK
- Subjects
- Animals, Disease genetics, Gene Expression Profiling, Genetic Variation, Introns, Molecular Sequence Annotation, Quantitative Trait Loci, Alternative Splicing, Sequence Analysis, RNA methods, Software
- Abstract
The excision of introns from pre-mRNA is an essential step in mRNA processing. We developed LeafCutter to study sample and population variation in intron splicing. LeafCutter identifies variable splicing events from short-read RNA-seq data and finds events of high complexity. Our approach obviates the need for transcript annotations and circumvents the challenges in estimating relative isoform or exon usage in complex splicing events. LeafCutter can be used both to detect differential splicing between sample groups and to map splicing quantitative trait loci (sQTLs). Compared with contemporary methods, our approach identified 1.4-2.1 times more sQTLs, many of which helped us ascribe molecular effects to disease-associated variants. Transcriptome-wide associations between LeafCutter intron quantifications and 40 complex traits increased the number of associated disease genes at a 5% false discovery rate by an average of 2.1-fold compared with that detected through the use of gene expression levels alone. LeafCutter is fast, scalable, easy to use, and available online.
- Published
- 2018
- Full Text
- View/download PDF
30. Inferring Relevant Cell Types for Complex Traits by Using Single-Cell Gene Expression.
- Author
-
Calderon D, Bhaskar A, Knowles DA, Golan D, Raj T, Fu AQ, and Pritchard JK
- Subjects
- Alzheimer Disease diagnosis, Alzheimer Disease pathology, Computer Simulation, Fetus, Genome-Wide Association Study, Humans, Microglia pathology, Models, Genetic, Oligodendroglia pathology, Prefrontal Cortex metabolism, Prefrontal Cortex pathology, Quantitative Trait Loci, Schizophrenia diagnosis, Schizophrenia pathology, Single-Cell Analysis methods, Transcriptome, Alzheimer Disease genetics, Microglia metabolism, Oligodendroglia metabolism, Schizophrenia genetics, Single-Cell Analysis statistics & numerical data, Software
- Abstract
Previous studies have prioritized trait-relevant cell types by looking for an enrichment of genome-wide association study (GWAS) signal within functional regions. However, these studies are limited in cell resolution by the lack of functional annotations from difficult-to-characterize or rare cell populations. Measurement of single-cell gene expression has become a popular method for characterizing novel cell types, and yet limited work has linked single-cell RNA sequencing (RNA-seq) to phenotypes of interest. To address this deficiency, we present RolyPoly, a regression-based polygenic model that can prioritize trait-relevant cell types and genes from GWAS summary statistics and gene expression data. RolyPoly is designed to use expression data from either bulk tissue or single-cell RNA-seq. In this study, we demonstrated RolyPoly's accuracy through simulation and validated previously known tissue-trait associations. We discovered a significant association between microglia and late-onset Alzheimer disease and an association between schizophrenia and oligodendrocytes and replicating fetal cortical cells. Additionally, RolyPoly computes a trait-relevance score for each gene to reflect the importance of expression specific to a cell type. We found that differentially expressed genes in the prefrontal cortex of individuals with Alzheimer disease were significantly enriched with genes ranked highly by RolyPoly gene scores. Overall, our method represents a powerful framework for understanding the effect of common variants on cell types contributing to complex traits., (Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2017
- Full Text
- View/download PDF
31. Allele-specific expression reveals interactions between genetic variation and environment.
- Author
-
Knowles DA, Davis JR, Edgington H, Raj A, Favé MJ, Zhu X, Potash JB, Weissman MM, Shi J, Levinson DF, Awadalla P, Mostafavi S, Montgomery SB, and Battle A
- Subjects
- Adult, Cohort Studies, Female, Humans, Male, Models, Genetic, Quantitative Trait Loci, Alleles, Epigenesis, Genetic, Gene Expression Regulation, Genetic Variation
- Abstract
Identifying interactions between genetics and the environment (GxE) remains challenging. We have developed EAGLE, a hierarchical Bayesian model for identifying GxE interactions based on associations between environmental variables and allele-specific expression. Combining whole-blood RNA-seq with extensive environmental annotations collected from 922 human individuals, we identified 35 GxE interactions, compared with only four using standard GxE interaction testing. EAGLE provides new opportunities for researchers to identify GxE interactions using functional genomic data.
- Published
- 2017
- Full Text
- View/download PDF
32. Therapeutic reduction of ataxin-2 extends lifespan and reduces pathology in TDP-43 mice.
- Author
-
Becker LA, Huang B, Bieri G, Ma R, Knowles DA, Jafar-Nejad P, Messing J, Kim HJ, Soriano A, Auburger G, Pulst SM, Taylor JP, Rigo F, and Gitler AD
- Subjects
- Amyotrophic Lateral Sclerosis metabolism, Amyotrophic Lateral Sclerosis physiopathology, Animals, Ataxin-2 genetics, Central Nervous System metabolism, Cytoplasmic Granules metabolism, DNA-Binding Proteins chemistry, DNA-Binding Proteins genetics, Disease Progression, Female, Gene Knockdown Techniques, Humans, Male, Mice, Mice, Knockout, Mice, Transgenic, Motor Skills physiology, Oligonucleotides, Antisense administration & dosage, Oligonucleotides, Antisense genetics, Protein Aggregation, Pathological genetics, Stress, Physiological, Survival Analysis, Amyotrophic Lateral Sclerosis genetics, Amyotrophic Lateral Sclerosis therapy, Ataxin-2 deficiency, DNA-Binding Proteins metabolism, Longevity, Oligonucleotides, Antisense therapeutic use, Protein Aggregation, Pathological therapy
- Abstract
Amyotrophic lateral sclerosis (ALS) is a rapidly progressing neurodegenerative disease that is characterized by motor neuron loss and that leads to paralysis and death 2-5 years after disease onset. Nearly all patients with ALS have aggregates of the RNA-binding protein TDP-43 in their brains and spinal cords, and rare mutations in the gene encoding TDP-43 can cause ALS. There are no effective TDP-43-directed therapies for ALS or related TDP-43 proteinopathies, such as frontotemporal dementia. Antisense oligonucleotides (ASOs) and RNA-interference approaches are emerging as attractive therapeutic strategies in neurological diseases. Indeed, treatment of a rat model of inherited ALS (caused by a mutation in Sod1) with ASOs against Sod1 has been shown to substantially slow disease progression. However, as SOD1 mutations account for only around 2-5% of ALS cases, additional therapeutic strategies are needed. Silencing TDP-43 itself is probably not appropriate, given its critical cellular functions. Here we present a promising alternative therapeutic strategy for ALS that involves targeting ataxin-2. A decrease in ataxin-2 suppresses TDP-43 toxicity in yeast and flies, and intermediate-length polyglutamine expansions in the ataxin-2 gene increase risk of ALS. We used two independent approaches to test whether decreasing ataxin-2 levels could mitigate disease in a mouse model of TDP-43 proteinopathy. First, we crossed ataxin-2 knockout mice with TDP-43 (also known as TARDBP) transgenic mice. The decrease in ataxin-2 reduced aggregation of TDP-43, markedly increased survival and improved motor function. Second, in a more therapeutically applicable approach, we administered ASOs targeting ataxin-2 to the central nervous system of TDP-43 transgenic mice. This single treatment markedly extended survival. Because TDP-43 aggregation is a component of nearly all cases of ALS, targeting ataxin-2 could represent a broadly effective therapeutic strategy.
- Published
- 2017
- Full Text
- View/download PDF
33. Small RNA Sequencing in Cells and Exosomes Identifies eQTLs and 14q32 as a Region of Active Export.
- Author
-
Tsang EK, Abell NS, Li X, Anaya V, Karczewski KJ, Knowles DA, Sierra RG, Smith KS, and Montgomery SB
- Subjects
- Cell Line, Chromosomes, Human, Pair 14 genetics, Gene Expression Regulation, High-Throughput Nucleotide Sequencing, Humans, Lymphocyte Activation genetics, RNA, Small Interfering genetics, Sequence Analysis, RNA, Exosomes genetics, MicroRNAs genetics, Quantitative Trait Loci genetics
- Abstract
Exosomes are small extracellular vesicles that carry heterogeneous cargo, including RNA, between cells. Increasing evidence suggests that exosomes are important mediators of intercellular communication and biomarkers of disease. Despite this, the variability of exosomal RNA between individuals has not been well quantified. To assess this variability, we sequenced the small RNA of cells and exosomes from a 17-member family. Across individuals, we show that selective export of miRNAs occurs not only at the level of specific transcripts, but that a cluster of 74 mature miRNAs on chromosome 14q32 is massively exported in exosomes while mostly absent from cells. We also observe more interindividual variability between exosomal samples than between cellular ones and identify four miRNA expression quantitative trait loci shared between cells and exosomes. Our findings indicate that genomically colocated miRNAs can be exported together and highlight the variability in exosomal miRNA levels between individuals as relevant for exosome use as diagnostics., (Copyright © 2017 Tsang et al.)
- Published
- 2017
- Full Text
- View/download PDF
34. Batch effects and the effective design of single-cell gene expression studies.
- Author
-
Tung PY, Blischak JD, Hsiao CJ, Knowles DA, Burnett JE, Pritchard JK, and Gilad Y
- Subjects
- Gene Expression, High-Throughput Nucleotide Sequencing, Humans, Induced Pluripotent Stem Cells cytology, Induced Pluripotent Stem Cells metabolism, Principal Component Analysis, RNA chemistry, RNA isolation & purification, Sequence Analysis, RNA, RNA metabolism, Single-Cell Analysis
- Abstract
Single-cell RNA sequencing (scRNA-seq) can be used to characterize variation in gene expression levels at high resolution. However, the sources of experimental noise in scRNA-seq are not yet well understood. We investigated the technical variation associated with sample processing using the single-cell Fluidigm C1 platform. To do so, we processed three C1 replicates from three human induced pluripotent stem cell (iPSC) lines. We added unique molecular identifiers (UMIs) to all samples, to account for amplification bias. We found that the major source of variation in the gene expression data was driven by genotype, but we also observed substantial variation between the technical replicates. We observed that the conversion of reads to molecules using the UMIs was impacted by both biological and technical variation, indicating that UMI counts are not an unbiased estimator of gene expression levels. Based on our results, we suggest a framework for effective scRNA-seq studies.
- Published
- 2017
- Full Text
- View/download PDF
35. Impact of the X Chromosome and sex on regulatory variation.
- Author
-
Kukurba KR, Parsana P, Balliu B, Smith KS, Zappala Z, Knowles DA, Favé MJ, Davis JR, Li X, Zhu X, Potash JB, Weissman MM, Shi J, Kundaje A, Levinson DF, Awadalla P, Mostafavi S, Battle A, and Montgomery SB
- Subjects
- Female, Gene Expression Profiling, Gene Expression Regulation, Genetic Predisposition to Disease, Genome, Human, Humans, Male, Polymorphism, Single Nucleotide, Quantitative Trait Loci, Sex Characteristics, Chromosomes, Human, X genetics, Transcriptome
- Abstract
The X Chromosome, with its unique mode of inheritance, contributes to differences between the sexes at a molecular level, including sex-specific gene expression and sex-specific impact of genetic variation. Improving our understanding of these differences offers to elucidate the molecular mechanisms underlying sex-specific traits and diseases. However, to date, most studies have either ignored the X Chromosome or had insufficient power to test for the sex-specific impact of genetic variation. By analyzing whole blood transcriptomes of 922 individuals, we have conducted the first large-scale, genome-wide analysis of the impact of both sex and genetic variation on patterns of gene expression, including comparison between the X Chromosome and autosomes. We identified a depletion of expression quantitative trait loci (eQTL) on the X Chromosome, especially among genes under high selective constraint. In contrast, we discovered an enrichment of sex-specific regulatory variants on the X Chromosome. To resolve the molecular mechanisms underlying such effects, we generated chromatin accessibility data through ATAC-sequencing to connect sex-specific chromatin accessibility to sex-specific patterns of expression and regulatory variation. As sex-specific regulatory variants discovered in our study can inform sex differences in heritable disease prevalence, we integrated our data with genome-wide association study data for multiple immune traits identifying several traits with significant sex biases in genetic susceptibilities. Together, our study provides genome-wide insight into how genetic variation, the X Chromosome, and sex shape human gene regulation and disease., (© 2016 Kukurba et al.; Published by Cold Spring Harbor Laboratory Press.)
- Published
- 2016
- Full Text
- View/download PDF
36. RNA splicing is a primary link between genetic variation and disease.
- Author
-
Li YI, van de Geijn B, Raj A, Knowles DA, Petti AA, Golan D, Gilad Y, and Pritchard JK
- Subjects
- Cell Line, Chromatin metabolism, Genome-Wide Association Study, Humans, Lymphocytes immunology, Phenotype, Polymorphism, Single Nucleotide, Gene Expression Regulation, Genetic Variation, Immune System Diseases genetics, Quantitative Trait Loci, RNA Splicing genetics
- Abstract
Noncoding variants play a central role in the genetics of complex traits, but we still lack a full understanding of the molecular pathways through which they act. We quantified the contribution of cis-acting genetic effects at all major stages of gene regulation from chromatin to proteins, in Yoruba lymphoblastoid cell lines (LCLs). About ~65% of expression quantitative trait loci (eQTLs) have primary effects on chromatin, whereas the remaining eQTLs are enriched in transcribed regions. Using a novel method, we also detected 2893 splicing QTLs, most of which have little or no effect on gene-level expression. These splicing QTLs are major contributors to complex traits, roughly on a par with variants that affect gene expression levels. Our study provides a comprehensive view of the mechanisms linking genetic variation to variation in human gene regulation., (Copyright © 2016, American Association for the Advancement of Science.)
- Published
- 2016
- Full Text
- View/download PDF
37. An Efficient Multiple-Testing Adjustment for eQTL Studies that Accounts for Linkage Disequilibrium between Variants.
- Author
-
Davis JR, Fresard L, Knowles DA, Pala M, Bustamante CD, Battle A, and Montgomery SB
- Subjects
- Humans, Linkage Disequilibrium, Quantitative Trait Loci
- Abstract
Methods for multiple-testing correction in local expression quantitative trait locus (cis-eQTL) studies are a trade-off between statistical power and computational efficiency. Bonferroni correction, though computationally trivial, is overly conservative and fails to account for linkage disequilibrium between variants. Permutation-based methods are more powerful, though computationally far more intensive. We present an alternative correction method called eigenMT, which runs over 500 times faster than permutations and has adjusted p values that closely approximate empirical ones. To achieve this speed while also maintaining the accuracy of permutation-based methods, we estimate the effective number of independent variants tested for association with a particular gene, termed Meff, by using the eigenvalue decomposition of the genotype correlation matrix. We employ a regularized estimator of the correlation matrix to ensure Meff is robust and yields adjusted p values that closely approximate p values from permutations. Finally, using a common genotype matrix, we show that eigenMT can be applied with even greater efficiency to studies across tissues or conditions. Our method provides a simpler, more efficient approach to multiple-testing correction than existing methods and fits within existing pipelines for eQTL discovery., (Copyright © 2016 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)
- Published
- 2016
- Full Text
- View/download PDF
38. Relational Learning and Network Modelling Using Infinite Latent Attribute Models.
- Author
-
Palla K, Knowles DA, and Ghahramani Z
- Subjects
- Computer Simulation, Informatics methods, Machine Learning, Models, Theoretical
- Abstract
Latent variable models for network data extract a summary of the relational structure underlying an observed network. The simplest possible models subdivide nodes of the network into clusters; the probability of a link between any two nodes then depends only on their cluster assignment. Currently available models can be classified by whether clusters are disjoint or are allowed to overlap. These models can explain a "flat" clustering structure. Hierarchical Bayesian models provide a natural approach to capture more complex dependencies. We propose a model in which objects are characterised by a latent feature vector. Each feature is itself partitioned into disjoint groups (subclusters), corresponding to a second layer of hierarchy. In experimental comparisons, the model achieves significantly improved predictive performance on social and biological link prediction tasks. The results indicate that models with a single layer hierarchy over-simplify real networks.
- Published
- 2015
- Full Text
- View/download PDF
39. Pitman Yor Diffusion Trees for Bayesian Hierarchical Clustering.
- Author
-
Knowles DA and Ghahramani Z
- Abstract
In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric prior over tree structures which generalises the Dirichlet Diffusion Tree [30] and removes the restriction to binary branching structure. The generative process is described and shown to result in an exchangeable distribution over data points. We prove some theoretical properties of the model including showing its construction as the continuum limit of a nested Chinese restaurant process model. We then present two alternative MCMC samplers which allow us to model uncertainty over tree structures, and a computationally efficient greedy Bayesian EM search algorithm. Both algorithms use message passing on the tree structure. The utility of the model and algorithms is demonstrated on synthetic and real world data, both continuous and binary.
- Published
- 2015
- Full Text
- View/download PDF
40. Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants.
- Author
-
Li X, Battle A, Karczewski KJ, Zappala Z, Knowles DA, Smith KS, Kukurba KR, Wu E, Simon N, and Montgomery SB
- Subjects
- Family, Haplotypes genetics, High-Throughput Nucleotide Sequencing, Humans, Lymphocytes metabolism, White People genetics, Genome, Human, Polymorphism, Single Nucleotide genetics, Quantitative Trait Loci, RNA, Untranslated genetics, Sequence Analysis, RNA, Transcriptome
- Abstract
Recent and rapid human population growth has led to an excess of rare genetic variants that are expected to contribute to an individual's genetic burden of disease risk. To date, much of the focus has been on rare protein-coding variants, for which potential impact can be estimated from the genetic code, but determining the impact of rare noncoding variants has been more challenging. To improve our understanding of such variants, we combined high-quality genome sequencing and RNA sequencing data from a 17-individual, three-generation family to contrast expression quantitative trait loci (eQTLs) and splicing quantitative trait loci (sQTLs) within this family to eQTLs and sQTLs within a population sample. Using this design, we found that eQTLs and sQTLs with large effects in the family were enriched with rare regulatory and splicing variants (minor allele frequency < 0.01). They were also more likely to influence essential genes and genes involved in complex disease. In addition, we tested the capacity of diverse noncoding annotation to predict the impact of rare noncoding variants. We found that distance to the transcription start site, evolutionary constraint, and epigenetic annotation were considerably more informative for predicting the impact of rare variants than for predicting the impact of common variants. These results highlight that rare noncoding variants are important contributors to individual gene-expression profiles and further demonstrate a significant capability for genomic annotation to predict the impact of rare noncoding variants., (Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2014
- Full Text
- View/download PDF
41. Allelic expression of deleterious protein-coding variants across human tissues.
- Author
-
Kukurba KR, Zhang R, Li X, Smith KS, Knowles DA, How Tan M, Piskol R, Lek M, Snyder M, Macarthur DG, Li JB, and Montgomery SB
- Subjects
- Exome, Humans, Polymerase Chain Reaction, Alleles, Proteins genetics
- Abstract
Personal exome and genome sequencing provides access to loss-of-function and rare deleterious alleles whose interpretation is expected to provide insight into individual disease burden. However, for each allele, accurate interpretation of its effect will depend on both its penetrance and the trait's expressivity. In this regard, an important factor that can modify the effect of a pathogenic coding allele is its level of expression; a factor which itself characteristically changes across tissues. To better inform the degree to which pathogenic alleles can be modified by expression level across multiple tissues, we have conducted exome, RNA and deep, targeted allele-specific expression (ASE) sequencing in ten tissues obtained from a single individual. By combining such data, we report the impact of rare and common loss-of-function variants on allelic expression exposing stronger allelic bias for rare stop-gain variants and informing the extent to which rare deleterious coding alleles are consistently expressed across tissues. This study demonstrates the potential importance of transcriptome data to the interpretation of pathogenic protein-coding variants.
- Published
- 2014
- Full Text
- View/download PDF
42. Distinct epigenomic features in end-stage failing human hearts.
- Author
-
Movassagh M, Choy MK, Knowles DA, Cordeddu L, Haider S, Down T, Siggens L, Vujic A, Simeoni I, Penkett C, Goddard M, Lio P, Bennett MR, and Foo RS
- Subjects
- Case-Control Studies, CpG Islands genetics, CpG Islands physiology, DNA Methylation physiology, Heart Failure diagnosis, Heart Failure physiopathology, Histones genetics, Histones metabolism, Homeodomain Proteins genetics, Homeodomain Proteins metabolism, Humans, Male, Prognosis, Disease Progression, Epigenomics, Gene Expression Regulation physiology, Heart Failure genetics
- Abstract
Background: The epigenome refers to marks on the genome, including DNA methylation and histone modifications, that regulate the expression of underlying genes. A consistent profile of gene expression changes in end-stage cardiomyopathy led us to hypothesize that distinct global patterns of the epigenome may also exist., Methods and Results: We constructed genome-wide maps of DNA methylation and histone-3 lysine-36 trimethylation (H3K36me3) enrichment for cardiomyopathic and normal human hearts. More than 506 Mb sequences per library were generated by high-throughput sequencing, allowing us to assign methylation scores to ≈28 million CG dinucleotides in the human genome. DNA methylation was significantly different in promoter CpG islands, intragenic CpG islands, gene bodies, and H3K36me3-enriched regions of the genome. DNA methylation differences were present in promoters of upregulated genes but not downregulated genes. H3K36me3 enrichment itself was also significantly different in coding regions of the genome. Specifically, abundance of RNA transcripts encoded by the DUX4 locus correlated to differential DNA methylation and H3K36me3 enrichment. In vitro, Dux gene expression was responsive to a specific inhibitor of DNA methyltransferase, and Dux siRNA knockdown led to reduced cell viability., Conclusions: Distinct epigenomic patterns exist in important DNA elements of the cardiac genome in human end-stage cardiomyopathy. The epigenome may control the expression of local or distal genes with critical functions in myocardial stress response. If epigenomic patterns track with disease progression, assays for the epigenome may be useful for assessing prognosis in heart failure. Further studies are needed to determine whether and how the epigenome contributes to the development of cardiomyopathy.
- Published
- 2011
- Full Text
- View/download PDF
43. Direct alpha-oxytosylation of carbonyl compounds: one-pot synthesis of heterocycles.
- Author
-
John OR, Killeen NM, Knowles DA, Yau SC, Bagley MC, and Tomkinson NC
- Abstract
N-Methyl-O-tosylhydroxylamine is an effective reagent for the direct alpha-oxytosylation of carbonyl compounds. The reactions proceed smoothly at room temperature in the presence of both moisture and air and functional group tolerance in the substrate is good. With nonsymmetrical substrates regioselectivity for primary over secondary centers is observed and complete regiospecificity for primary over tertiary centers is obtained. Addition of a bis-heteronucleophile directly to the crude reaction mixture in a one-pot process leads to the corresponding heterocyclic product.
- Published
- 2007
- Full Text
- View/download PDF
44. Our elderly.
- Author
-
Knowles DA
- Subjects
- Congresses as Topic, Disabled Persons, Humans, London, Aged
- Published
- 1978
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.