33 results on '"Chelsea J-T, Ju"'
Search Results
2. Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition.
- Author
-
Ruirui Li 0002, Chelsea J.-T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, and Andreas Stolcke
- Published
- 2021
- Full Text
- View/download PDF
3. MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning.
- Author
-
Jyun-Yu Jiang, Zeyu Li 0001, Chelsea J.-T. Ju, and Wei Wang 0010
- Published
- 2020
- Full Text
- View/download PDF
4. Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases.
- Author
-
Junheng Hao, Chelsea J.-T. Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, and Wei Wang 0010
- Published
- 2020
- Full Text
- View/download PDF
5. JEDI: circular RNA prediction based on junction encoders and deep interaction among splice sites.
- Author
-
Jyun-Yu Jiang, Chelsea J.-T. Ju, Junheng Hao, Muhao Chen, and Wei Wang 0010
- Published
- 2021
- Full Text
- View/download PDF
6. Adversarial Reweighting for Speaker Verification Fairness.
- Author
-
Minho Jin, Chelsea J.-T. Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, and Andreas Stolcke
- Published
- 2022
- Full Text
- View/download PDF
7. CORALS: Who Are My Potential New Customers? Tapping into the Wisdom of Customers' Decisions.
- Author
-
Ruirui Li 0002, Jyun-Yu Jiang, Chelsea J.-T. Ju, and Wei Wang 0010
- Published
- 2019
- Full Text
- View/download PDF
8. Prediction of microbial communities for urban metagenomics using neural network approach
- Author
-
Guangyu Zhou, Jyun-Yu Jiang, Chelsea J.-T. Ju, and Wei Wang
- Subjects
Urban metagenomics ,Multi-label classification ,Neural network ,Medicine ,Genetics ,QH426-470 - Abstract
Abstract Background Microbes are greatly associated with human health and disease, especially in densely populated cities. It is essential to understand the microbial ecosystem in an urban environment for cities to monitor the transmission of infectious diseases and detect potentially urgent threats. To achieve this goal, the DNA sample collection and analysis have been conducted at subway stations in major cities. However, city-scale sampling with the fine-grained geo-spatial resolution is expensive and laborious. In this paper, we introduce MetaMLAnn, a neural network based approach to infer microbial communities at unsampled locations given information reflecting different factors, including subway line networks, sampling material types, and microbial composition patterns. Results We evaluate the effectiveness of MetaMLAnn based on the public metagenomics dataset collected from multiple locations in the New York and Boston subway systems. The experimental results suggest that MetaMLAnn consistently performs better than other five conventional classifiers under different taxonomic ranks. At genus level, MetaMLAnn can achieve F1 scores of 0.63 and 0.72 on the New York and the Boston datasets, respectively. Conclusions By exploiting heterogeneous features, MetaMLAnn captures the hidden interactions between microbial compositions and the urban environment, which enables precise predictions of microbial communities at unmeasured locations.
- Published
- 2019
- Full Text
- View/download PDF
9. Functional disease architectures reveal unique biological role of transposable elements
- Author
-
Farhad Hormozdiari, Bryce van de Geijn, Joseph Nasser, Omer Weissbrod, Steven Gazal, Chelsea J. -T. Ju, Luke O’ Connor, Margaux L. A. Hujoel, Jesse Engreitz, Fereydoun Hormozdiari, and Alkes L. Price
- Subjects
Science - Abstract
Transposable elements (TE) make up a large component of the human genome and have been shown to contribute to human diseases. Here, Hormozdiari et al. estimate the contribution of TEs to the heritability of 41 complex traits and diseases and find enrichment of SINEs in blood traits.
- Published
- 2019
- Full Text
- View/download PDF
10. Inferring Microbial Communities for City Scale Metagenomics Using Neural Networks.
- Author
-
Guangyu Zhou, Jyun-Yu Jiang, Chelsea J.-T. Ju, and Wei Wang 0010
- Published
- 2018
- Full Text
- View/download PDF
11. Multifaceted protein-protein interaction prediction based on Siamese residual RCNN.
- Author
-
Muhao Chen, Chelsea J.-T. Ju, Guangyu Zhou, Xuelu Chen, Tianran Zhang, Kai-Wei Chang, Carlo Zaniolo, and Wei Wang 0010
- Published
- 2019
- Full Text
- View/download PDF
12. Fleximer: Accurate Quantification of RNA-Seq via Variable-Length k-mers.
- Author
-
Chelsea J.-T. Ju, Ruirui Li 0002, Zhengliang Wu, Jyun-Yu Jiang, Zhao Yang, and Wei Wang 0010
- Published
- 2017
- Full Text
- View/download PDF
13. Non-local convolutional neural networks (nlcnn) for speaker recognition.
- Author
-
Haici Yang, Hongda Mao, Ruirui Li 0002, Chelsea J.-T. Ju, and Oguz Elibol
- Published
- 2020
14. Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates.
- Author
-
Chelsea J.-T. Ju, Zhuangtian Zhao, and Wei Wang 0010
- Published
- 2017
- Full Text
- View/download PDF
15. PseudoLasso: leveraging read alignment in homologous regions to correct pseudogene expression estimates via RNASeq.
- Author
-
Chelsea J.-T. Ju, Zhuangtian Zhao, and Wei Wang 0010
- Published
- 2014
- Full Text
- View/download PDF
16. Multifaceted protein–protein interaction prediction based on Siamese residual RCNN
- Author
-
Xuelu Chen, Wei Wang, Carlo Zaniolo, Tianran Zhang, Muhao Chen, Chelsea J.-T. Ju, Guangyu Zhou, and Kai-Wei Chang
- Subjects
Statistics and Probability ,Computer science ,Residual ,Machine learning ,computer.software_genre ,Biochemistry ,Convolutional neural network ,03 medical and health sciences ,0302 clinical medicine ,Ismb/Eccb 2019 Conference Proceedings ,Amino Acid Sequence ,Molecular Biology ,Peptide sequence ,Mutual influence ,030304 developmental biology ,0303 health sciences ,Sequence ,Artificial neural network ,business.industry ,Computational Biology ,Proteins ,Macromolecular Sequence, Structure, and Function ,Ligand (biochemistry) ,Computer Science Applications ,Computational Mathematics ,ComputingMethodologies_PATTERNRECOGNITION ,Computational Theory and Mathematics ,030220 oncology & carcinogenesis ,Protein–protein interaction prediction ,Neural Networks, Computer ,Artificial intelligence ,business ,computer ,Algorithms ,Protein Binding - Abstract
MotivationSequence-based protein–protein interaction (PPI) prediction represents a fundamental computational biology problem. To address this problem, extensive research efforts have been made to extract predefined features from the sequences. Based on these features, statistical algorithms are learned to classify the PPIs. However, such explicit features are usually costly to extract, and typically have limited coverage on the PPI information.ResultsWe present an end-to-end framework, PIPR (Protein–Protein Interaction Prediction Based on Siamese Residual RCNN), for PPI predictions using only the protein sequences. PIPR incorporates a deep residual recurrent convolutional neural network in the Siamese architecture, which leverages both robust local features and contextualized information, which are significant for capturing the mutual influence of proteins sequences. PIPR relieves the data pre-processing efforts that are required by other systems, and generalizes well to different application scenarios. Experimental evaluations show that PIPR outperforms various state-of-the-art systems on the binary PPI prediction problem. Moreover, it shows a promising performance on more challenging problems of interaction type prediction and binding affinity estimation, where existing approaches fall short.Availability and implementationThe implementation is available at https://github.com/muhaochen/seq_ppi.git.Supplementary informationSupplementary data are available at Bioinformatics online.
- Published
- 2019
17. Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition
- Author
-
Zeya Chen, Andreas Stolcke, Ruirui Li, Oguz H. Elibol, Chelsea J.-T. Ju, and Hongda Mao
- Subjects
FOS: Computer and information sciences ,Fusion ,Computer Science - Machine Learning ,Sound (cs.SD) ,Computer Science - Computation and Language ,Computer science ,Speech recognition ,Speech input ,Speaker recognition ,Combined approach ,Computer Science - Sound ,Machine Learning (cs.LG) ,Score fusion ,Audio and Speech Processing (eess.AS) ,FOS: Electrical engineering, electronic engineering, information engineering ,Speaker identification ,Baseline (configuration management) ,Joint (audio engineering) ,Computation and Language (cs.CL) ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
By implicitly recognizing a user based on his/her speech input, speaker identification enables many downstream applications, such as personalized system behavior and expedited shopping checkouts. Based on whether the speech content is constrained or not, both text-dependent (TD) and text-independent (TI) speaker recognition models may be used. We wish to combine the advantages of both types of models through an ensemble system to make more reliable predictions. However, any such combined approach has to be robust to incomplete inputs, i.e., when either TD or TI input is missing. As a solution we propose a fusion of embeddings network foenet architecture, combining joint learning with neural attention. We compare foenet with four competitive baseline methods on a dataset of voice assistant inputs, and show that it achieves higher accuracy than the baseline and score fusion methods, especially in the presence of incomplete inputs.
- Published
- 2021
- Full Text
- View/download PDF
18. JEDI: Circular RNA Prediction based on Junction Encoders and Deep Interaction among Splice Sites
- Author
-
Wei Wang, Muhao Chen, Chelsea J.-T. Ju, Jyun-Yu Jiang, and Junheng Hao
- Subjects
Statistics and Probability ,Neural Networks ,AcademicSubjects/SCI01060 ,Bioinformatics ,Computer science ,1.1 Normal biological development and functioning ,RNA Splicing ,Circular ,Endogeny ,Computational biology ,ENCODE ,Biochemistry ,Mathematical Sciences ,Transcriptome ,03 medical and health sciences ,Computer ,0302 clinical medicine ,Circular RNA ,Information and Computing Sciences ,Genetics ,splice ,Molecular Biology ,Gene ,030304 developmental biology ,Regulatory and Functional Genomics ,Regulation of gene expression ,0303 health sciences ,Artificial neural network ,business.industry ,Deep learning ,Nucleic acid sequence ,RNA, Circular ,Biological Sciences ,Computer Science Applications ,Computational Mathematics ,Recurrent neural network ,Computational Theory and Mathematics ,RNA splicing ,RNA ,Long Noncoding ,RNA, Long Noncoding ,Neural Networks, Computer ,RNA Splice Sites ,Artificial intelligence ,Primary sequence ,business ,030217 neurology & neurosurgery - Abstract
Motivation Circular RNA (circRNA) is a novel class of long non-coding RNAs that have been broadly discovered in the eukaryotic transcriptome. The circular structure arises from a non-canonical splicing process, where the donor site backspliced to an upstream acceptor site. These circRNA sequences are conserved across species. More importantly, rising evidence suggests their vital roles in gene regulation and association with diseases. As the fundamental effort toward elucidating their functions and mechanisms, several computational methods have been proposed to predict the circular structure from the primary sequence. Recently, advanced computational methods leverage deep learning to capture the relevant patterns from RNA sequences and model their interactions to facilitate the prediction. However, these methods fail to fully explore positional information of splice junctions and their deep interaction. Results We present a robust end-to-end framework, Junction Encoder with Deep Interaction (JEDI), for circRNA prediction using only nucleotide sequences. JEDI first leverages the attention mechanism to encode each junction site based on deep bidirectional recurrent neural networks and then presents the novel cross-attention layer to model deep interaction among these sites for backsplicing. Finally, JEDI can not only predict circRNAs but also interpret relationships among splice sites to discover backsplicing hotspots within a gene region. Experiments demonstrate JEDI significantly outperforms state-of-the-art approaches in circRNA prediction on both isoform level and gene level. Moreover, JEDI also shows promising results on zero-shot backsplicing discovery, where none of the existing approaches can achieve. Availability and implementation The implementation of our framework is available at https://github.com/hallogameboy/JEDI. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2020
- Full Text
- View/download PDF
19. Mutation effect estimation on protein-protein interactions using deep contextualized representation learning
- Author
-
Zheng Wang, Wei Wang, Jyun-Yu Jiang, Guangyu Zhou, Muhao Chen, and Chelsea J.-T. Ju
- Subjects
Computer science ,0206 medical engineering ,02 engineering and technology ,Computational biology ,medicine.disease_cause ,ENCODE ,Protein–protein interaction ,03 medical and health sciences ,Protein sequencing ,Methods Article ,medicine ,030304 developmental biology ,chemistry.chemical_classification ,0303 health sciences ,Mutation ,business.industry ,Deep learning ,Point mutation ,030302 biochemistry & molecular biology ,Wild type ,Ligand (biochemistry) ,Perceptron ,Amino acid ,chemistry ,Mutation (genetic algorithm) ,Artificial intelligence ,business ,Feature learning ,020602 bioinformatics - Abstract
The functional impact of protein mutations is reflected on the alteration of conformation and thermodynamics of protein–protein interactions (PPIs). Quantifying the changes of two interacting proteins upon mutations is commonly carried out by computational approaches. Hence, extensive research efforts have been put to the extraction of energetic or structural features on proteins, followed by statistical learning methods to estimate the effects of mutations on PPI properties. Nonetheless, such features require extensive human labors and expert knowledge to obtain, and have limited abilities to reflect point mutations. We present an end-to-end deep learning framework, MuPIPR (Mutation Effects in Protein–protein Interaction PRediction Using Contextualized Representations), to estimate the effects of mutations on PPIs. MuPIPR incorporates a contextualized representation mechanism of amino acids to propagate the effects of a point mutation to surrounding amino acid representations, therefore amplifying the subtle change in a long protein sequence. On top of that, MuPIPR leverages a Siamese residual recurrent convolutional neural encoder to encode a wild-type protein pair and its mutation pair. Multi-layer perceptron regressors are applied to the protein pair representations to predict the quantifiable changes of PPI properties upon mutations. Experimental evaluations show that, with only sequence information, MuPIPR outperforms various state-of-the-art systems on estimating the changes of binding affinity for SKEMPI v1, and offers comparable performance on SKEMPI v2. Meanwhile, MuPIPR also demonstrates state-of-the-art performance on estimating the changes of buried surface areas. The software implementation is available at https://github.com/guangyu-zhou/MuPIPR.
- Published
- 2019
- Full Text
- View/download PDF
20. Functional disease architectures reveal unique biological role of transposable elements
- Author
-
Margaux L. A. Hujoel, Fereydoun Hormozdiari, Joseph Nasser, Chelsea J.-T. Ju, Farhad Hormozdiari, Jesse M. Engreitz, Steven Gazal, Alkes L. Price, Bryce van de Geijn, Luke J. O’Connor, and Omer Weissbrod
- Subjects
0301 basic medicine ,Inheritance Patterns ,General Physics and Astronomy ,02 engineering and technology ,Genome-wide association studies ,Genome ,0302 clinical medicine ,Disease ,lcsh:Science ,Short Interspersed Nucleotide Elements ,Regulation of gene expression ,0303 health sciences ,Brain Diseases ,Multidisciplinary ,Single Nucleotide ,021001 nanoscience & nanotechnology ,DNA transposable elements ,0210 nano-technology ,Algorithms ,Human ,Transposable element ,Retroelements ,Evolution ,Science ,Quantitative Trait Loci ,Genetic predisposition to disease ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Article ,General Biochemistry, Genetics and Molecular Biology ,Autoimmune Diseases ,Evolution, Molecular ,03 medical and health sciences ,Genetics ,Humans ,Sine ,Polymorphism ,030304 developmental biology ,Genome, Human ,Human Genome ,Molecular ,General Chemistry ,Heritability ,Noncoding DNA ,Human genetics ,Genetic architecture ,030104 developmental biology ,Gene Expression Regulation ,Evolutionary biology ,DNA Transposable Elements ,Genetic markers ,lcsh:Q ,Human genome ,Generic health relevance ,030217 neurology & neurosurgery - Abstract
Transposable elements (TE) comprise roughly half of the human genome. Though initially derided as junk DNA, they have been widely hypothesized to contribute to the evolution of gene regulation. However, the contribution of TE to the genetic architecture of diseases remains unknown. Here, we analyze data from 41 independent diseases and complex traits to draw three conclusions. First, TE are uniquely informative for disease heritability. Despite overall depletion for heritability (54% of SNPs, 39 ± 2% of heritability), TE explain substantially more heritability than expected based on their depletion for known functional annotations. This implies that TE acquire function in ways that differ from known functional annotations. Second, older TE contribute more to disease heritability, consistent with acquiring biological function. Third, Short Interspersed Nuclear Elements (SINE) are far more enriched for blood traits than for other traits. Our results can help elucidate the biological roles that TE play in the genetic architecture of diseases., Transposable elements (TE) make up a large component of the human genome and have been shown to contribute to human diseases. Here, Hormozdiari et al. estimate the contribution of TEs to the heritability of 41 complex traits and diseases and find enrichment of SINEs in blood traits.
- Published
- 2019
21. Lasagna: Multifaceted Protein-Protein Interaction Prediction Based on Siamese Residual RCNN
- Author
-
Chelsea J.-T. Ju, Kai-Wei Chang, Guangyu Zhou, Muhao Chen, Tianran Zhang, Wei Wang, Carlo Zaniolo, and Xuelu Chen
- Subjects
Sequence ,business.industry ,Computer science ,A protein ,Ligand (biochemistry) ,Machine learning ,computer.software_genre ,Residual ,Convolutional neural network ,ComputingMethodologies_PATTERNRECOGNITION ,Protein–protein interaction prediction ,Artificial intelligence ,business ,computer ,Mutual influence - Abstract
Sequence-based protein-protein interaction (PPI) prediction represents a fundamental computational biology problem. To address this problem, extensive research efforts have been made to extract predefined features from the sequences. Based on these features, statistical algorithms are learned to classify the PPIs. However, such explicit features are usually costly to extract, and typically have limited coverage on the PPI information. Hence, we present an end-to-end framework, Lasagna, for PPI predictions using only the primary sequences of a protein pair. Lasagna incorporates a deep residual recurrent convolutional neural network in the Siamese learning architecture, which leverages both robust local features and contextualized information that are significant for capturing the mutual influence of protein sequences. Our framework relieves the data pre-processing efforts that are required by other systems, and generalizes well to different application scenarios. Experimental evaluations show that Lasagna outperforms various state-of-the-art systems on the binary PPI prediction problem. Moreover, it shows a promising performance on more challenging problems of interaction type prediction and binding affinity estimation, where existing approaches fall short.
- Published
- 2018
- Full Text
- View/download PDF
22. Inferring Microbial Communities for City Scale Metagenomics Using Neural Networks
- Author
-
Chelsea J.-T. Ju, Jyun-Yu Jiang, Guangyu Zhou, and Wei Wang
- Subjects
0301 basic medicine ,Multi-label classification ,03 medical and health sciences ,030104 developmental biology ,Subway line ,Exploit ,Artificial neural network ,Metagenomics ,Computer science ,Sampling (statistics) ,City scale ,Data science ,Urban environment - Abstract
Microbes play a critical role in human health and disease, especially in cities with high population densities. Understanding the microbial ecosystem in an urban environment is essential for monitoring the transmission of infectious diseases and identifying potentially urgent threats. To achieve this goal, researchers have started to collect and analyze metagenomic samples from subway stations in major cities. However, it is too costly and time-consuming to achieve city-wide sampling with fine-grained geo-spatial resolution. In this paper, we present MetaMLAnn, a neural network based approach to infer microbial communities at unmeasured locations, based upon information from various data sources in an urban environment, including subway line information, sampling material, and microbial compositions. MetaMLAnn exploits these heterogeneous features to capture the latent dependencies between microbial compositions and the urban environment, thereby precisely inferring microbial communities at unsampled locations. Moreover, we propose a regularization framework to incorporate the species relatedness as prior knowledge. We evaluate our approach using the public metagenomics dataset collected from multiple subway stations in New York and Boston. The experimental results show that MetaMLAnn consistently outperforms five conventional classifiers across several evaluation metrics. The code, features and labels are available at https://github.com/zgy921028/MetaMLAnn
- Published
- 2018
- Full Text
- View/download PDF
23. TahcoRoll: An Efficient Approach for Signature Profiling in Genomic Data through Variable-Length k-mers
- Author
-
Chelsea J.-T. Ju, Ruirui Li, Wei Wang, Jyun-Yu Jiang, and Zeyu Li
- Subjects
De Bruijn sequence ,chemistry.chemical_classification ,Source code ,Computer science ,media_common.quotation_subject ,Biological classification ,computer.software_genre ,Data structure ,Genome ,Transcriptome ,chemistry ,Metagenomics ,Profiling (information science) ,Nucleotide ,Data mining ,Nanopore sequencing ,computer ,media_common - Abstract
k-mer profiling has been one of the trending approaches to analyze read data generated by high-throughput sequencing technologies. The tasks of k-mer profiling include, but are not limited to, counting the frequencies and determining the occurrences of short sequences in a dataset. The notion of k-mer has been extensively used to build de Bruijn graphs in genome or transcriptome assembly, which requires examining all possible k-mers presented in the dataset. Recently, an alternative way of profiling has been proposed, which constructs a set of representative k-mers as genomic markers and profiles their occurrences in the sequencing data. This technique has been applied in both transcript quantification through RNA-Seq and taxonomic classification of metagenomic reads. Most of these applications use a set of fixed-size k-mers since the majority of existing k-mer counters are inadequate to process genomic sequences with variable-length k-mers. However, choosing the appropriate k is challenging, as it varies for different applications. As a pioneer work to profile a set of variable-length k-mers, we propose TahcoRoll in order to enhance the Aho-Corasick algorithm. More specifically, we use one bit to represent each nucleotide, and integrate the rolling hash technique to construct an efficient in-memory data structure for this task. Using both synthetic and real datasets, results show that TahcoRoll outperforms existing approaches in either or both time and memory efficiency without using any disk space. In addition, compared to the most efficient state-of-the-art k-mer counters, such as KMC and MSBWT, TahcoRoll is the only approach that can process long read data from both PacBio and Oxford Nanopore on a commodity desktop computer. The source code of TahcoRoll is implemented in C++14, and available at https://github.com/chelseaju/TahcoRoll.git.
- Published
- 2017
- Full Text
- View/download PDF
24. Leveraging molecular quantitative trait loci to understand the genetic architecture of diseases and complex traits
- Author
-
Farhad Hormozdiari, Alexander Gusev, Chelsea J.-T. Ju, Bryce van de Geijn, Armin P. Schoech, Steven Gazal, Yakir A. Reshef, Alkes L. Price, Luke J. O’Connor, Hilary K. Finucane, Eleazar Eskin, Xuanyao Liu, and Po-Ru Loh
- Subjects
0301 basic medicine ,Multifactorial Inheritance ,Quantitative Trait Loci ,food and beverages ,Genomics ,Genome-wide association study ,Computational biology ,Quantitative trait locus ,Biology ,Heritability ,Polymorphism, Single Nucleotide ,Genetic architecture ,Article ,03 medical and health sciences ,030104 developmental biology ,Phenotype ,Quantitative Trait, Heritable ,Expression quantitative trait loci ,Genetics ,Trait ,Humans ,Disease ,Genetic association ,Genome-Wide Association Study - Abstract
There is increasing evidence that many risk loci found using genome-wide association studies are molecular quantitative trait loci (QTLs). Here we introduce a new set of functional annotations based on causal posterior probabilities of fine-mapped molecular cis-QTLs, using data from the Genotype-Tissue Expression (GTEx) and BLUEPRINT consortia. We show that these annotations are more strongly enriched for heritability (5.84× for eQTLs; P = 1.19 × 10-31) across 41 diseases and complex traits than annotations containing all significant molecular QTLs (1.80× for expression (e)QTLs). eQTL annotations obtained by meta-analyzing all GTEx tissues generally performed best, whereas tissue-specific eQTL annotations produced stronger enrichments for blood- and brain-related diseases and traits. eQTL annotations restricted to loss-of-function intolerant genes were even more enriched for heritability (17.06×; P = 1.20 × 10-35). All molecular QTLs except splicing QTLs remained significantly enriched in joint analysis, indicating that each of these annotations is uniquely informative for disease and complex trait architectures.
- Published
- 2017
25. Widespread Allelic Heterogeneity in Complex Traits
- Author
-
Chelsea J.-T. Ju, Sriram Sankararaman, Anthony Zhu, Sagiv Shifman, Hyejung Won, Eleazar Eskin, Gleb Kichaev, Jong Wha J. Joo, Bogdan Pasaniuc, Farhad Hormozdiari, and Ayellet V. Segrè
- Subjects
Models, Molecular ,0301 basic medicine ,Quantitative Trait Loci ,causal variants ,Genome-wide association study ,Locus (genetics) ,Biology ,eQTL ,Medical and Health Sciences ,Article ,Linkage Disequilibrium ,complex traits ,03 medical and health sciences ,Databases ,0302 clinical medicine ,Gene Frequency ,Genetic ,Models ,Databases, Genetic ,Genetics ,Humans ,Association mapping ,Allele frequency ,Genetics (clinical) ,Alleles ,Genetic Association Studies ,030304 developmental biology ,Genetic association ,Genetics & Heredity ,0303 health sciences ,allelic heterogeneity ,Prevention ,Human Genome ,Molecular ,Biological Sciences ,Genetic architecture ,030104 developmental biology ,Phenotype ,Sample size determination ,Expression quantitative trait loci ,gene expression ,Allelic heterogeneity ,030217 neurology & neurosurgery ,Biotechnology - Abstract
Recent successes in genome-wide association studies (GWASs) make it possible to address important questions about the genetic architecture of complex traits, such as allele frequency and effect size. One lesser-known aspect of complex traits is the extent of allelic heterogeneity (AH) arising from multiple causal variants at a locus. We developed a computational method to infer the probability of AH and applied it to three GWAS and four expression quantitative trait loci (eQTL) datasets. We identified a total of 4152 loci with strong evidence of AH. The proportion of all loci with identified AH is 4-23% in eQTLs, 35% in GWAS of High-Density Lipoprotein (HDL), and 23% in schizophrenia. For eQTLs, we observed a strong correlation between sample size and the proportion of loci with AH (R2=0.85, P = 2.2e-16), indicating that statistical power prevents identification of AH in other loci. Understanding the extent of AH may guide the development of new methods for fine mapping and association mapping of complex traits.
- Published
- 2017
26. Root herbivory: molecular analysis of the maize transcriptome upon infestation by Southern corn rootworm, Diabrotica undecimpunctata howardi
- Author
-
Susan D. Lawrence, Chelsea J.-T. Ju, Walid El Kayal, Janice E. K. Cooke, and Nicole G. Novak
- Subjects
Physiology ,Cyclopentanes ,Plant Science ,medicine.disease_cause ,Plant Roots ,Zea mays ,Transcriptome ,chemistry.chemical_compound ,Plant Growth Regulators ,Gene Expression Regulation, Plant ,Infestation ,Botany ,Genetics ,medicine ,Animals ,Herbivory ,Oxylipins ,Gene ,Oligonucleotide Array Sequence Analysis ,Plant Diseases ,Diabrotica undecimpunctata ,biology ,Gene Expression Profiling ,Jasmonic acid ,fungi ,food and beverages ,Cell Biology ,General Medicine ,biology.organism_classification ,WRKY protein domain ,Up-Regulation ,Coleoptera ,chemistry ,RNA, Plant ,Larva ,Shoot ,Salicylic Acid ,Plant Shoots ,Salicylic acid ,Signal Transduction - Abstract
While many studies have characterized changes to the transcriptome of plants attacked by shoot-eating insect pests, few have examined transcriptome-level effects of root pests. Maize (Zea mays) seedlings were subjected to infestation for approximately 2 weeks by the root herbivore southern corn rootworm (SCR) Diabrotica undecimpunctata howardi, and changes in transcript abundance within both roots and shoots were analyzed using a 57K element microarray. A total of 541 genes showed statistically significant changes in transcript abundance in infested roots, including genes encoding many pathogenesis-related proteins such as chitinases, proteinase inhibitors, peroxidases and β-1,3-glucanases. Several WRKY transcription factors – often associated with biotic responses – exhibited increased transcript abundance upon SCR feeding. Differentially expressed (DE) genes were also detected in shoots of infested vs control plants. Quantitative Reverse Transcriptase Polymerase Chain Reaction (RT-PCR) was used to confirm patterns of transcript abundance for several significant DE genes using an independent experiment with a 2–6 day period of SCR infestation. Because of the well-documented roles that jasmonic acid (JA) or salicylic acid (SA) play in herbivory responses, the effect of exogenous JA or SA application on transcript abundance corresponding to the same subset of SCR-responsive genes was assessed. The response of these genes at the level of transcript abundance to SA and JA differed between roots and shoots and also differed among the genes that were examined. These data suggested that SA- and JA-dependent and independent signals contributed to the transcriptome-level changes in maize roots and shoots in response to SCR infestation.
- Published
- 2012
- Full Text
- View/download PDF
27. Molecular events of apical bud formation in white spruce, Picea glauca
- Author
-
Walid El Kayal, Susanne King-Jones, Eri Adams, L. Irina Zaharia, Chelsea J.-T. Ju, Carmen C. G. Allen, Janice E. K. Cooke, and Suzanne R. Abrams
- Subjects
chemistry.chemical_classification ,Physiology ,Apical dominance ,fungi ,Plant Science ,Biology ,Indeterminate growth ,Gene expression profiling ,White (mutation) ,chemistry.chemical_compound ,chemistry ,Auxin ,Shoot ,Gene expression ,Botany ,Abscisic acid - Abstract
Bud formation is an adaptive trait that temperate forest trees have acquired to facilitate seasonal synchronization. We have characterized transcriptome-level changes that occur during bud formation of white spruce [Picea glauca (Moench) Voss], a primarily determinate species in which preformed stem units contained within the apical bud constitute most of next season's growth. Microarray analysis identified 4460 differentially expressed sequences in shoot tips during short day-induced bud formation. Cluster analysis revealed distinct temporal patterns of expression, and functional classification of genes in these clusters implied molecular processes that coincide with anatomical changes occurring in the developing bud. Comparing expression profiles in developing buds under long day and short day conditions identified possible photoperiod-responsive genes that may not be essential for bud development. Several genes putatively associated with hormone signalling were identified, and hormone quantification revealed distinct profiles for abscisic acid (ABA), cytokinins, auxin and their metabolites that can be related to morphological changes to the bud. Comparison of gene expression profiles during bud formation in different tissues revealed 108 genes that are differentially expressed only in developing buds and show greater transcript abundance in developing buds than other tissues. These findings provide a temporal roadmap of bud formation in white spruce.
- Published
- 2011
- Full Text
- View/download PDF
28. Examining the molecular interaction between potato (Solanum tuberosum) and Colorado potato beetle Leptinotarsa decemlineata
- Author
-
Susan D. Lawrence, Nicole G. Novak, Janice E.K. CookeJ.E. Cooke, and Chelsea J.-T. Ju
- Subjects
Expressed sequence tag ,Ecology ,biology ,fungi ,Colorado potato beetle ,food and beverages ,Plant Science ,biology.organism_classification ,Solanum tuberosum ,Botany ,PEST analysis ,Gene ,Leptinotarsa ,Ecology, Evolution, Behavior and Systematics ,Solanaceae ,Tropinone reductase I - Abstract
Colorado potato beetle (CPB) is a devastating herbivorous pest of solanaceous plants. Despite the economic impact, little is known about the molecular interaction of CPB with these plants. Using an 11 421 expressed sequence tag (EST) potato microarray, we identified 320 genes differentially expressed in potato leaves in response to CPB herbivory. Amongst these were genes putatively encoding proteinase inhibitors along with enzymes of terpenoid, alkaloid, and phenylpropanoid biosynthetic pathways, suggesting the defensive chemistries that constitute potato’s defense against CPB herbivory. Several genes, such as those encoding proteinase inhibitors, represent mechanisms implicated in other plant–herbivory interactions, and could correspond with general defensive chemistry strategies. In other cases, products of the differentially expressed genes may represent taxa-specific defensive chemistry. For example, the presumed alkaloid products of a putative tropinone reductase I are specific to a subset of the Solanaceae. Two herbivory-induced genes, not specific to potato, are implicated in the synthesis of volatiles known to attract CPB predators. Comparison of continuous herbivore attack versus recovery from CPB attack indicates that fewer genes involved in defensive chemistry are induced after continuous feeding than after feeding and recovery, suggesting the plant’s ability to mount a full defense response is enhanced under light versus heavy attack.
- Published
- 2008
- Full Text
- View/download PDF
29. Temporal Dynamics of Plasma Metabolites in ISO-induced Cardiac Remodeling in Mice
- Author
-
Peipei Ping, Ding Wang, Chelsea J.-T. Ju, Howard Choi, Wei Wang, Quan Cao, Jennifer S. Polson, and David A. Liem
- Subjects
Chemistry ,Dynamics (mechanics) ,Biophysics ,Plasma ,Cardiology and Cardiovascular Medicine ,Molecular Biology - Published
- 2017
- Full Text
- View/download PDF
30. Abstract 112: Targeted Metabolomics Profiling of Heart Failure Patients Undergoing Mechanical Circulatory Support
- Author
-
X'avia Chan, J.H. Howard Choi, Chelsea J.-T. Ju, Wei Wang, Jun Zhang, Jason Tabaraki, David Liem, Martin Cadeiras, Mario Deng, and Peipei Ping
- Subjects
Physiology ,Cardiology and Cardiovascular Medicine - Abstract
Metabolomics investigations hold promise for the characterization of small molecules, metabolites, which govern the ultimate manifestation of cardiac phenotypes. In this study, we employed a mass spectrometry-based metabolomics approach to identify metabolic marker(s), which dynamically reflect the cardiac performance of heart failure patients amid the implantation of mechanical circulatory support. Using the MRM-based and triple quadrupole technology platform, we have quantified 266 metabolites native to human plasma and collected from thirteen heart failure patients. The temporal profile of these metabolites was sampled from 1 day prior to the implantation of mechanical circulatory support, as well as 1-, 3-, 5-, and 7-day following their surgical interventions. We identified subgroups of these metabolites with coordinated behaviors that are interesting to their diseased phenotypes. In a pair-wise correlation analysis, 36.8% (98 out of 266) of metabolites were significantly correlated. Intriguingly, majority of which (65 out of 98) are representing the functional groups of phosphatidylcholines; several of them are known to have close associations with the pathogenesis of cardiovascular diseases. In addition, there are 33 metabolites contributing to multiple functional groups, including twelve of them belong to sphingomyelines, ten of them in the family of lysophosphatidylcholines, eight amino acids (Gln, Ser, Ala, His, Lys, Gly, Thr, and Arg), as well as three fatty acids (eicosapentaenoic acid, pentadecenoic acid, and heptadecenoic acid). The behaviors of these 266 metabolites have constituted individualized metabolic fingerprints. Delineation of the intrinsic relationships among alterations in distinct metabolite groups and their reflected cardiac function will enable us to identify new metabolic markers aiding stratification and/or prediction on the clinical outcome of each individual patient undergoing the treatment of mechanical circulatory support. This personalized metabolic fingerprint will offer unique prognostic utilities, supporting clinical decision-making process to deliver intervention that is most effective and beneficial to an individual.
- Published
- 2014
- Full Text
- View/download PDF
31. Integrated transcriptomic and proteomic profiling of white spruce stems during the transition from active growth to dormancy
- Author
-
Leonardo M, Galindo González, Walid, El Kayal, Chelsea J-T, Ju, Carmen C G, Allen, Susanne, King-Jones, and Janice E K, Cooke
- Subjects
Cold Temperature ,Proteomics ,Cambium ,Cell Wall ,Gene Expression Regulation, Plant ,Stress, Physiological ,Gene Expression Profiling ,Photoperiod ,Picea ,Adaptation, Physiological ,Oligonucleotide Array Sequence Analysis ,Plant Proteins ,Trees - Abstract
In the autumn, stems of woody perennials such as forest trees undergo a transition from active growth to dormancy. We used microarray transcriptomic profiling in combination with a proteomics analysis to elucidate processes that occur during this growth-to-dormancy transition in a conifer, white spruce (Picea glauca[Moench] Voss). Several differentially expressed genes were likely associated with the developmental transition that occurs during growth cessation in the cambial zone and the concomitant completion of cell maturation in vascular tissues. Genes encoding for cell wall and membrane biosynthetic enzymes showed transcript abundance patterns consistent with completion of cell maturation, and also of cell wall and membrane modifications potentially enabling cells to withstand the harsh conditions of winter. Several differentially expressed genes were identified that encoded putative regulators of cambial activity, cell development and of the photoperiodic pathway. Reconfiguration of carbon allocation figured centrally in the tree's overwintering preparations. For example, genes associated with carbon-based defences such as terpenoids were down-regulated, while many genes associated with protein-based defences and other stress mitigation mechanisms were up-regulated. Several of these correspond to proteins that were accumulated during the growth-to-dormancy transition, emphasizing the importance of stress protection in the tree's adaptive response to overwintering.
- Published
- 2011
32. Molecular events of apical bud formation in white spruce, Picea glauca
- Author
-
Walid, El Kayal, Carmen C G, Allen, Chelsea J-T, Ju, Eri, Adams, Susanne, King-Jones, L Irina, Zaharia, Suzanne R, Abrams, and Janice E K, Cooke
- Subjects
Cytokinins ,Indoleacetic Acids ,Gene Expression Regulation, Plant ,RNA, Plant ,Gene Expression Profiling ,Photoperiod ,Quebec ,Cluster Analysis ,Picea ,Plant Shoots ,Abscisic Acid ,Oligonucleotide Array Sequence Analysis - Abstract
Bud formation is an adaptive trait that temperate forest trees have acquired to facilitate seasonal synchronization. We have characterized transcriptome-level changes that occur during bud formation of white spruce [Picea glauca (Moench) Voss], a primarily determinate species in which preformed stem units contained within the apical bud constitute most of next season's growth. Microarray analysis identified 4460 differentially expressed sequences in shoot tips during short day-induced bud formation. Cluster analysis revealed distinct temporal patterns of expression, and functional classification of genes in these clusters implied molecular processes that coincide with anatomical changes occurring in the developing bud. Comparing expression profiles in developing buds under long day and short day conditions identified possible photoperiod-responsive genes that may not be essential for bud development. Several genes putatively associated with hormone signalling were identified, and hormone quantification revealed distinct profiles for abscisic acid (ABA), cytokinins, auxin and their metabolites that can be related to morphological changes to the bud. Comparison of gene expression profiles during bud formation in different tissues revealed 108 genes that are differentially expressed only in developing buds and show greater transcript abundance in developing buds than other tissues. These findings provide a temporal roadmap of bud formation in white spruce.
- Published
- 2010
33. Potato, Solanum tuberosum, defense against Colorado potato beetle, Leptinotarsa decemlineata (Say): microarray gene expression profiling of potato by Colorado potato beetle regurgitant treatment of wounded leaves
- Author
-
Janice E. K. Cooke, Nicole G. Novak, Susan D. Lawrence, and Chelsea J.-T. Ju
- Subjects
Nitrogen ,Down-Regulation ,Biochemistry ,Microbiology ,Gene Expression Regulation, Plant ,Botany ,Gene expression ,Animals ,Photosynthesis ,Secondary metabolism ,Gene ,Ecology, Evolution, Behavior and Systematics ,Oligonucleotide Array Sequence Analysis ,Solanum tuberosum ,Regulation of gene expression ,biology ,Gene Expression Profiling ,fungi ,Colorado potato beetle ,food and beverages ,General Medicine ,biology.organism_classification ,Up-Regulation ,Gene expression profiling ,Coleoptera ,Plant Leaves ,Real-time polymerase chain reaction ,Food ,Protein Biosynthesis ,Carbohydrate Metabolism ,Solanaceae - Abstract
Colorado potato beetle (CPB) is a leading pest of solanaceous plants. Despite the economic importance of this pest, surprisingly few studies have been carried out to characterize its molecular interaction with the potato plant. In particular, little is known about the effect of CPB elicitors on gene expression associated with the plant's defense response. In order to discover putative CPB elicitor-responsive genes, the TIGR 11,421 EST Solanaceae microarray was used to identify genes that are differentially expressed in response to the addition of CPB regurgitant to wounded potato leaves. By applying a cutoff corresponding to an adjusted P-value of0.01 and a fold change of1.5 or0.67, we found that 73 of these genes are induced by regurgitant treatment of wounded leaves when compared to wounding alone, whereas 54 genes are repressed by this treatment. This gene set likely includes regurgitant-responsive genes as well as wounding-responsive genes whose expression patterns are further enhanced by the presence of regurgitant. Real-time polymerase chain reaction was used to validate differential expression by regurgitant treatment for five of these genes. In general, genes that encoded proteins involved in secondary metabolism and stress were induced by regurgitant; genes associated with photosynthesis were repressed. One induced gene that encodes aromatic amino acid decarboxylase is responsible for synthesis of the precursor of 2-phenylethanol. This is significant because 2-phenylethanol is recognized by the CPB predator Perillus bioculatis. In addition, three of the 16 type 1 and type 2 proteinase inhibitor clones present on the potato microarray were repressed by application of CPB regurgitant to wounded leaves. Given that proteinase inhibitors are known to interfere with digestion of proteins in the insect midgut, repression of these proteinase inhibitors by CPB may inhibit this component of the plant's defense arsenal. These data suggest that beyond the wound response, CPB elicitors play a role in mediating the plant/insect interaction.
- Published
- 2008
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.