12 results on '"Love MI"'
Search Results
2. DeCompress: tissue compartment deconvolution of targeted mRNA expression panels using compressed sensing.
- Author
-
Bhattacharya A, Hamilton AM, Troester MA, and Love MI
- Subjects
- Algorithms, Animals, Breast Neoplasms genetics, Breast Neoplasms metabolism, Databases, Genetic, Female, Genomics, Humans, Lung Neoplasms genetics, Lung Neoplasms metabolism, Male, Neoplasms metabolism, Prostatic Neoplasms genetics, Prostatic Neoplasms metabolism, Quantitative Trait Loci, RNA, Messenger genetics, RNA-Seq, Receptors, CCR3 genetics, Receptors, CCR3 metabolism, Single-Cell Analysis, Benchmarking methods, Computational Biology methods, Gene Expression Profiling methods, Neoplasms genetics, RNA, Messenger metabolism
- Abstract
Targeted mRNA expression panels, measuring up to 800 genes, are used in academic and clinical settings due to low cost and high sensitivity for archived samples. Most samples assayed on targeted panels originate from bulk tissue comprised of many cell types, and cell-type heterogeneity confounds biological signals. Reference-free methods are used when cell-type-specific expression references are unavailable, but limited feature spaces render implementation challenging in targeted panels. Here, we present DeCompress, a semi-reference-free deconvolution method for targeted panels. DeCompress leverages a reference RNA-seq or microarray dataset from similar tissue to expand the feature space of targeted panels using compressed sensing. Ensemble reference-free deconvolution is performed on this artificially expanded dataset to estimate cell-type proportions and gene signatures. In simulated mixtures, four public cell line mixtures, and a targeted panel (1199 samples; 406 genes) from the Carolina Breast Cancer Study, DeCompress recapitulates cell-type proportions with less error than reference-free methods and finds biologically relevant compartments. We integrate compartment estimates into cis-eQTL mapping in breast cancer, identifying a tumor-specific cis-eQTL for CCR3 (C-C Motif Chemokine Receptor 3) at a risk locus. DeCompress improves upon reference-free methods without requiring expression profiles from pure cell populations, with applications in genomic analyses and clinical settings., (© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2021
- Full Text
- View/download PDF
3. MOSTWAS: Multi-Omic Strategies for Transcriptome-Wide Association Studies.
- Author
-
Bhattacharya A, Li Y, and Love MI
- Subjects
- Gene Expression Profiling methods, Humans, Models, Genetic, Organ Specificity genetics, Phenotype, Polymorphism, Single Nucleotide, Quantitative Trait Loci, Reproducibility of Results, Computational Biology methods, Genome-Wide Association Study methods, Software
- Abstract
Traditional predictive models for transcriptome-wide association studies (TWAS) consider only single nucleotide polymorphisms (SNPs) local to genes of interest and perform parameter shrinkage with a regularization process. These approaches ignore the effect of distal-SNPs or other molecular effects underlying the SNP-gene association. Here, we outline multi-omics strategies for transcriptome imputation from germline genetics to allow more powerful testing of gene-trait associations by prioritizing distal-SNPs to the gene of interest. In one extension, we identify mediating biomarkers (CpG sites, microRNAs, and transcription factors) highly associated with gene expression and train predictive models for these mediators using their local SNPs. Imputed values for mediators are then incorporated into the final predictive model of gene expression, along with local SNPs. In the second extension, we assess distal-eQTLs (SNPs associated with genes not in a local window around it) for their mediation effect through mediating biomarkers local to these distal-eSNPs. Distal-eSNPs with large indirect mediation effects are then included in the transcriptomic prediction model with the local SNPs around the gene of interest. Using simulations and real data from ROS/MAP brain tissue and TCGA breast tumors, we show considerable gains of percent variance explained (1-2% additive increase) of gene expression and TWAS power to detect gene-trait associations. This integrative approach to transcriptome-wide imputation and association studies aids in identifying the complex interactions underlying genetic regulation within a tissue and important risk genes for various traits and disorders., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
4. Consistency and overfitting of multi-omics methods on experimental data.
- Author
-
McCabe SD, Lin DY, and Love MI
- Subjects
- Computational Biology methods, Genomics methods
- Abstract
Knowledge on the relationship between different biological modalities (RNA, chromatin, etc.) can help further our understanding of the processes through which biological components interact. The ready availability of multi-omics datasets has led to the development of numerous methods for identifying sources of common variation across biological modalities. However, evaluation of the performance of these methods, in terms of consistency, has been difficult because most methods are unsupervised. We present a comparison of sparse multiple canonical correlation analysis (Sparse mCCA), angle-based joint and individual variation explained (AJIVE) and multi-omics factor analysis (MOFA) using a cross-validation approach to assess overfitting and consistency. Both large and small-sample datasets were used to evaluate performance, and a permuted null dataset was used to identify overfitting through the application of our framework and approach. In the large-sample setting, we found that all methods demonstrated consistency and lack of overfitting; however, in the small-sample size setting, AJIVE provided the most stable results. We provide an R package so that our framework and approach can be applied to evaluate other methods and datasets., (© The authors 2019. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
5. Tximeta: Reference sequence checksums for provenance identification in RNA-seq.
- Author
-
Love MI, Soneson C, Hickey PF, Johnson LK, Pierce NT, Shepherd L, Morgan M, and Patro R
- Subjects
- Algorithms, Animals, Drosophila melanogaster, Genomics, Humans, Mice, Models, Statistical, Pattern Recognition, Automated, Programming Languages, Reproducibility of Results, Software, Transcriptome, Computational Biology methods, Gene Expression Profiling, RNA-Seq
- Abstract
Correct annotation metadata is critical for reproducible and accurate RNA-seq analysis. When files are shared publicly or among collaborators with incorrect or missing annotation metadata, it becomes difficult or impossible to reproduce bioinformatic analyses from raw data. It also makes it more difficult to locate the transcriptomic features, such as transcripts or genes, in their proper genomic context, which is necessary for overlapping expression data with other datasets. We provide a solution in the form of an R/Bioconductor package tximeta that performs numerous annotation and metadata gathering tasks automatically on behalf of users during the import of transcript quantification files. The correct reference transcriptome is identified via a hashed checksum stored in the quantification output, and key transcript databases are downloaded and cached locally. The computational paradigm of automatically adding annotation metadata based on reference sequence checksums can greatly facilitate genomic workflows, by helping to reduce overhead during bioinformatic analyses, preventing costly bioinformatic mistakes, and promoting computational reproducibility. The tximeta package is available at https://bioconductor.org/packages/tximeta., Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: RP is a co-founder of Ocean Genomics.
- Published
- 2020
- Full Text
- View/download PDF
6. Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.
- Author
-
Love MI, Soneson C, and Patro R
- Subjects
- Animals, Gene Expression Profiling instrumentation, Sequence Analysis, RNA instrumentation, Computational Biology, Gene Expression Profiling methods, Gene Expression Regulation, RNA biosynthesis, RNA genetics, Sequence Analysis, RNA methods, Software
- Abstract
Detection of differential transcript usage (DTU) from RNA-seq data is an important bioinformatic analysis that complements differential gene expression analysis. Here we present a simple workflow using a set of existing R/Bioconductor packages for analysis of DTU. We show how these packages can be used downstream of RNA-seq quantification using the Salmon software package. The entire pipeline is fast, benefiting from inference steps by Salmon to quantify expression at the transcript level. The workflow includes live, runnable code chunks for analysis using DRIMSeq and DEXSeq, as well as for performing two-stage testing of DTU using the stageR package, a statistical framework to screen at the gene level and then confirm which transcripts within the significant genes show evidence of DTU. We evaluate these packages and other related packages on a simulated dataset with parameters estimated from real data., Competing Interests: No competing interests were disclosed.
- Published
- 2018
- Full Text
- View/download PDF
7. Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.
- Author
-
Love MI, Soneson C, and Patro R
- Subjects
- Animals, Gene Expression Profiling instrumentation, Sequence Analysis, RNA instrumentation, Computational Biology, Gene Expression Profiling methods, Gene Expression Regulation, RNA biosynthesis, RNA genetics, Sequence Analysis, RNA methods, Software
- Abstract
Detection of differential transcript usage (DTU) from RNA-seq data is an important bioinformatic analysis that complements differential gene expression analysis. Here we present a simple workflow using a set of existing R/Bioconductor packages for analysis of DTU. We show how these packages can be used downstream of RNA-seq quantification using the Salmon software package. The entire pipeline is fast, benefiting from inference steps by Salmon to quantify expression at the transcript level. The workflow includes live, runnable code chunks for analysis using DRIMSeq and DEXSeq, as well as for performing two-stage testing of DTU using the stageR package, a statistical framework to screen at the gene level and then confirm which transcripts within the significant genes show evidence of DTU. We evaluate these packages and other related packages on a simulated dataset with parameters estimated from real data., Competing Interests: No competing interests were disclosed.
- Published
- 2018
- Full Text
- View/download PDF
8. Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.
- Author
-
Love MI, Soneson C, and Patro R
- Subjects
- Animals, Gene Expression Profiling instrumentation, Sequence Analysis, RNA instrumentation, Computational Biology, Gene Expression Profiling methods, Gene Expression Regulation, RNA biosynthesis, RNA genetics, Sequence Analysis, RNA methods, Software
- Abstract
Detection of differential transcript usage (DTU) from RNA-seq data is an important bioinformatic analysis that complements differential gene expression analysis. Here we present a simple workflow using a set of existing R/Bioconductor packages for analysis of DTU. We show how these packages can be used downstream of RNA-seq quantification using the Salmon software package. The entire pipeline is fast, benefiting from inference steps by Salmon to quantify expression at the transcript level. The workflow includes live, runnable code chunks for analysis using DRIMSeq and DEXSeq, as well as for performing two-stage testing of DTU using the stageR package, a statistical framework to screen at the gene level and then confirm which transcripts within the significant genes show evidence of DTU. We evaluate these packages and other related packages on a simulated dataset with parameters estimated from real data., Competing Interests: No competing interests were disclosed.
- Published
- 2018
- Full Text
- View/download PDF
9. Orchestrating high-throughput genomic analysis with Bioconductor.
- Author
-
Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Obenchain V, Oleś AK, Pagès H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, and Morgan M
- Subjects
- Programming Languages, User-Computer Interface, Computational Biology, Gene Expression Profiling, Genomics methods, High-Throughput Screening Assays methods, Software
- Abstract
Bioconductor is an open-source, open-development software project for the analysis and comprehension of high-throughput data in genomics and molecular biology. The project aims to enable interdisciplinary research, collaboration and rapid development of scientific software. Based on the statistical programming language R, Bioconductor comprises 934 interoperable packages contributed by a large, diverse community of scientists. Packages cover a range of bioinformatic and statistical applications. They undergo formal initial review and continuous automated testing. We present an overview for prospective users and contributors.
- Published
- 2015
- Full Text
- View/download PDF
10. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens.
- Author
-
Li W, Xu H, Xiao T, Cong L, Love MI, Zhang F, Irizarry RA, Liu JS, Brown M, and Liu XS
- Subjects
- Cell Line, Tumor, Gene Expression Regulation, Neoplastic, Genetic Predisposition to Disease, HL-60 Cells, Humans, Algorithms, CRISPR-Associated Proteins genetics, CRISPR-Cas Systems, Computational Biology methods, Gene Knockout Techniques, Genes, Essential, Neoplasms genetics
- Abstract
We propose the Model-based Analysis of Genome-wide CRISPR/Cas9 Knockout (MAGeCK) method for prioritizing single-guide RNAs, genes and pathways in genome-scale CRISPR/Cas9 knockout screens. MAGeCK demonstrates better performance compared with existing methods, identifies both positively and negatively selected genes simultaneously, and reports robust results across different experimental conditions. Using public datasets, MAGeCK identified novel essential genes and pathways, including EGFR in vemurafenib-treated A375 cells harboring a BRAF mutation. MAGeCK also detected cell type-specific essential genes, including BCR and ABL1, in KBM7 cells bearing a BCR-ABL fusion, and IGF1R in HL-60 cells, which depends on the insulin signaling pathway for proliferation.
- Published
- 2014
- Full Text
- View/download PDF
11. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.
- Author
-
Love MI, Huber W, and Anders S
- Subjects
- Algorithms, High-Throughput Nucleotide Sequencing, Models, Genetic, Sequence Analysis, RNA, Computational Biology methods, RNA analysis, Software
- Abstract
In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. We present DESeq2, a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. The DESeq2 package is available at http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html webcite.
- Published
- 2014
- Full Text
- View/download PDF
12. Breakpointer: using local mapping artifacts to support sequence breakpoint discovery from single-end reads.
- Author
-
Sun R, Love MI, Zemojtel T, Emde AK, Chung HR, Vingron M, and Haas SA
- Subjects
- Artifacts, Humans, Algorithms, Computational Biology methods, Genomic Structural Variation, Sequence Analysis, DNA methods
- Abstract
Summary: We developed Breakpointer, a fast algorithm to locate breakpoints of structural variants (SVs) from single-end reads produced by next-generation sequencing. By taking advantage of local non-uniform read distribution and misalignments created by SVs, Breakpointer scans the alignment of single-end reads to identify regions containing potential breakpoints. The detection of such breakpoints can indicate insertions longer than the read length and SVs located in repetitve regions which might be missd by other methods. Thus, Breakpointer complements existing methods to locate SVs from single-end reads., Availability: https://github.com/ruping/Breakpointer, Contact: ruping@molgen.mpg.de, Supplementary Information: Supplementary material is available at Bioinformatics online.
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.