394 results on '"Single-Cell Analysis"'
Search Results
2. VI-VS: calibrated identification of feature dependencies in single-cell multiomics.
- Author
-
Boyeau, Pierre, Bates, Stephen, Ergen, Can, Jordan, Michael, and Yosef, Nir
- Subjects
Single-Cell Analysis ,Software ,Machine Learning ,Humans ,Genomics ,Multiomics - Abstract
Unveiling functional relationships between various molecular cell phenotypes from data using machine learning models is a key promise of multiomics. Existing methods either use flexible but hard-to-interpret models or simpler, misspecified models. VI-VS (Variational Inference for Variable Selection) balances flexibility and interpretability to identify relevant feature relationships in multiomic data. It uses deep generative models to identify conditionally dependent features, with false discovery rate control. VI-VS is available as an open-source Python package, providing a robust solution to identify features more likely representing genuine causal relationships.
- Published
- 2024
3. Visualizing scRNA-Seq data at population scale with GloScope.
- Author
-
Wang, Hao, Torous, William, Gong, Boying, and Purdom, Elizabeth
- Subjects
Batch effect detection and visualization ,Density estimation ,Single-cell sequencing data ,scRNA-Seq ,Single-Cell Analysis ,Software ,RNA-Seq ,Humans ,Computational Biology ,Animals ,Single-Cell Gene Expression Analysis - Abstract
Increasingly, scRNA-Seq studies explore cell populations across different samples and the effect of sample heterogeneity on organisms phenotype. However, relatively few bioinformatic methods have been developed which adequately address the variation between samples for such population-level analyses. We propose a framework for representing the entire single-cell profile of a sample, which we call a GloScope representation. We implement GloScope on scRNA-Seq datasets from study designs ranging from 12 to over 300 samples and demonstrate how GloScope allows researchers to perform essential bioinformatic tasks at the sample-level, in particular visualization and quality control assessment.
- Published
- 2024
4. In vivo perturb-seq of cancer and microenvironment cells dissects oncologic drivers and radiotherapy responses in glioblastoma.
- Author
-
Liu, S, Zou, Christopher, Pak, Joanna, Morse, Alexandra, Pang, Dillon, Casey-Clyde, Timothy, Borah, Ashir, Wu, David, Seo, Kyounghee, OLoughlin, Thomas, Lim, Daniel, Ozawa, Tomoko, Berger, Mitchel, Kamber, Roarke, Weiss, William, Raleigh, David, and Gilbert, Luke
- Subjects
CRISPR ,CRISPRi ,Cancer ,Functional genomics ,GBM ,Glioblastoma ,Microenvironment ,Perturb-seq ,Radiotherapy ,Glioblastoma ,Tumor Microenvironment ,Animals ,Mice ,Humans ,Brain Neoplasms ,Single-Cell Analysis ,Cell Line ,Tumor ,Gene Expression Regulation ,Neoplastic - Abstract
BACKGROUND: Genetic perturbation screens with single-cell readouts have enabled rich phenotyping of gene function and regulatory networks. These approaches have been challenging in vivo, especially in adult disease models such as cancer, which include mixtures of malignant and microenvironment cells. Glioblastoma (GBM) is a fatal cancer, and methods of systematically interrogating gene function and therapeutic targets in vivo, especially in combination with standard of care treatment such as radiotherapy, are lacking. RESULTS: Here, we iteratively develop a multiplex in vivo perturb-seq CRISPRi platform for single-cell genetic screens in cancer and tumor microenvironment cells that leverages intracranial convection enhanced delivery of sgRNA libraries into mouse models of GBM. Our platform enables potent silencing of drivers of in vivo growth and tumor maintenance as well as genes that sensitize GBM to radiotherapy. We find radiotherapy rewires transcriptional responses to genetic perturbations in an in vivo-dependent manner, revealing heterogenous patterns of treatment sensitization or resistance in GBM. Furthermore, we demonstrate targeting of genes that function in the tumor microenvironment, enabling alterations of ligand-receptor interactions between immune and stromal cells following in vivo CRISPRi perturbations that can affect tumor cell phagocytosis. CONCLUSION: In sum, we demonstrate the utility of multiplexed perturb-seq for in vivo single-cell dissection of adult cancer and normal tissue biology across multiple cell types in the context of therapeutic intervention, a platform with potential for broad application.
- Published
- 2024
5. Associating transcription factors to single-cell trajectories with DREAMIT.
- Author
-
Maulding, Nathan, Seninge, Lucas, and Stuart, Joshua
- Subjects
Single-Cell Analysis ,Transcription Factors ,Humans ,Gene Regulatory Networks ,Sequence Analysis ,RNA - Abstract
Inferring gene regulatory networks from single-cell RNA-sequencing trajectories has been an active area of research yet methods are still needed to identify regulators governing cell transitions. We developed DREAMIT (Dynamic Regulation of Expression Across Modules in Inferred Trajectories) to annotate transcription-factor activity along single-cell trajectory branches, using ensembles of relations to target genes. Using a benchmark representing several different tissues, as well as external validation with ATAC-Seq and Perturb-Seq data on hematopoietic cells, the method was found to have higher tissue-specific sensitivity and specificity over competing approaches.
- Published
- 2024
6. scCDC: a computational method for gene-specific contamination detection and correction in single-cell and single-nucleus RNA-seq data
- Author
-
Wang, Weijian, Cen, Yihui, Lu, Zezhen, Xu, Yueqing, Sun, Tianyi, Xiao, Ying, Liu, Wanlu, Li, Jingyi Jessica, and Wang, Chaochen
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Cell Nucleus ,Animals ,Humans ,Sequence Analysis ,RNA ,Computational Biology ,Software ,Single-Cell Analysis ,RNA-Seq ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
In droplet-based single-cell and single-nucleus RNA-seq assays, systematic contamination of ambient RNA molecules biases the quantification of gene expression levels. Existing methods correct the contamination for all genes globally. However, there lacks specific evaluation of correction efficacy for varying contamination levels. Here, we show that DecontX and CellBender under-correct highly contaminating genes, while SoupX and scAR over-correct lowly/non-contaminating genes. Here, we develop scCDC as the first method to detect the contamination-causing genes and only correct expression levels of these genes, some of which are cell-type markers. Compared with existing decontamination methods, scCDC excels in decontaminating highly contaminating genes while avoiding over-correction of other genes.
- Published
- 2024
7. Bento: a toolkit for subcellular analysis of spatial transcriptomics data
- Author
-
Mah, Clarence K, Ahmed, Noorsher, Lopez, Nicole A, Lam, Dylan C, Pong, Avery, Monell, Alexander, Kern, Colin, Han, Yuanyuan, Prasad, Gino, Cesnik, Anthony J, Lundberg, Emma, Zhu, Quan, Carter, Hannah, and Yeo, Gene W
- Subjects
Biochemistry and Cell Biology ,Biological Sciences ,Bioengineering ,Networking and Information Technology R&D (NITRD) ,Genetics ,Biotechnology ,1.1 Normal biological development and functioning ,Generic health relevance ,Ecosystem ,Gene Expression Profiling ,Cell Communication ,Propanolamines ,Single-Cell Analysis ,Transcriptome ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
The spatial organization of molecules in a cell is essential for their functions. While current methods focus on discerning tissue architecture, cell-cell interactions, and spatial expression patterns, they are limited to the multicellular scale. We present Bento, a Python toolkit that takes advantage of single-molecule information to enable spatial analysis at the subcellular scale. Bento ingests molecular coordinates and segmentation boundaries to perform three analyses: defining subcellular domains, annotating localization patterns, and quantifying gene-gene colocalization. We demonstrate MERFISH, seqFISH + , Molecular Cartography, and Xenium datasets. Bento is part of the open-source Scverse ecosystem, enabling integration with other single-cell analysis tools.
- Published
- 2024
8. deMULTIplex2: robust sample demultiplexing for scRNA-seq
- Author
-
Zhu, Qin, Conrad, Daniel N, and Gartner, Zev J
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Human Genome ,Genetics ,Single-Cell Gene Expression Analysis ,Single-Cell Analysis ,Algorithms ,Sequence Analysis ,RNA ,High-Throughput Nucleotide Sequencing ,scRNA-seq ,Sample multiplexing ,Demultiplex ,Generalized linear models ,Expectation-maximization ,Expectation–maximization ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Sample multiplexing enables pooled analysis during single-cell RNA sequencing workflows, thereby increasing throughput and reducing batch effects. A challenge for all multiplexing techniques is to link sample-specific barcodes with cell-specific barcodes, then demultiplex sample identity post-sequencing. However, existing demultiplexing tools fail under many real-world conditions where barcode cross-contamination is an issue. We therefore developed deMULTIplex2, an algorithm inspired by a mechanistic model of barcode cross-contamination. deMULTIplex2 employs generalized linear models and expectation-maximization to probabilistically determine the sample identity of each cell. Benchmarking reveals superior performance across various experimental conditions, particularly on large or noisy datasets with unbalanced sample compositions.
- Published
- 2024
9. Evaluation of deep learning-based feature selection for single-cell RNA sequencing data analysis.
- Author
-
Huang, Hao, Liu, Chunlei, Wagle, Manoj, and Yang, Pengyi
- Subjects
Gene Expression Profiling ,Reproducibility of Results ,Deep Learning ,Single-Cell Analysis ,Data Analysis ,Sequence Analysis ,RNA ,Cluster Analysis ,Algorithms - Abstract
BACKGROUND: Feature selection is an essential task in single-cell RNA-seq (scRNA-seq) data analysis and can be critical for gene dimension reduction and downstream analyses, such as gene marker identification and cell type classification. Most popular methods for feature selection from scRNA-seq data are based on the concept of differential distribution wherein a statistical model is used to detect changes in gene expression among cell types. Recent development of deep learning-based feature selection methods provides an alternative approach compared to traditional differential distribution-based methods in that the importance of a gene is determined by neural networks. RESULTS: In this work, we explore the utility of various deep learning-based feature selection methods for scRNA-seq data analysis. We sample from Tabula Muris and Tabula Sapiens atlases to create scRNA-seq datasets with a range of data properties and evaluate the performance of traditional and deep learning-based feature selection methods for cell type classification, feature selection reproducibility and diversity, and computational time. CONCLUSIONS: Our study provides a reference for future development and application of deep learning-based feature selection methods for single-cell omics data analyses.
- Published
- 2023
10. Single-cell isoform analysis in human immune cells
- Author
-
Volden, Roger and Vollmers, Christopher
- Subjects
Biochemistry and Cell Biology ,Bioinformatics and Computational Biology ,Biological Sciences ,Biomedical and Clinical Sciences ,Genetics ,Immunology ,Human Genome ,Generic health relevance ,Inflammatory and immune system ,Gene Expression Profiling ,High-Throughput Nucleotide Sequencing ,Humans ,Leukocytes ,Mononuclear ,Protein Isoforms ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
High-throughput single-cell analysis today is facilitated by protocols like the 10X Genomics platform or Drop-Seq which generate cDNA pools in which the origin of a transcript is encoded at its 5' or 3' end. Here, we used R2C2 to sequence and demultiplex 12 million full-length cDNA molecules generated by the 10X Genomics platform from ~3000 peripheral blood mononuclear cells. We use these reads, independent from Illumina data, to identify B cell, T cell, and monocyte clusters and generate isoform-level transcriptomes for cells and cell types. Finally, we extract paired adaptive immune receptor sequences unique to each T and B cell.
- Published
- 2022
11. ZetaSuite: computational analysis of two-dimensional high-throughput data from multi-target screens and single-cell transcriptomics
- Author
-
Hao, Yajing, Zhang, Shuyang, Shao, Changwei, Li, Junhui, Zhao, Guofeng, Zhang, Dong-Er, and Fu, Xiang-Dong
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Biotechnology ,Genetics ,Networking and Information Technology R&D (NITRD) ,Human Genome ,Cancer ,Bioengineering ,Generic health relevance ,Genomics ,High-Throughput Screening Assays ,RNA Interference ,Single-Cell Analysis ,Software ,Transcriptome ,Zeta statistics ,two-dimensional RNAi screening ,Single-cell RNA-seq ,Cancer dependency ,Cancer checkpoint ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Two-dimensional high-throughput data have become increasingly common in functional genomics studies, which raises new challenges in data analysis. Here, we introduce a new statistic called Zeta, initially developed to identify global splicing regulators from a two-dimensional RNAi screen, a high-throughput screen coupled with high-throughput functional readouts, and ZetaSuite, a software package to facilitate general application of the Zeta statistics. We compare our approach with existing methods using multiple benchmarked datasets and then demonstrate the broad utility of ZetaSuite in processing public data from large-scale cancer dependency screens and single-cell transcriptomics studies to elucidate novel biological insights.
- Published
- 2022
12. Demuxafy: improvement in droplet assignment by integrating multiple single-cell demultiplexing and doublet detection methods
- Author
-
Drew Neavin, Anne Senabouth, Himanshi Arora, Jimmy Tsz Hang Lee, Aida Ripoll-Cladellas, sc-eQTLGen Consortium, Lude Franke, Shyam Prabhakar, Chun Jimmie Ye, Davis J. McCarthy, Marta Melé, Martin Hemberg, and Joseph E. Powell
- Subjects
Single-cell analysis ,Genetic demultiplexing ,Doublet detecting ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Recent innovations in single-cell RNA-sequencing (scRNA-seq) provide the technology to investigate biological questions at cellular resolution. Pooling cells from multiple individuals has become a common strategy, and droplets can subsequently be assigned to a specific individual by leveraging their inherent genetic differences. An implicit challenge with scRNA-seq is the occurrence of doublets—droplets containing two or more cells. We develop Demuxafy, a framework to enhance donor assignment and doublet removal through the consensus intersection of multiple demultiplexing and doublet detecting methods. Demuxafy significantly improves droplet assignment by separating singlets from doublets and classifying the correct individual.
- Published
- 2024
- Full Text
- View/download PDF
13. Statistics or biology: the zero-inflation controversy about scRNA-seq data
- Author
-
Jiang, Ruochen, Sun, Tianyi, Song, Dongyuan, and Li, Jingyi Jessica
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Benchmarking ,Biology ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Exome Sequencing ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Researchers view vast zeros in single-cell RNA-seq data differently: some regard zeros as biological signals representing no or low gene expression, while others regard zeros as missing data to be corrected. To help address the controversy, here we discuss the sources of biological and non-biological zeros; introduce five mechanisms of adding non-biological zeros in computational benchmarking; evaluate the impacts of non-biological zeros on data analysis; benchmark three input data types: observed counts, imputed counts, and binarized counts; discuss the open questions regarding non-biological zeros; and advocate the importance of transparent analysis.
- Published
- 2022
14. scINSIGHT for interpreting single-cell gene expression from biologically heterogeneous data
- Author
-
Qian, Kun, Fu, Shiwei, Li, Hongwei, and Li, Wei Vivian
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Generic health relevance ,Algorithms ,Gene Expression ,Gene Expression Profiling ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Exome Sequencing ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
The increasing number of scRNA-seq data emphasizes the need for integrative analysis to interpret similarities and differences between single-cell samples. Although different batch effect removal methods have been developed, none are suitable for heterogeneous single-cell samples coming from multiple biological conditions. We propose a method, scINSIGHT, to learn coordinated gene expression patterns that are common among, or specific to, different biological conditions, and identify cellular identities and processes across single-cell samples. We compare scINSIGHT with state-of-the-art methods using simulated and real data, which demonstrate its improved performance. Our results show the applicability of scINSIGHT in diverse biomedical and clinical problems.
- Published
- 2022
15. Feature selection revisited in the single-cell era.
- Author
-
Yang, Pengyi, Huang, Hao, and Liu, Chunlei
- Subjects
Computational Biology ,Epigenomics ,Gene Expression Profiling ,High-Throughput Nucleotide Sequencing ,Humans ,Single-Cell Analysis ,Transcriptome - Abstract
Recent advances in single-cell biotechnologies have resulted in high-dimensional datasets with increased complexity, making feature selection an essential technique for single-cell data analysis. Here, we revisit feature selection techniques and summarise recent developments. We review their application to a range of single-cell data types generated from traditional cytometry and imaging technologies and the latest array of single-cell omics technologies. We highlight some of the challenges and future directions and finally consider their scalability and make general recommendations on each type of feature selection method. We hope this review stimulates future research and application of feature selection in the single-cell era.
- Published
- 2021
16. Mapping and modeling the genomic basis of differential RNA isoform expression at single-cell resolution with LR-Split-seq
- Author
-
Rebboah, Elisabeth, Reese, Fairlie, Williams, Katherine, Balderrama-Gutierrez, Gabriela, McGill, Cassandra, Trout, Diane, Rodriguez, Isaryhia, Liang, Heidi, Wold, Barbara J, and Mortazavi, Ali
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,Biotechnology ,Generic health relevance ,Animals ,Cell Differentiation ,Cell Line ,Cell Nucleus ,Chromatin ,Genomics ,Mice ,Models ,Genetic ,Myogenin ,PAX7 Transcription Factor ,RNA Isoforms ,RNA-Seq ,Single-Cell Analysis ,Transcription Initiation Site ,Transcription ,Genetic ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
The rise in throughput and quality of long-read sequencing should allow unambiguous identification of full-length transcript isoforms. However, its application to single-cell RNA-seq has been limited by throughput and expense. Here we develop and characterize long-read Split-seq (LR-Split-seq), which uses combinatorial barcoding to sequence single cells with long reads. Applied to the C2C12 myogenic system, LR-split-seq associates isoforms to cell types with relative economy and design flexibility. We find widespread evidence of changing isoform expression during differentiation including alternative transcription start sites (TSS) and/or alternative internal exon usage. LR-Split-seq provides an affordable method for identifying cluster-specific isoforms in single cells.
- Published
- 2021
17. MAAPER: model-based analysis of alternative polyadenylation using 3′ end-linked reads
- Author
-
Li, Wei Vivian, Zheng, Dinghai, Wang, Ruijia, and Tian, Bin
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Generic health relevance ,3' Untranslated Regions ,Animals ,Computational Biology ,Gene Expression ,Humans ,Mice ,Models ,Theoretical ,NIH 3T3 Cells ,Polyadenylation ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Transcriptome ,Alternative polyadenylation ,RNA sequencing ,Bioinformatic tool ,' end reads ,Cellular stress ,Trophoblasts ,3′ end reads ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Most eukaryotic genes express alternative polyadenylation (APA) isoforms. A growing number of RNA sequencing methods, especially those used for single-cell transcriptome analysis, generate reads close to the polyadenylation site (PAS), termed nearSite reads, hence inherently containing information about APA isoform abundance. Here, we present a probabilistic model-based method named MAAPER to utilize nearSite reads for APA analysis. MAAPER predicts PASs with high accuracy and sensitivity and examines different types of APA events with robust statistics. We show MAAPER's performance with both bulk and single-cell data and its applicability in unpaired or paired experimental designs.
- Published
- 2021
18. scMC learns biological variation through the alignment of multiple single-cell genomics datasets
- Author
-
Zhang, Lihua and Nie, Qing
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Human Genome ,1.4 Methodologies and measurements ,Generic health relevance ,Algorithms ,Chromatin Immunoprecipitation Sequencing ,Databases ,Genetic ,Epigenomics ,Genomics ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Transcriptome ,Single-cell genomics data ,Data integration ,Biological variation ,Technical variation ,Batch effect removal ,Single-cell genomics data ,Data integration ,Biological variation ,Technical variation ,Batch effect removal ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Distinguishing biological from technical variation is crucial when integrating and comparing single-cell genomics datasets across different experiments. Existing methods lack the capability in explicitly distinguishing these two variations, often leading to the removal of both variations. Here, we present an integration method scMC to remove the technical variation while preserving the intrinsic biological variation. scMC learns biological variation via variance analysis to subtract technical variation inferred in an unsupervised manner. Application of scMC to both simulated and real datasets from single-cell RNA-seq and ATAC-seq experiments demonstrates its capability of detecting context-shared and context-specific biological signals via accurate alignment.
- Published
- 2021
19. CellWalker integrates single-cell and bulk data to resolve regulatory elements across cell types in complex tissues
- Author
-
Przytycki, Pawel F and Pollard, Katherine S
- Subjects
Biochemistry and Cell Biology ,Bioinformatics and Computational Biology ,Genetics ,Biological Sciences ,Neurosciences ,Brain Disorders ,Human Genome ,Underpinning research ,1.1 Normal biological development and functioning ,Chromatin Immunoprecipitation Sequencing ,Computational Biology ,Gene Expression Regulation ,Genetic Loci ,Molecular Sequence Annotation ,Organ Specificity ,Regulatory Sequences ,Nucleic Acid ,Reproducibility of Results ,Single-Cell Analysis ,Software ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Single-cell and bulk genomics assays have complementary strengths and weaknesses, and alone neither strategy can fully capture regulatory elements across the diversity of cells in complex tissues. We present CellWalker, a method that integrates single-cell open chromatin (scATAC-seq) data with gene expression (RNA-seq) and other data types using a network model that simultaneously improves cell labeling in noisy scATAC-seq and annotates cell type-specific regulatory elements in bulk data. We demonstrate CellWalker's robustness to sparse annotations and noise using simulations and combined RNA-seq and ATAC-seq in individual cells. We then apply CellWalker to the developing brain. We identify cells transitioning between transcriptional states, resolve regulatory elements to cell types, and observe that autism and other neurological traits can be mapped to specific cell types through their regulatory elements.
- Published
- 2021
20. scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured.
- Author
-
Sun, Tianyi, Song, Dongyuan, Li, Wei Vivian, and Li, Jingyi Jessica
- Subjects
Goblet Cells ,Animals ,Humans ,Mice ,Calibration ,Cell Count ,Cluster Analysis ,Genomics ,Gene Expression Regulation ,Computer Simulation ,Software ,Single-Cell Analysis ,RNA-Seq ,Mental Health ,Genetics ,Biotechnology ,Generic health relevance ,Environmental Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
A pressing challenge in single-cell transcriptomics is to benchmark experimental protocols and computational methods. A solution is to use computational simulators, but existing simulators cannot simultaneously achieve three goals: preserving genes, capturing gene correlations, and generating any number of cells with varying sequencing depths. To fill this gap, we propose scDesign2, a transparent simulator that achieves all three goals and generates high-fidelity synthetic data for multiple single-cell gene expression count-based technologies. In particular, scDesign2 is advantageous in its transparent use of probabilistic models and its ability to capture gene correlations via copulas.
- Published
- 2021
21. PseudotimeDE: inference of differential gene expression along cell pseudotime with well-calibrated p-values from single-cell RNA sequencing data
- Author
-
Song, Dongyuan and Li, Jingyi Jessica
- Subjects
Biotechnology ,Human Genome ,Genetics ,Generic health relevance ,Algorithms ,Cell Lineage ,Computational Biology ,Gene Expression Profiling ,Gene Expression Regulation ,Developmental ,Gene Ontology ,High-Throughput Nucleotide Sequencing ,Organ Specificity ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Transcriptome ,Environmental Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
To investigate molecular mechanisms underlying cell state changes, a crucial analysis is to identify differentially expressed (DE) genes along the pseudotime inferred from single-cell RNA-sequencing data. However, existing methods do not account for pseudotime inference uncertainty, and they have either ill-posed p-values or restrictive models. Here we propose PseudotimeDE, a DE gene identification method that adapts to various pseudotime inference methods, accounts for pseudotime inference uncertainty, and outputs well-calibrated p-values. Comprehensive simulations and real-data applications verify that PseudotimeDE outperforms existing methods in false discovery rate control and power.
- Published
- 2021
22. Clipper: p-value-free FDR control on high-throughput data from two conditions
- Author
-
Ge, Xinzhou, Chen, Yiling Elaine, Song, Dongyuan, McDermott, MeiLu, Woyshner, Kyla, Manousopoulou, Antigoni, Wang, Ning, Li, Wei, Wang, Leo D, and Li, Jingyi Jessica
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Chromatin Immunoprecipitation Sequencing ,Chromosomes ,Computer Simulation ,Data Interpretation ,Statistical ,High-Throughput Nucleotide Sequencing ,Humans ,Mass Spectrometry ,Peptides ,Proteomics ,RNA-Seq ,Single-Cell Analysis ,Software ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
High-throughput biological data analysis commonly involves identifying features such as genes, genomic regions, and proteins, whose values differ between two conditions, from numerous features measured simultaneously. The most widely used criterion to ensure the analysis reliability is the false discovery rate (FDR), which is primarily controlled based on p-values. However, obtaining valid p-values relies on either reasonable assumptions of data distribution or large numbers of replicates under both conditions. Clipper is a general statistical framework for FDR control without relying on p-values or specific data distributions. Clipper outperforms existing methods for a broad range of applications in high-throughput data analysis.
- Published
- 2021
23. scAI: an unsupervised approach for the integrative analysis of parallel single-cell transcriptomic and epigenomic profiles
- Author
-
Jin, Suoqin, Zhang, Lihua, and Nie, Qing
- Subjects
Biochemistry and Cell Biology ,Biological Sciences ,Human Genome ,Genetics ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,A549 Cells ,Epigenome ,Gene Expression Profiling ,Genetic Heterogeneity ,Genomics ,Humans ,Single-Cell Analysis ,Transcriptome ,Unsupervised Machine Learning ,Integrative analysis ,Single-cell multiomics ,Simultaneous measurements ,Sparse epigenomic profile ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Simultaneous measurements of transcriptomic and epigenomic profiles in the same individual cells provide an unprecedented opportunity to understand cell fates. However, effective approaches for the integrative analysis of such data are lacking. Here, we present a single-cell aggregation and integration (scAI) method to deconvolute cellular heterogeneity from parallel transcriptomic and epigenomic profiles. Through iterative learning, scAI aggregates sparse epigenomic signals in similar cells learned in an unsupervised manner, allowing coherent fusion with transcriptomic measurements. Simulation studies and applications to three real datasets demonstrate its capability of dissecting cellular heterogeneity within both transcriptomic and epigenomic layers and understanding transcriptional regulatory mechanisms.
- Published
- 2020
24. Inference of single-cell phylogenies from lineage tracing data using Cassiopeia
- Author
-
Jones, Matthew G, Khodaverdian, Alex, Quinn, Jeffrey J, Chan, Michelle M, Hussmann, Jeffrey A, Wang, Robert, Xu, Chenling, Weissman, Jonathan S, and Yosef, Nir
- Subjects
Information and Computing Sciences ,Biological Sciences ,Genetics ,Bioengineering ,Generic health relevance ,Algorithms ,CRISPR-Cas Systems ,Cell Lineage ,Humans ,Mutation ,Phylogeny ,Single-Cell Analysis ,scRNA-seq ,Single cell ,Lineage tracing ,CRISPR ,Environmental Sciences ,Bioinformatics - Abstract
The pairing of CRISPR/Cas9-based gene editing with massively parallel single-cell readouts now enables large-scale lineage tracing. However, the rapid growth in complexity of data from these assays has outpaced our ability to accurately infer phylogenetic relationships. First, we introduce Cassiopeia-a suite of scalable maximum parsimony approaches for tree reconstruction. Second, we provide a simulation framework for evaluating algorithms and exploring lineage tracer design principles. Finally, we generate the most complex experimental lineage tracing dataset to date, 34,557 human cells continuously traced over 15 generations, and use it for benchmarking phylogenetic inference approaches. We show that Cassiopeia outperforms traditional methods by several metrics and under a wide variety of parameter regimes, and provide insight into the principles for the design of improved Cas9-enabled recorders. Together, these should broadly enable large-scale mammalian lineage tracing efforts. Cassiopeia and its benchmarking resources are publicly available at www.github.com/YosefLab/Cassiopeia.
- Published
- 2020
25. Linked optical and gene expression profiling of single cells at high-throughput
- Author
-
Zhang, Jesse Q, Siltanen, Christian A, Liu, Leqian, Chang, Kai-Chun, Gartner, Zev J, and Abate, Adam R
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Cancer Genomics ,Biotechnology ,Cancer ,Human Genome ,1.1 Normal biological development and functioning ,2.1 Biological and endogenous factors ,Generic health relevance ,3T3 Cells ,Animals ,Cells ,Cultured ,Flow Cytometry ,Gene Expression Profiling ,HEK293 Cells ,Humans ,Mice ,Microfluidics ,Nanopore Sequencing ,RNA-Seq ,Single-Cell Analysis ,Index sorting ,Single-cell RNA sequencing ,Flow cytometry ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Single-cell RNA sequencing has emerged as a powerful tool for characterizing cells, but not all phenotypes of interest can be observed through changes in gene expression. Linking sequencing with optical analysis has provided insight into the molecular basis of cellular function, but current approaches have limited throughput. Here, we present a high-throughput platform for linked optical and gene expression profiling of single cells. We demonstrate accurate fluorescence and gene expression measurements on thousands of cells in a single experiment. We use the platform to characterize DNA and RNA changes through the cell cycle and correlate antibody fluorescence with gene expression. The platform's ability to isolate rare cell subsets and perform multiple measurements, including fluorescence and sequencing-based analysis, holds potential for scalable multi-modal single-cell analysis.
- Published
- 2020
26. scAlign: a tool for alignment, integration, and rare cell identification from scRNA-seq data
- Author
-
Johansen, Nelson and Quon, Gerald
- Subjects
Biological Sciences ,Genetics ,Vector-Borne Diseases ,Rare Diseases ,Good Health and Well Being ,Animals ,Biomarkers ,Cluster Analysis ,Gene Expression Regulation ,Germ Cells ,Humans ,Islets of Langerhans ,Mice ,Inbred C57BL ,Plasmodium falciparum ,Principal Component Analysis ,Sequence Alignment ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Software ,scRNA-seq ,Data integration ,Data harmonization ,Alignment ,Deep learning ,Neural networks ,Response to stimulus ,Batch effects ,Domain adaptation ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
scRNA-seq dataset integration occurs in different contexts, such as the identification of cell type-specific differences in gene expression across conditions or species, or batch effect correction. We present scAlign, an unsupervised deep learning method for data integration that can incorporate partial, overlapping, or a complete set of cell labels, and estimate per-cell differences in gene expression across datasets. scAlign performance is state-of-the-art and robust to cross-dataset variation in cell type-specific expression and cell type composition. We demonstrate that scAlign reveals gene expression programs for rare populations of malaria parasites. Our framework is widely applicable to integration challenges in other domains.
- Published
- 2019
27. scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data
- Author
-
Li, Ruoxin and Quon, Gerald
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Biotechnology ,Prevention ,Human Genome ,Gene Expression Profiling ,Genomics ,Models ,Genetic ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Software ,scRNA-seq ,Dimensionality reduction ,scATAC-seq ,Technical noise ,Gene detection ,Gene quantification ,Cell type identification ,Trajectory inference ,Variable gene selection ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Technical variation in feature measurements, such as gene expression and locus accessibility, is a key challenge of large-scale single-cell genomic datasets. We show that this technical variation in both scRNA-seq and scATAC-seq datasets can be mitigated by analyzing feature detection patterns alone and ignoring feature quantification measurements. This result holds when datasets have low detection noise relative to quantification noise. We demonstrate state-of-the-art performance of detection pattern models using our new framework, scBFA, for both cell type identification and trajectory inference. Performance gains can also be realized in one line of R code in existing pipelines.
- Published
- 2019
28. Refining colorectal cancer classification and clinical stratification through a single-cell atlas
- Author
-
Ateeq M. Khaliq, Cihat Erdogan, Zeyneb Kurt, Sultan Sevgi Turgut, Miles W. Grunvald, Tim Rand, Sonal Khare, Jeffrey A. Borgia, Dana M. Hayden, Sam G. Pappas, Henry R. Govekar, Audrey E. Kam, Jochen Reiser, Kiran Turaga, Milan Radovich, Yong Zang, Yingjie Qiu, Yunlong Liu, Melissa L. Fishel, Anita Turk, Vineet Gupta, Ram Al-Sabti, Janakiraman Subramanian, Timothy M. Kuzel, Anguraj Sadanandam, Levi Waldron, Arif Hussain, Mohammad Saleem, Bassel El-Rayes, Ameen A. Salahudeen, and Ashiq Masood
- Subjects
Cancer-associated fibroblast ,CMS classification ,Colorectal cancer ,Single-cell analysis ,Immunotherapy ,Stromal signatures ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Background Colorectal cancer (CRC) consensus molecular subtypes (CMS) have different immunological, stromal cell, and clinicopathological characteristics. Single-cell characterization of CMS subtype tumor microenvironments is required to elucidate mechanisms of tumor and stroma cell contributions to pathogenesis which may advance subtype-specific therapeutic development. We interrogate racially diverse human CRC samples and analyze multiple independent external cohorts for a total of 487,829 single cells enabling high-resolution depiction of the cellular diversity and heterogeneity within the tumor and microenvironmental cells. Results Tumor cells recapitulate individual CMS subgroups yet exhibit significant intratumoral CMS heterogeneity. Both CMS1 microsatellite instability (MSI-H) CRCs and microsatellite stable (MSS) CRC demonstrate similar pathway activations at the tumor epithelial level. However, CD8+ cytotoxic T cell phenotype infiltration in MSI-H CRCs may explain why these tumors respond to immune checkpoint inhibitors. Cellular transcriptomic profiles in CRC exist in a tumor immune stromal continuum in contrast to discrete subtypes proposed by studies utilizing bulk transcriptomics. We note a dichotomy in tumor microenvironments across CMS subgroups exists by which patients with high cancer-associated fibroblasts (CAFs) and C1Q+TAM content exhibit poor outcomes, providing a higher level of personalization and precision than would distinct subtypes. Additionally, we discover CAF subtypes known to be associated with immunotherapy resistance. Conclusions Distinct CAFs and C1Q+ TAMs are sufficient to explain CMS predictive ability and a simpler signature based on these cellular phenotypes could stratify CRC patient prognosis with greater precision. Therapeutically targeting specific CAF subtypes and C1Q + TAMs may promote immunotherapy responses in CRC patients.
- Published
- 2022
- Full Text
- View/download PDF
29. A Reproducibility-Based Computational Framework Identifies an Inducible, Enhanced Antiviral State in Dendritic Cells from HIV-1 Elite Controllers
- Author
-
Martin-Gayo, Enrique, Cole, Michael B, Kolb, Kellie E, Ouyang, Zhengyu, Cronin, Jacqueline, Kazer, Samuel W, Ordovas-Montanes, Jose, Lichterfeld, Mathias, Walker, Bruce D, Yosef, Nir, Shalek, Alex K, and Yu, Xu G
- Subjects
Medical Microbiology ,Biomedical and Clinical Sciences ,Immunology ,HIV/AIDS ,Infectious Diseases ,Clinical Research ,Vaccine Related ,Genetics ,Inflammatory and immune system ,Infection ,Biomarkers ,Dendritic Cells ,Gene Expression Profiling ,Genomics ,HIV Infections ,HIV-1 ,Humans ,Reproducibility of Results ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Dendritic cell ,Single-cell RNA-seq ,Single-cell genomics ,Elite controller ,Adjuvant ,Reproducibility ,Differential expression ,Environmental Sciences ,Biological Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundHuman immunity relies on the coordinated responses of many cellular subsets and functional states. Inter-individual variations in cellular composition and communication could thus potentially alter host protection. Here, we explore this hypothesis by applying single-cell RNA-sequencing to examine viral responses among the dendritic cells (DCs) of three elite controllers (ECs) of HIV-1 infection.ResultsTo overcome the potentially confounding effects of donor-to-donor variability, we present a generally applicable computational framework for identifying reproducible patterns in gene expression across donors who share a unifying classification. Applying it, we discover a highly functional antiviral DC state in ECs whose fractional abundance after in vitro exposure to HIV-1 correlates with higher CD4+ T cell counts and lower HIV-1 viral loads, and that effectively primes polyfunctional T cell responses in vitro. By integrating information from existing genomic databases into our reproducibility-based analysis, we identify and validate select immunomodulators that increase the fractional abundance of this state in primary peripheral blood mononuclear cells from healthy individuals in vitro.ConclusionsOverall, our results demonstrate how single-cell approaches can reveal previously unappreciated, yet important, immune behaviors and empower rational frameworks for modulating systems-level immune responses that may prove therapeutically and prophylactically useful.
- Published
- 2018
30. Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications
- Author
-
Van den Berge, Koen, Perraudeau, Fanny, Soneson, Charlotte, Love, Michael I, Risso, Davide, Vert, Jean-Philippe, Robinson, Mark D, Dudoit, Sandrine, and Clement, Lieven
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Gene Expression Profiling ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Software ,Single-cell RNA sequencing ,Differential expression ,Zero-inflated negative binomial ,Weights ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
Dropout events in single-cell RNA sequencing (scRNA-seq) cause many transcripts to go undetected and induce an excess of zero read counts, leading to power issues in differential expression (DE) analysis. This has triggered the development of bespoke scRNA-seq DE methods to cope with zero inflation. Recent evaluations, however, have shown that dedicated scRNA-seq tools provide no advantage compared to traditional bulk RNA-seq tools. We introduce a weighting strategy, based on a zero-inflated negative binomial model, that identifies excess zero counts and generates gene- and cell-specific weights to unlock bulk RNA-seq DE pipelines for zero-inflated data, boosting performance for scRNA-seq.
- Published
- 2018
31. Enhancer regulatory networks globally connect non-coding breast cancer loci to cancer genes.
- Author
-
Wang Y, Armendariz DA, Wang L, Zhao H, Xie S, and Hon GC
- Subjects
- Humans, Female, Cell Line, Tumor, Single-Cell Analysis, Genes, Neoplasm, Breast Neoplasms genetics, Enhancer Elements, Genetic, Gene Regulatory Networks, Gene Expression Regulation, Neoplastic
- Abstract
Background: Genetic studies have associated thousands of enhancers with breast cancer (BC). However, the vast majority have not been functionally characterized. Thus, it remains unclear how BC-associated enhancers contribute to cancer., Results: Here, we perform single-cell CRISPRi screens of 3513 regulatory elements associated with breast cancer to measure the impact of these regions on transcriptional phenotypes. Analysis of > 500,000 single-cell transcriptomes in two breast cancer cell lines shows that perturbation of BC-associated enhancers disrupts breast cancer gene programs. We observe BC-associated enhancers that directly or indirectly regulate the expression of cancer genes. We also find one-to-multiple and multiple-to-one network motifs where enhancers indirectly regulate cancer genes. Notably, multiple BC-associated enhancers indirectly regulate TP53. Comparative studies illustrate subtype specific functions between enhancers in ER + and ER - cells. Finally, we develop the pySpade package to facilitate analysis of single-cell enhancer screens., Conclusions: Overall, we demonstrate that enhancers form regulatory networks that link cancer genes in the genome, providing a more comprehensive understanding of the contribution of enhancers to breast cancer development., Competing Interests: Declarations. Ethics approval and consent to participate: Not applicable. Competing interests: Since 20 April 2020, S.X. has been an employee of Genentech and has equity in Roche. The remaining authors declare no competing interests., (© 2025. The Author(s).)
- Published
- 2025
- Full Text
- View/download PDF
32. Spatiotemporal dynamics of early oogenesis in pigs.
- Author
-
Ge W, Niu YL, Li YK, Li L, Wang H, Li WW, Qiao T, Feng YN, Feng YQ, Liu J, Wang JJ, Sun XF, Cheng SF, Li L, and Shen W
- Subjects
- Animals, Female, Swine, Humans, Single-Cell Analysis, Transcriptome, Ovary metabolism, Ovary cytology, Cell Communication, Spatio-Temporal Analysis, Granulosa Cells metabolism, Granulosa Cells cytology, Oogenesis
- Abstract
Background: In humans and other mammals, the process of oogenesis initiates asynchronously in specific ovarian regions, leading to the localization of dormant and growing follicles in the cortex and medulla, respectively; however, the current understanding of this process remains insufficient., Results: Here, we integrate single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) to comprehend spatial-temporal gene expression profiles and explore the spatial organization of ovarian microenvironments during early oogenesis in pigs. Projection of the germ cell clusters at different stages of oogenesis into the spatial atlas unveils a "cortical to medullary (C-M)" distribution of germ cells in the developing porcine ovaries. Cross-species analysis between pigs and humans unveils a conserved C-M distribution pattern of germ cells during oogenesis, highlighting the utility of pigs as valuable models for studying human oogenesis in a spatial context. RNA velocity analysis with ST identifies the molecular characteristics and spatial dynamics of granulosa cell lineages originating from the cortical and medullary regions in pig ovaries. Spatial co-occurrence analysis and intercellular communication analysis unveils a distinct cell-cell communication pattern between germ cells and somatic cells in the cortex and medulla regions. Notably, in vitro culture of ovarian tissues verifies that intercellular NOTCH signaling and extracellular matrix (ECM) proteins played crucial roles in initiating meiotic and oogenic programs, highlighting an underappreciated role of ovarian microenvironments in orchestrating germ cell fates., Conclusions: Overall, our work provides insight into the spatial characteristics of early oogenesis and the regulatory role of ovarian microenvironments in germ cell fate within a spatial context., Competing Interests: Declarations. Ethics approval and consent to participate: All experimental procedures involving animal experiments were approved by the Ethics Committee of Qingdao Agricultural University (No. SYXK-20220–021). Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interests., (© 2024. The Author(s).)
- Published
- 2025
- Full Text
- View/download PDF
33. scStateDynamics: deciphering the drug-responsive tumor cell state dynamics by modeling single-cell level expression changes.
- Author
-
Guo W, Li X, Wang D, Yan N, Hu Q, Yang F, Zhang X, Yao J, and Gu J
- Subjects
- Humans, Antineoplastic Agents pharmacology, Antineoplastic Agents therapeutic use, Transcriptome, Gene Expression Regulation, Neoplastic, Drug Resistance, Neoplasm genetics, Gene Expression Profiling, Single-Cell Analysis, Algorithms, Neoplasms genetics, Neoplasms drug therapy, Neoplasms metabolism
- Abstract
Understanding tumor cell heterogeneity and plasticity is crucial for overcoming drug resistance. Single-cell technologies enable analyzing cell states at a given condition, but catenating static cell snapshots to characterize dynamic drug responses remains challenging. Here, we propose scStateDynamics, an algorithm to infer tumor cell state dynamics and identify common drug effects by modeling single-cell level gene expression changes. Its reliability is validated on both simulated and lineage tracing data. Application to real tumor drug treatment datasets identifies more subtle cell subclusters with different drug responses beyond static transcriptome similarity and disentangles drug action mechanisms from the cell-level expression changes., Competing Interests: Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors Fan Yang and Jianhua Yao are from a commercial company (AI Lab, Tencent, Shenzhen, China). All the other authors declare no competing interests., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
34. CaClust: linking genotype to transcriptional heterogeneity of follicular lymphoma using BCR and exomic variants.
- Author
-
Oksza-Orzechowski K, Quinten E, Shafighi S, Kiełbasa SM, van Kessel HW, de Groen RAL, Vermaat JSP, Sepúlveda Yáñez JH, Navarrete MA, Veelken H, van Bergen CAM, and Szczurek E
- Subjects
- Humans, Single-Cell Analysis, Mutation, Lymphoma, Follicular genetics, Genotype, Receptors, Antigen, B-Cell genetics, Genetic Heterogeneity
- Abstract
Tumours exhibit high genotypic and transcriptional heterogeneity. Both affect cancer progression and treatment, but have been predominantly studied separately in follicular lymphoma. To comprehensively investigate the evolution and genotype-to-phenotype maps in follicular lymphoma, we introduce CaClust, a probabilistic graphical model integrating deep whole exome, single-cell RNA and B-cell receptor sequencing data to infer clone genotypes, cell-to-clone mapping, and single-cell genotyping. CaClust outperforms a state-of-the-art model on simulated and patient data. In-depth analyses of single cells from four samples showcase effects of driver mutations, follicular lymphoma evolution, possible therapeutic targets, and single-cell genotyping that agrees with an independent targeted resequencing experiment., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
35. Single-cell profiling of human gliomas reveals macrophage ontogeny as a basis for regional differences in macrophage activation in the tumor microenvironment.
- Author
-
Müller, Sören, Kohanbash, Gary, Liu, S John, Alvarado, Beatriz, Carrera, Diego, Bhaduri, Aparna, Watchmaker, Payal B, Yagnik, Garima, Di Lullo, Elizabeth, Malatesta, Martina, Amankulor, Nduka M, Kriegstein, Arnold R, Lim, Daniel A, Aghi, Manish, Okada, Hideho, and Diaz, Aaron
- Subjects
Macrophages ,Animals ,Humans ,Mice ,Glioma ,Prognosis ,Immunotherapy ,Survival Analysis ,Gene Expression Profiling ,Computational Biology ,Macrophage Activation ,Gene Expression Regulation ,Neoplastic ,Single-Cell Analysis ,High-Throughput Nucleotide Sequencing ,Tumor Microenvironment ,Transcriptome ,Gene Ontology ,Macrophage ,Single-cell sequencing ,Gene Expression Regulation ,Neoplastic ,Bioinformatics ,Environmental Sciences ,Biological Sciences ,Information and Computing Sciences - Abstract
BACKGROUND:Tumor-associated macrophages (TAMs) are abundant in gliomas and immunosuppressive TAMs are a barrier to emerging immunotherapies. It is unknown to what extent macrophages derived from peripheral blood adopt the phenotype of brain-resident microglia in pre-treatment gliomas. The relative proportions of blood-derived macrophages and microglia have been poorly quantified in clinical samples due to a paucity of markers that distinguish these cell types in malignant tissue. RESULTS:We perform single-cell RNA-sequencing of human gliomas and identify phenotypic differences in TAMs of distinct lineages. We isolate TAMs from patient biopsies and compare them with macrophages from non-malignant human tissue, glioma atlases, and murine glioma models. We present a novel signature that distinguishes TAMs by ontogeny in human gliomas. Blood-derived TAMs upregulate immunosuppressive cytokines and show an altered metabolism compared to microglial TAMs. They are also enriched in perivascular and necrotic regions. The gene signature of blood-derived TAMs, but not microglial TAMs, correlates with significantly inferior survival in low-grade glioma. Surprisingly, TAMs frequently co-express canonical pro-inflammatory (M1) and alternatively activated (M2) genes in individual cells. CONCLUSIONS:We conclude that blood-derived TAMs significantly infiltrate pre-treatment gliomas, to a degree that varies by glioma subtype and tumor compartment. Blood-derived TAMs do not universally conform to the phenotype of microglia, but preferentially express immunosuppressive cytokines and show an altered metabolism. Our results argue against status quo therapeutic strategies that target TAMs indiscriminately and in favor of strategies that specifically target immunosuppressive blood-derived TAMs.
- Published
- 2017
36. Single-cell epigenomic variability reveals functional cancer heterogeneity
- Author
-
Litzenburger, Ulrike M, Buenrostro, Jason D, Wu, Beijing, Shen, Ying, Sheffield, Nathan C, Kathiria, Arwa, Greenleaf, William J, and Chang, Howard Y
- Subjects
Biochemistry and Cell Biology ,Biomedical and Clinical Sciences ,Biological Sciences ,Cancer Genomics ,Human Genome ,Cancer ,Genetics ,2.1 Biological and endogenous factors ,Antigens ,Surface ,Biomarkers ,Cell Line ,Tumor ,Epigenesis ,Genetic ,Epigenomics ,Genetic Heterogeneity ,Genetic Variation ,High-Throughput Nucleotide Sequencing ,Humans ,Immunophenotyping ,K562 Cells ,Neoplasms ,Nucleotide Motifs ,Reproducibility of Results ,Single-Cell Analysis ,Cancer stem cells ,Gene expression noise ,Open chromatin ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundCell-to-cell heterogeneity is a major driver of cancer evolution, progression, and emergence of drug resistance. Epigenomic variation at the single-cell level can rapidly create cancer heterogeneity but is difficult to detect and assess functionally.ResultsWe develop a strategy to bridge the gap between measurement and function in single-cell epigenomics. Using single-cell chromatin accessibility and RNA-seq data in K562 leukemic cells, we identify the cell surface marker CD24 as co-varying with chromatin accessibility changes linked to GATA transcription factors in single cells. Fluorescence-activated cell sorting of CD24 high versus low cells prospectively isolated GATA1 and GATA2 high versus low cells. GATA high versus low cells express differential gene regulatory networks, differential sensitivity to the drug imatinib mesylate, and differential self-renewal capacity. Lineage tracing experiments show that GATA/CD24hi cells have the capability to rapidly reconstitute the heterogeneity within the entire starting population, suggesting that GATA expression levels drive a phenotypically relevant source of epigenomic plasticity.ConclusionSingle-cell chromatin accessibility can guide prospective characterization of cancer heterogeneity. Epigenomic subpopulations in cancer impact drug sensitivity and the clonal dynamics of cancer evolution.
- Published
- 2017
37. Simultaneous profiling of transcriptome and DNA methylome from a single cell
- Author
-
Hu, Youjin, Huang, Kevin, An, Qin, Du, Guizhen, Hu, Ganlu, Xue, Jinfeng, Zhu, Xianmin, Wang, Cun-Yu, Xue, Zhigang, and Fan, Guoping
- Subjects
Biological Sciences ,Bioinformatics and Computational Biology ,Genetics ,Cancer ,Cancer Genomics ,Biotechnology ,Human Genome ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Generic health relevance ,Animals ,Cells ,Cultured ,CpG Islands ,DNA Methylation ,Gene Expression Profiling ,Mice ,Mice ,Inbred C57BL ,Neurons ,Polymorphism ,Single Nucleotide ,Promoter Regions ,Genetic ,Sequence Analysis ,DNA ,Sequence Analysis ,RNA ,Single-Cell Analysis ,Transcriptome ,Single-cell methylome ,Single-cell transcriptome ,Sensory neurons ,Dorsal root ganglion ,Gene regulation ,Environmental Sciences ,Information and Computing Sciences ,Bioinformatics - Abstract
BackgroundSingle-cell transcriptome and single-cell methylome technologies have become powerful tools to study RNA and DNA methylation profiles of single cells at a genome-wide scale. A major challenge has been to understand the direct correlation of DNA methylation and gene expression within single-cells. Due to large cell-to-cell variability and the lack of direct measurements of transcriptome and methylome of the same cell, the association is still unclear.ResultsHere, we describe a novel method (scMT-seq) that simultaneously profiles both DNA methylome and transcriptome from the same cell. In sensory neurons, we consistently identify transcriptome and methylome heterogeneity among single cells but the majority of the expression variance is not explained by proximal promoter methylation, with the exception of genes that do not contain CpG islands. By contrast, gene body methylation is positively associated with gene expression for only those genes that contain a CpG island promoter. Furthermore, using single nucleotide polymorphism patterns from our hybrid mouse model, we also find positive correlation of allelic gene body methylation with allelic expression.ConclusionsOur method can be used to detect transcriptome, methylome, and single nucleotide polymorphism information within single cells to dissect the mechanisms of epigenetic gene regulation.
- Published
- 2016
38. Mapping lineage-traced cells across time points with moslin.
- Author
-
Lange M, Piran Z, Klein M, Spanjaard B, Klein D, Junker JP, Theis FJ, and Nitzan M
- Subjects
- Animals, Embryonic Development, Heart physiology, Regeneration, Single-Cell Analysis, Embryo, Nonmammalian, Caenorhabditis elegans genetics, Caenorhabditis elegans growth & development, Cell Lineage, Zebrafish genetics, Zebrafish physiology
- Abstract
Simultaneous profiling of single-cell gene expression and lineage history holds enormous potential for studying cellular decision-making. Recent computational approaches combine both modalities into cellular trajectories; however, they cannot make use of all available lineage information in destructive time-series experiments. Here, we present moslin, a Gromov-Wasserstein-based model to couple cellular profiles across time points based on lineage and gene expression information. We validate our approach in simulations and demonstrate on Caenorhabditis elegans embryonic development how moslin predicts fate probabilities and putative decision driver genes. Finally, we use moslin to delineate lineage relationships among transiently activated fibroblast states during zebrafish heart regeneration., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
39. Scalable identification of lineage-specific gene regulatory networks from metacells with NetID.
- Author
-
Wang W, Wang Y, Lyu R, and Grün D
- Subjects
- Humans, Hematopoiesis genetics, Cell Differentiation genetics, Animals, Transcriptome, Gene Regulatory Networks, Single-Cell Analysis, Cell Lineage genetics
- Abstract
The identification of gene regulatory networks (GRNs) is crucial for understanding cellular differentiation. Single-cell RNA sequencing data encode gene-level covariations at high resolution, yet data sparsity and high dimensionality hamper accurate and scalable GRN reconstruction. To overcome these challenges, we introduce NetID leveraging homogenous metacells while avoiding spurious gene-gene correlations. Benchmarking demonstrates superior performance of NetID compared to imputation-based methods. By incorporating cell fate probability information, NetID facilitates the prediction of lineage-specific GRNs and recovers known network motifs governing bone marrow hematopoiesis, making it a powerful toolkit for deciphering gene regulatory control of cellular differentiation from large-scale single-cell transcriptome data., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
40. Characterization of regeneration initiating cells during Xenopus laevis tail regeneration.
- Author
-
Sindelka R, Naraine R, Abaffy P, Zucha D, Kraus D, Netusil J, Smetana K Jr, Lacina L, Endaya BB, Neuzil J, Psenicka M, and Kubista M
- Subjects
- Animals, Transcriptome, Single-Cell Analysis, Extracellular Matrix metabolism, Wound Healing, Xenopus laevis, Regeneration, Tail
- Abstract
Background: Embryos are regeneration and wound healing masters. They rapidly close wounds and scarlessly remodel and regenerate injured tissue. Regeneration has been extensively studied in many animal models using new tools such as single-cell analysis. However, until now, they have been based primarily on experiments assessing from 1 day post injury., Results: In this paper, we reveal that critical steps initiating regeneration occur within hours after injury. We discovered the regeneration initiating cells (RICs) using single-cell and spatial transcriptomics of the regenerating Xenopus laevis tail. RICs are formed transiently from the basal epidermal cells, and their expression signature suggests they are important for modifying the surrounding extracellular matrix thus regulating development. The absence or deregulation of RICs leads to excessive extracellular matrix deposition and defective regeneration., Conclusion: RICs represent a newly discovered transient cell state involved in the initiation of the regeneration process., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
41. DeepKINET: a deep generative model for estimating single-cell RNA splicing and degradation rates.
- Author
-
Mizukoshi C, Kojima Y, Nomura S, Hayashi S, Abe K, and Shimamura T
- Subjects
- Humans, Breast Neoplasms genetics, Breast Neoplasms metabolism, RNA Stability, Prosencephalon metabolism, RNA-Binding Proteins metabolism, RNA-Binding Proteins genetics, Animals, Female, Single-Cell Analysis, RNA Splicing
- Abstract
Messenger RNA splicing and degradation are critical for gene expression regulation, the abnormality of which leads to diseases. Previous methods for estimating kinetic rates have limitations, assuming uniform rates across cells. DeepKINET is a deep generative model that estimates splicing and degradation rates at single-cell resolution from scRNA-seq data. DeepKINET outperforms existing methods on simulated and metabolic labeling datasets. Applied to forebrain and breast cancer data, it identifies RNA-binding proteins responsible for kinetic rate diversity. DeepKINET also analyzes the effects of splicing factor mutations on target genes in erythroid lineage cells. DeepKINET effectively reveals cellular heterogeneity in post-transcriptional regulation., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
42. Enhlink infers distal and context-specific enhancer-promoter linkages.
- Author
-
Poirion OB, Zuo W, Spruce C, Baker CN, Daigle SL, Olson A, Skelly DA, Chesler EJ, Baker CL, and White BS
- Subjects
- Animals, Mice, Software, Quantitative Trait Loci, Corpus Striatum metabolism, Single-Cell Analysis, Enhancer Elements, Genetic, Promoter Regions, Genetic
- Abstract
Enhlink is a computational tool for scATAC-seq data analysis, facilitating precise interrogation of enhancer function at the single-cell level. It employs an ensemble approach incorporating technical and biological covariates to infer condition-specific regulatory DNA linkages. Enhlink can integrate multi-omic data for enhanced specificity, when available. Evaluation with simulated and real data, including multi-omic datasets from the mouse striatum and novel promoter capture Hi-C data, demonstrate that Enhlink outperfoms alternative methods. Coupled with eQTL analysis, it identified a putative super-enhancer in striatal neurons. Overall, Enhlink offers accuracy, power, and potential for revealing novel biological insights in gene regulation., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
43. StaVia: spatially and temporally aware cartography with higher-order random walks for cell atlases.
- Author
-
Stassen SV, Kobashi M, Lam EY, Huang Y, Ho JWK, and Tsia KK
- Subjects
- Animals, Gastrulation, Computational Biology methods, Zebrafish, Single-Cell Analysis
- Abstract
Single-cell atlases pose daunting computational challenges pertaining to the integration of spatial and temporal information and the visualization of trajectories across large atlases. We introduce StaVia, a computational framework that synergizes multi-faceted single-cell data with higher-order random walks that leverage the memory of cells' past states, fused with a cartographic Atlas View that offers intuitive graph visualization. This spatially aware cartography captures relationships between cell populations based on their spatial location as well as their gene expression and developmental stage. We demonstrate this using zebrafish gastrulation data, underscoring its potential to dissect complex biological landscapes in both spatial and temporal contexts., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
44. Prevalence of and gene regulatory constraints on transcriptional adaptation in single cells.
- Author
-
Mellis IA, Melzer ME, Bodkin N, and Goyal Y
- Subjects
- Regulon, Humans, Animals, Transcription, Genetic, Adaptation, Physiological genetics, Gene Expression Regulation, Transcriptome, Single-Cell Analysis, Gene Regulatory Networks, Transcription Factors metabolism, Transcription Factors genetics
- Abstract
Background: Cells and tissues have a remarkable ability to adapt to genetic perturbations via a variety of molecular mechanisms. Nonsense-induced transcriptional compensation, a form of transcriptional adaptation, has recently emerged as one such mechanism, in which nonsense mutations in a gene trigger upregulation of related genes, possibly conferring robustness at cellular and organismal levels. However, beyond a handful of developmental contexts and curated sets of genes, no comprehensive genome-wide investigation of this behavior has been undertaken for mammalian cell types and conditions. How the regulatory-level effects of inherently stochastic compensatory gene networks contribute to phenotypic penetrance in single cells remains unclear., Results: We analyze existing bulk and single-cell transcriptomic datasets to uncover the prevalence of transcriptional adaptation in mammalian systems across diverse contexts and cell types. We perform regulon gene expression analyses of transcription factor target sets in both bulk and pooled single-cell genetic perturbation datasets. Our results reveal greater robustness in expression of regulons of transcription factors exhibiting transcriptional adaptation compared to those of transcription factors that do not. Stochastic mathematical modeling of minimal compensatory gene networks qualitatively recapitulates several aspects of transcriptional adaptation, including paralog upregulation and robustness to mutation. Combined with machine learning analysis of network features of interest, our framework offers potential explanations for which regulatory steps are most important for transcriptional adaptation., Conclusions: Our integrative approach identifies several putative hits-genes demonstrating possible transcriptional adaptation-to follow-up on experimentally and provides a formal quantitative framework to test and refine models of transcriptional adaptation., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
45. Inferring clonal somatic mutations directed by X chromosome inactivation status in single cells.
- Author
-
Demirci I, Larsson AJM, Chen X, Hartman J, Sandberg R, and Frisén J
- Subjects
- Humans, Female, Leukocytes, Mononuclear metabolism, Chromosomes, Human, X genetics, Clone Cells, T-Lymphocytes metabolism, Male, DNA, Mitochondrial genetics, X Chromosome Inactivation, Mutation, Single-Cell Analysis
- Abstract
Analysis of clonal dynamics in human tissues is enabled by somatic genetic variation. Here, we show that analysis of mitochondrial mutations in single cells is dramatically improved in females when using X chromosome inactivation to select informative clonal mutations. Applying this strategy to human peripheral mononuclear blood cells reveals clonal structures within T cells that otherwise are blurred by non-informative mutations, including the separation of gamma-delta T cells, suggesting this approach can be used to decipher clonal dynamics of cells in human tissues., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
46. Combined single-cell profiling of expression and DNA methylation reveals splicing regulation and heterogeneity
- Author
-
Stephanie M. Linker, Lara Urban, Stephen J. Clark, Mariya Chhatriwala, Shradha Amatya, Davis J. McCarthy, Ingo Ebersberger, Ludovic Vallier, Wolf Reik, Oliver Stegle, and Marc Jan Bonder
- Subjects
Single-cell analysis ,Alternative splicing ,DNA methylation ,Splicing prediction ,Cell differentiation ,Multi-omics ,Biology (General) ,QH301-705.5 ,Genetics ,QH426-470 - Abstract
Abstract Background Alternative splicing is a key regulatory mechanism in eukaryotic cells and increases the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied across human tissues and in genetically diverse populations. This has identified disease-relevant splicing events, as well as associations between splicing and genomic features, including sequence composition and conservation. However, variability in splicing between single cells from the same tissue or cell type and its determinants remains poorly understood. Results We applied parallel DNA methylation and transcriptome sequencing to differentiating human induced pluripotent stem cells to characterize splicing variation (exon skipping) and its determinants. Our results show that variation in single-cell splicing can be accurately predicted based on local sequence composition and genomic features. We observe moderate but consistent contributions from local DNA methylation profiles to splicing variation across cells. A combined model that is built based on genomic features as well as DNA methylation information accurately predicts different splicing modes of individual cassette exons. These categories include the conventional inclusion and exclusion patterns, but also more subtle modes of cell-to-cell variation in splicing. Finally, we identified and characterized associations between DNA methylation and splicing changes during cell differentiation. Conclusions Our study yields new insights into alternative splicing at the single-cell level and reveals a previously underappreciated link between DNA methylation variation and splicing.
- Published
- 2019
- Full Text
- View/download PDF
47. Single-cell decoding of drug induced transcriptomic reprogramming in triple negative breast cancers.
- Author
-
Kabeer F, Tran H, Andronescu M, Singh G, Lee H, Salehi S, Wang B, Biele J, Brimhall J, Gee D, Cerda V, O'Flanagan C, Algara T, Kono T, Beatty S, Zaikova E, Lai D, Lee E, Moore R, Mungall AJ, Williams MJ, Roth A, Campbell KR, Shah SP, and Aparicio S
- Subjects
- Humans, Animals, Female, Mice, DNA Copy Number Variations, Antineoplastic Agents pharmacology, Antineoplastic Agents therapeutic use, Gene Expression Regulation, Neoplastic drug effects, Epithelial-Mesenchymal Transition genetics, Triple Negative Breast Neoplasms genetics, Triple Negative Breast Neoplasms drug therapy, Single-Cell Analysis, Transcriptome, Drug Resistance, Neoplasm genetics
- Abstract
Background: The encoding of cell intrinsic drug resistance states in breast cancer reflects the contributions of genomic and non-genomic variations and requires accurate estimation of clonal fitness from co-measurement of transcriptomic and genomic data. Somatic copy number (CN) variation is the dominant mutational mechanism leading to transcriptional variation and notably contributes to platinum chemotherapy resistance cell states. Here, we deploy time series measurements of triple negative breast cancer (TNBC) single-cell transcriptomes, along with co-measured single-cell CN fitness, identifying genomic and transcriptomic mechanisms in drug-associated transcriptional cell states., Results: We present scRNA-seq data (53,641 filtered cells) from serial passaging TNBC patient-derived xenograft (PDX) experiments spanning 2.5 years, matched with genomic single-cell CN data from the same samples. Our findings reveal distinct clonal responses within TNBC tumors exposed to platinum. Clones with high drug fitness undergo clonal sweeps and show subtle transcriptional reversion, while those with weak fitness exhibit dynamic transcription upon drug withdrawal. Pathway analysis highlights convergence on epithelial-mesenchymal transition and cytokine signaling, associated with resistance. Furthermore, pseudotime analysis demonstrates hysteresis in transcriptional reversion, indicating generation of new intermediate transcriptional states upon platinum exposure., Conclusions: Within a polyclonal tumor, clones with strong genotype-associated fitness under platinum remained fixed, minimizing transcriptional reversion upon drug withdrawal. Conversely, clones with weaker fitness display non-genomic transcriptional plasticity. This suggests CN-associated and CN-independent transcriptional states could both contribute to platinum resistance. The dominance of genomic or non-genomic mechanisms within polyclonal tumors has implications for drug sensitivity, restoration, and re-treatment strategies., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
48. Detection of allele-specific expression in spatial transcriptomics with spASE.
- Author
-
Zou LS, Cable DM, Barrera-Lopez IA, Zhao T, Murray E, Aryee MJ, Chen F, and Irizarry RA
- Subjects
- Animals, Mice, Hippocampus metabolism, Gene Expression Profiling, Single-Cell Analysis, Alleles, Transcriptome, Cerebellum metabolism
- Abstract
Spatial transcriptomics technologies permit the study of the spatial distribution of RNA at near-single-cell resolution genome-wide. However, the feasibility of studying spatial allele-specific expression (ASE) from these data remains uncharacterized. Here, we introduce spASE, a computational framework for detecting and estimating spatial ASE. To tackle the challenges presented by cell type mixtures and a low signal to noise ratio, we implement a hierarchical model involving additive mixtures of spatial smoothing splines. We apply our method to allele-resolved Visium and Slide-seq from the mouse cerebellum and hippocampus and report new insight into the landscape of spatial and cell type-specific ASE therein., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
49. Effect of genomic and cellular environments on gene expression noise.
- Author
-
Hong CKY, Ramu A, Zhao S, and Cohen BA
- Subjects
- Humans, Genes, Reporter, Transcriptome, Genomics methods, Single-Cell Analysis
- Abstract
Background: Individual cells from isogenic populations often display large cell-to-cell differences in gene expression. This "noise" in expression derives from several sources, including the genomic and cellular environment in which a gene resides. Large-scale maps of genomic environments have revealed the effects of epigenetic modifications and transcription factor occupancy on mean expression levels, but leveraging such maps to explain expression noise will require new methods to assay how expression noise changes at locations across the genome., Results: To address this gap, we present Single-cell Analysis of Reporter Gene Expression Noise and Transcriptome (SARGENT), a method that simultaneously measures the noisiness of reporter genes integrated throughout the genome and the global mRNA profiles of individual reporter-gene-containing cells. Using SARGENT, we perform the first comprehensive genome-wide survey of how genomic locations impact gene expression noise. We find that the mean and noise of expression correlate with different histone modifications. We quantify the intrinsic and extrinsic components of reporter gene noise and, using the associated mRNA profiles, assign the extrinsic component to differences between the CD24+ "stem-like" substate and the more "differentiated" substate. SARGENT also reveals the effects of transgene integrations on endogenous gene expression, which will help guide the search for "safe-harbor" loci., Conclusions: Taken together, we show that SARGENT is a powerful tool to measure both the mean and noise of gene expression at locations across the genome and that the data generatd by SARGENT reveals important insights into the regulation of gene expression noise genome-wide., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
50. HATCHet2: clone- and haplotype-specific copy number inference from bulk tumor sequencing data.
- Author
-
Myers MA, Arnold BJ, Bansal V, Balaban M, Mullen KM, Zaccaria S, and Raphael BJ
- Subjects
- Humans, Male, Sequence Analysis, DNA methods, Neoplasms genetics, Gene Frequency, Single-Cell Analysis, Haplotypes, DNA Copy Number Variations, Algorithms, Prostatic Neoplasms genetics
- Abstract
Bulk DNA sequencing of multiple samples from the same tumor is becoming common, yet most methods to infer copy-number aberrations (CNAs) from this data analyze individual samples independently. We introduce HATCHet2, an algorithm to identify haplotype- and clone-specific CNAs simultaneously from multiple bulk samples. HATCHet2 extends the earlier HATCHet method by improving identification of focal CNAs and introducing a novel statistic, the minor haplotype B-allele frequency (mhBAF), that enables identification of mirrored-subclonal CNAs. We demonstrate HATCHet2's improved accuracy using simulations and a single-cell sequencing dataset. HATCHet2 analysis of 10 prostate cancer patients reveals previously unreported mirrored-subclonal CNAs affecting cancer genes., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.