Back to Search
Start Over
Integrating Deep Supervised, Self-Supervised and Unsupervised Learning for Single-Cell RNA-seq Clustering and Annotation.
- Source :
-
Genes [Genes (Basel)] 2020 Jul 14; Vol. 11 (7). Date of Electronic Publication: 2020 Jul 14. - Publication Year :
- 2020
-
Abstract
- As single-cell RNA sequencing technologies mature, massive gene expression profiles can be obtained. Consequently, cell clustering and annotation become two crucial and fundamental procedures affecting other specific downstream analyses. Most existing single-cell RNA-seq (scRNA-seq) data clustering algorithms do not take into account the available cell annotation results on the same tissues or organisms from other laboratories. Nonetheless, such data could assist and guide the clustering process on the target dataset. Identifying marker genes through differential expression analysis to manually annotate large amounts of cells also costs labor and resources. Therefore, in this paper, we propose a novel end-to-end cell supervised clustering and annotation framework called scAnCluster, which fully utilizes the cell type labels available from reference data to facilitate the cell clustering and annotation on the unlabeled target data. Our algorithm integrates deep supervised learning, self-supervised learning and unsupervised learning techniques together, and it outperforms other customized scRNA-seq supervised clustering methods in both simulation and real data. It is particularly worth noting that our method performs well on the challenging task of discovering novel cell types that are absent in the reference data.
- Subjects :
- Cluster Analysis
Computer Simulation
Gene Expression Profiling
Genetic Markers genetics
RNA-Seq statistics & numerical data
Sequence Analysis, RNA methods
Sequence Analysis, RNA statistics & numerical data
Single-Cell Analysis statistics & numerical data
Unsupervised Machine Learning statistics & numerical data
Exome Sequencing methods
Exome Sequencing statistics & numerical data
Molecular Sequence Annotation
RNA-Seq methods
Single-Cell Analysis methods
Transcriptome genetics
Subjects
Details
- Language :
- English
- ISSN :
- 2073-4425
- Volume :
- 11
- Issue :
- 7
- Database :
- MEDLINE
- Journal :
- Genes
- Publication Type :
- Academic Journal
- Accession number :
- 32674393
- Full Text :
- https://doi.org/10.3390/genes11070792