VPAC: Variational projection for accurate clustering of single-cell transcriptomic data.

Authors :
Chen, Shengquan
Hua, Kui
Cui, Hongfei
Jiang, Rui
Source :
BMC Bioinformatics. 5/1/2019 Supplement 7, Vol. 20 Issue 7, p1-13. 13p. 1 Color Photograph, 2 Diagrams, 1 Chart, 5 Graphs.
Publication Year :
2019

Abstract

Background: Single-cell RNA-sequencing (scRNA-seq) technologies have advanced rapidly in recent years and have enabled quantitative characterization at microscopic resolution. With the exponential growth in the number of cells profiled in individual scRNA-seq experiments, identifying putative cell types from the data has become a great challenge that calls for novel computational methods. Although a variety of algorithms have recently been proposed for single-cell clustering, limitations such as low accuracy, inferior robustness, and inadequate stability greatly restrict the scope of application of these methods. Results: We propose a novel model-based algorithm, named VPAC, for accurate clustering of single-cell transcriptomic data through variational projection, which assumes that single-cell samples follow a Gaussian mixture distribution in a latent space. Through comprehensive validation experiments, we demonstrate that VPAC not only can be applied to datasets of discrete counts and normalized continuous data, but also scales well across data dimensionalities, dataset sizes, and degrees of data sparsity. We further illustrate the ability of VPAC to detect genes with strong unique signatures of a specific cell type, which may shed light on studies in systems biology. We have released a user-friendly Python package of VPAC on GitHub (https://github.com/ShengquanChen/VPAC). Users can directly import our VPAC class and conduct clustering without tedious installation of dependency packages. Conclusions: VPAC enables highly accurate clustering of single-cell transcriptomic data via a statistical model. We expect to see wide application of our method not only to transcriptome studies for fully understanding cell identity and functionality, but also to the clustering of more general data. [ABSTRACT FROM AUTHOR]
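
Illustrative note: the abstract's central modelling assumption is that cells follow a Gaussian mixture distribution in a latent space. The sketch below is not the VPAC algorithm and does not use the VPAC package, whose API is not described in this record. It only illustrates the mixture assumption by fitting a one-dimensional Gaussian mixture with plain expectation-maximization to toy latent coordinates. The function names (`gaussian_pdf`, `fit_gmm_1d`), the toy values in `latent`, and all parameters are hypothetical and chosen for illustration; the variational projection that would produce the latent coordinates is assumed to have been done already.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(points, n_components=2, n_iter=50, min_var=1e-6):
    """Fit a 1-D Gaussian mixture by EM; return (weights, means, variances, labels)."""
    pts = sorted(points)
    n = len(pts)
    # Deterministic initialisation: means at evenly spaced quantiles of the data.
    means = [pts[int((k + 0.5) * n / n_components)] for k in range(n_components)]
    overall_mean = sum(pts) / n
    overall_var = max(sum((x - overall_mean) ** 2 for x in pts) / n, min_var)
    variances = [overall_var] * n_components
    weights = [1.0 / n_components] * n_components

    resp = []
    for _ in range(n_iter):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in points:
            dens = [w * gaussian_pdf(x, m, v)
                    for w, m, v in zip(weights, means, variances)]
            total = sum(dens) or 1e-300  # guard against division by zero
            resp.append([d / total for d in dens])
        # M-step: re-estimate weight, mean and variance of each component.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, points)) / nk
            spread = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, points)) / nk
            variances[k] = max(spread, min_var)

    # Hard cluster assignment: the component with the largest responsibility.
    labels = [max(range(n_components), key=lambda k: r[k]) for r in resp]
    return weights, means, variances, labels


# Toy latent coordinates for ten "cells" drawn around two centres (made-up values).
latent = [-2.1, -1.9, -2.0, -1.8, -2.2, 2.0, 1.9, 2.1, 2.2, 1.8]
weights, means, variances, labels = fit_gmm_1d(latent)
print(labels)
```

In a model-based method of the kind the abstract describes, each cell's cluster would similarly be read off from its most probable mixture component, though the latent representation and the fitting procedure in VPAC itself may differ from this sketch.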

Details

Language :
English
ISSN :
14712105
Volume :
20
Issue :
7
Database :
Academic Search Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
136186689
Full Text :
https://doi.org/10.1186/s12859-019-2742-4