Back to Search Start Over

Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.

Authors :
Cui X
Chen X
Li Z
Gao Z
Chen S
Jiang R
Source :
Nature computational science [Nat Comput Sci] 2024 May; Vol. 4 (5), pp. 346-359. Date of Electronic Publication: 2024 May 10.
Publication Year :
2024

Abstract

Single-cell epigenomic data has been growing continuously at an unprecedented pace, but their characteristics such as high dimensionality and sparsity pose substantial challenges to downstream analysis. Although deep learning models-especially variational autoencoders-have been widely used to capture low-dimensional feature embeddings, the prevalent Gaussian assumption somewhat disagrees with real data, and these models tend to struggle to incorporate reference information from abundant cell atlases. Here we propose CASTLE, a deep generative model based on the vector-quantized variational autoencoder framework to extract discrete latent embeddings that interpretably characterize single-cell chromatin accessibility sequencing data. We validate the performance and robustness of CASTLE for accurate cell-type identification and reasonable visualization compared with state-of-the-art methods. We demonstrate the advantages of CASTLE for effective incorporation of existing massive reference datasets in a weakly supervised or supervised manner. We further demonstrate CASTLE's capacity for intuitively distilling cell-type-specific feature spectra that unveil cell heterogeneity and biological implications quantitatively.<br /> (© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.)

Details

Language :
English
ISSN :
2662-8457
Volume :
4
Issue :
5
Database :
MEDLINE
Journal :
Nature computational science
Publication Type :
Academic Journal
Accession number :
38730185
Full Text :
https://doi.org/10.1038/s43588-024-00625-4