Back to Search
Start Over
Discrete latent embedding of single-cell chromatin accessibility sequencing data for uncovering cell heterogeneity.
- Source :
-
Nature computational science [Nat Comput Sci] 2024 May; Vol. 4 (5), pp. 346-359. Date of Electronic Publication: 2024 May 10. - Publication Year :
- 2024
-
Abstract
- Single-cell epigenomic data has been growing continuously at an unprecedented pace, but their characteristics such as high dimensionality and sparsity pose substantial challenges to downstream analysis. Although deep learning models-especially variational autoencoders-have been widely used to capture low-dimensional feature embeddings, the prevalent Gaussian assumption somewhat disagrees with real data, and these models tend to struggle to incorporate reference information from abundant cell atlases. Here we propose CASTLE, a deep generative model based on the vector-quantized variational autoencoder framework to extract discrete latent embeddings that interpretably characterize single-cell chromatin accessibility sequencing data. We validate the performance and robustness of CASTLE for accurate cell-type identification and reasonable visualization compared with state-of-the-art methods. We demonstrate the advantages of CASTLE for effective incorporation of existing massive reference datasets in a weakly supervised or supervised manner. We further demonstrate CASTLE's capacity for intuitively distilling cell-type-specific feature spectra that unveil cell heterogeneity and biological implications quantitatively.<br /> (© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.)
Details
- Language :
- English
- ISSN :
- 2662-8457
- Volume :
- 4
- Issue :
- 5
- Database :
- MEDLINE
- Journal :
- Nature computational science
- Publication Type :
- Academic Journal
- Accession number :
- 38730185
- Full Text :
- https://doi.org/10.1038/s43588-024-00625-4