Back to Search Start Over

Accounting for technical noise in Bayesian graphical models of single-cell RNA-sequencing data.

Authors :
Oh J
Chang C
Long Q
Source :
Biostatistics (Oxford, England) [Biostatistics] 2022 Dec 12; Vol. 24 (1), pp. 161-176.
Publication Year :
2022

Abstract

Single-cell RNA-sequencing (scRNAseq) data contain a high level of noise, especially in the form of zero-inflation, that is, the presence of an excessively large number of zeros. This is largely due to dropout events and amplification biases that occur in the preparation stage of single-cell experiments. Recent scRNAseq experiments have been augmented with unique molecular identifiers (UMI) and External RNA Control Consortium (ERCC) molecules which can be used to account for zero-inflation. However, most of the current methods on graphical models are developed under the assumption of the multivariate Gaussian distribution or its variants, and thus they are not able to adequately account for an excessively large number of zeros in scRNAseq data. In this article, we propose a single-cell latent graphical model (scLGM)-a Bayesian hierarchical model for estimating the conditional dependency network among genes using scRNAseq data. Taking advantage of UMI and ERCC data, scLGM explicitly models the two sources of zero-inflation. Our simulation study and real data analysis demonstrate that the proposed approach outperforms several existing methods.<br /> (© The Author 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1468-4357
Volume :
24
Issue :
1
Database :
MEDLINE
Journal :
Biostatistics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
34520533
Full Text :
https://doi.org/10.1093/biostatistics/kxab011