Back to Search
Start Over
Exponential family measurement error models for single-cell CRISPR screens.
- Source :
-
Biostatistics . Oct2024, Vol. 25 Issue 4, p1254-1272. 19p. - Publication Year :
- 2024
-
Abstract
- CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens—"thresholded regression"—exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV ("GLM-based errors-in-variables"), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across hundreds of processors on clouds (e.g. Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, yielding several new insights. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 14654644
- Volume :
- 25
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- Biostatistics
- Publication Type :
- Academic Journal
- Accession number :
- 180255609
- Full Text :
- https://doi.org/10.1093/biostatistics/kxae010