1. Predicting cellular responses to complex perturbations in high‐throughput screens
- Author
-
Lotfollahi, Mohammad, Susmelj, Anna Klimovskaia, De Donno, Carlo, Hetzel, Leon, Ji, Yuge, Ibarra, Ignacio L, Srivatsan, Sanjay R, Naghipourfar, Mohsen, Daza, Riza M, Martin, Beth, Shendure, Jay, McFaline‐Figueroa, Jose L, Boyeau, Pierre, Wolf, F Alexander, Yakubova, Nafissa, Günnemann, Stephan, Trapnell, Cole, Lopez‐Paz, David, and Theis, Fabian J
- Subjects
Biochemistry and Cell Biology ,Biological Sciences ,Genetics ,Bioengineering ,Underpinning research ,1.1 Normal biological development and functioning ,Generic health relevance ,Gene Expression Profiling ,High-Throughput Screening Assays ,Computational Biology ,Single-Cell Gene Expression Analysis ,generative modeling ,high-throughput screening ,machine learning ,perturbation prediction ,single-cell transcriptomics ,Other Biological Sciences ,Bioinformatics ,Biochemistry and cell biology - Abstract
Recent advances in multiplexed single-cell transcriptomics experiments facilitate the high-throughput study of drug and genetic perturbations. However, an exhaustive exploration of the combinatorial perturbation space is experimentally unfeasible. Therefore, computational methods are needed to predict, interpret, and prioritize perturbations. Here, we present the compositional perturbation autoencoder (CPA), which combines the interpretability of linear models with the flexibility of deep-learning approaches for single-cell response modeling. CPA learns to in silico predict transcriptional perturbation response at the single-cell level for unseen dosages, cell types, time points, and species. Using newly generated single-cell drug combination data, we validate that CPA can predict unseen drug combinations while outperforming baseline models. Additionally, the architecture's modularity enables incorporating the chemical representation of the drugs, allowing the prediction of cellular response to completely unseen drugs. Furthermore, CPA is also applicable to genetic combinatorial screens. We demonstrate this by imputing in silico 5,329 missing combinations (97.6% of all possibilities) in a single-cell Perturb-seq experiment with diverse genetic interactions. We envision CPA will facilitate efficient experimental design and hypothesis generation by enabling in silico response prediction at the single-cell level and thus accelerate therapeutic applications using single-cell technologies.
- Published
- 2023