1. Decoding biology with massively parallel reporter assays and machine learning.
- Author
La Fleur, Alyssa, Shi, Yongsheng, and Seelig, Georg
- Subjects
gene regulation; machine learning; massively parallel reporter assays; Machine Learning; Humans; Genes, Reporter; Animals; High-Throughput Nucleotide Sequencing; Gene Expression Regulation
- Abstract
Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of cis-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on cis-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.
- Published
2024