1. Defining protein variant functions using high-complexity mutagenesis libraries and enhanced mutant detection software ASMv1.0
- Author
-
William C. Hahn, Ted Sharpe, Xiao Yang, Andrew J. Aguirre, Douglas Alan, Giacomelli Ao, Federica Piccioni, Lee Bp, Sprenkle L, Todd Green, Adam Brown, Andrew L. Hong, Nicole S. Persky, Sawyer E, Briana Fritchman, Hayes Tk, Matthew Meyerson, Robert E. Lintner, David E. Root, Heidi Greulich, Cory M. Johannessen, and Kawabe H
- Subjects
Open reading frame ,Software ,Computer science ,Shotgun sequencing ,business.industry ,Mutagenesis (molecular biology technique) ,Mutation detection ,Computational biology ,ORFS ,business ,Gene ,Stop codon - Abstract
Pooled variant expression libraries can test the phenotypes of thousands of variants of a gene in a single multiplexed experiment. In a library encoding all single-amino-acid substitutions of a protein, each variant differs from its reference only at a single codon-position located anywhere along the coding sequence. Consequently, accurately identifying these variants by sequencing is a major technical challenge. A popular but expensive brute-force approach is to divide the pool of variants into multiple smaller sub-libraries that each contains variants of a small region and that must each be constructed and screened individually, but that can then be PCR-amplified and fully sequenced with a single read to allow direct readout of variant abundance. Here we present an approach to screen very large variant libraries with mutations spanning a wide region in a single pool, including library design criteria and mutant-detection algorithms that permit reliable calling and counting of variants from large-scale sequencing data.
- Published
- 2021
- Full Text
- View/download PDF