1. Accurate proteome-wide missense variant effect prediction with AlphaMissense.
- Author
- Cheng J, Novati G, Pan J, Bycroft C, Žemgulytė A, Applebaum T, Pritzel A, Wong LH, Zielinski M, Sargeant T, Schneider RG, Senior AW, Jumper J, Hassabis D, Kohli P, and Avsec Ž
- Subjects
- Humans; Benchmarking; Conserved Sequence; Databases, Genetic; Genome, Human; Protein Conformation; Machine Learning; Amino Acid Substitution genetics; Disease genetics; Mutation, Missense; Proteome genetics; Sequence Alignment methods
- Abstract
The vast majority of missense variants observed in the human genome are of unknown clinical significance. We present AlphaMissense, an adaptation of AlphaFold fine-tuned on human and primate variant population frequency databases to predict missense variant pathogenicity. By combining structural context and evolutionary conservation, our model achieves state-of-the-art results across a wide range of genetic and experimental benchmarks, all without explicitly training on such data. The average pathogenicity score of genes is also predictive for their cell essentiality, capable of identifying short essential genes that existing statistical approaches are underpowered to detect. As a resource to the community, we provide a database of predictions for all possible human single amino acid substitutions and classify 89% of missense variants as either likely benign or likely pathogenic.
- Published
- 2023