1. DeepTracer ID: De Novo Protein Identification from Cryo-EM Maps
- Author
-
Luca Chang, Fengbin Wang, Kiernan Connolly, Hanze Meng, Zhangli Su, Virginija Cvirkaite-Krupovic, Mart Krupovic, Edward H. Egelman, Dong Si, University of Washington-Bothell, University of Virginia, University of Washington [Seattle], University of Alabama at Birmingham [ Birmingham] (UAB), Virologie des archées - Archaeal Virology, Université Paris Cité (UPCité)-Microbiologie Intégrative et Moléculaire (UMR6047), Institut Pasteur [Paris] (IP)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS)-Institut Pasteur [Paris] (IP)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), The cryo-EM imaging was done at the Molecular Electron Microscopy Core Facility at the University of Virginia, which is supported by the School of Medicine and built with NIH grant G20-RR31199. This work was supported by NIH grant GM122510 (E.H.E.), K99GM138756 (F.W.), K99CA259526 (Z.S.), NSF grant 2030381 (D.S.), the SRCP Seed Grant at the University of Washington Bothell (D.S.), and l’Agence Nationale de la Recherche grant ANR-21-CE11-0001-01 (M.K.)., and ANR-21-CE11-0001,ArcFus,Protéines de classe II de fusion membranaire chez les archées(2021)
- Subjects
Models, Molecular ,[SDV.BBM.BS]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Structural Biology [q-bio.BM] ,Protein Conformation ,Cryoelectron Microscopy ,Biophysics ,Proteins ,Software - Abstract
The recent revolution in cryo-electron microscopy has made it possible to determine macromolecular structures directly from cell extracts. However, identifying the correct protein from the cryo-EM map is still challenging and often needs additional sequence information from other techniques, such as tandem mass spectrometry and/or bioinformatics. Here, we present DeepTracer-ID, a server-based approach to identify the candidate protein in a user-provided organism de novo from a cryo-EM map, without the need for additional information. Our method first uses DeepTracer to generate a protein backbone model that best represents the cryo-EM map, and this model is then searched against the library of AlphaFold2 predictions for all proteins in the given organism. This method is highly accurate and robust: in all 13 experimental maps tested blindly, DeepTracer-ID identified the correct proteins as the top candidates. Eight of the maps were of known structures, while the other five unpublished maps were validated by prior protein annotation and careful inspection of the model refined into the map. The program also showed promising results for both homomeric and heteromeric protein complexes. This platform is possible because of the recent breakthroughs in large-scale protein 3D structure prediction.Statement of SignificanceWhile it has now become routine for cryo-EM maps of proteins to reach a near-atomic resolution, potentially allowing for reliable atomic models to be built, there are a growing number of instances where the protein identity may not be known. Without knowing the protein sequence, it is impossible to build an atomic model. DeepTracer-ID is a server-based approach to surmount this problem by identifying the proteins in a given organism that are found in the cryo-EM map. A free web service for global academic access is provided.
- Published
- 2022
- Full Text
- View/download PDF