Random, de novo, and conserved proteins: How structure and disorder predictors perform differently.

Authors :
Middendorf, Lasse
Eicholt, Lars A.
Source :
Proteins; Jun 2024, Vol. 92 Issue 6, p757-767, 11p
Publication Year :
2024

Abstract

Understanding the emergence and structural characteristics of de novo and random proteins is crucial for unraveling protein evolution and designing novel enzymes. However, experimental determination of their structures remains challenging. Recent advancements in protein structure prediction, particularly with AlphaFold2 (AF2), have expanded our knowledge of protein structures, but their applicability to de novo and random proteins is unclear. In this study, we investigate the structural predictions and confidence scores of AF2 and protein language model-based predictor ESMFold for de novo and conserved proteins from Drosophila and a dataset of comparable random proteins. We find that the structural predictions for de novo and random proteins differ significantly from conserved proteins. Interestingly, a positive correlation between disorder and confidence scores (pLDDT) is observed for de novo and random proteins, in contrast to the negative correlation observed for conserved proteins. Furthermore, the performance of structure predictors for de novo and random proteins is hampered by the lack of sequence identity. We also observe fluctuating median predicted disorder among different sequence length quartiles for random proteins, suggesting an influence of sequence length on disorder predictions. In conclusion, while structure predictors provide initial insights into the structural composition of de novo and random proteins, their accuracy and applicability to such proteins remain limited. Experimental determination of their structures is necessary for a comprehensive understanding. The positive correlation between disorder and pLDDT could imply a potential for conditional folding and transient binding interactions of de novo and random proteins. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0887-3585
Volume :
92
Issue :
6
Database :
Complementary Index
Journal :
Proteins
Publication Type :
Academic Journal
Accession number :
176867305
Full Text :
https://doi.org/10.1002/prot.26652