Back to Search Start Over

Challenges in predicting stabilizing variations: An exploration.

Authors :
Benevenuta S
Birolo G
Sanavia T
Capriotti E
Fariselli P
Source :
Frontiers in molecular biosciences [Front Mol Biosci] 2023 Jan 05; Vol. 9, pp. 1075570. Date of Electronic Publication: 2023 Jan 05 (Print Publication: 2022).
Publication Year :
2023

Abstract

An open challenge of computational and experimental biology is understanding the impact of non-synonymous DNA variations on protein function and, subsequently, human health. The effects of these variants on protein stability can be measured as the difference in the free energy of unfolding (ΔΔ G ) between the mutated structure of the protein and its wild-type form. Throughout the years, bioinformaticians have developed a wide variety of tools and approaches to predict the ΔΔ G . Although the performance of these tools is highly variable, overall they are less accurate in predicting ΔΔ G stabilizing variations rather than the destabilizing ones. Here, we analyze the possible reasons for this difference by focusing on the relationship between experimentally-measured ΔΔ G and seven protein properties on three widely-used datasets (S2648, VariBench, Ssym) and a recently introduced one (S669). These properties include protein structural information, different physical properties and statistical potentials. We found that two highly used input features, i.e., hydrophobicity and the Blosum62 substitution matrix, show a performance close to random choice when trying to separate stabilizing variants from either neutral or destabilizing ones. We then speculate that, since destabilizing variations are the most abundant class in the available datasets, the overall performance of the methods is higher when including features that improve the prediction for the destabilizing variants at the expense of the stabilizing ones. These findings highlight the need of designing predictive methods able to exploit also input features highly correlated with the stabilizing variants. New tools should also be tested on a not-artificially balanced dataset, reporting the performance on all the three classes (i.e., stabilizing, neutral and destabilizing variants) and not only the overall results.<br />Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.<br /> (Copyright © 2023 Benevenuta, Birolo, Sanavia, Capriotti and Fariselli.)

Details

Language :
English
ISSN :
2296-889X
Volume :
9
Database :
MEDLINE
Journal :
Frontiers in molecular biosciences
Publication Type :
Academic Journal
Accession number :
36685278
Full Text :
https://doi.org/10.3389/fmolb.2022.1075570