Back to Search Start Over

Comparative study of various molecular feature representations for solvation free energy predictions of neutral species.

Authors :
Isaev VV
Minenkov Y
Source :
Journal of molecular graphics & modelling [J Mol Graph Model] 2025 Jan; Vol. 134, pp. 108901. Date of Electronic Publication: 2024 Nov 01.
Publication Year :
2025

Abstract

Predicting molecular properties with the help of Neural Networks is a common way to substitute or enhance comprehensive quantum-chemical calculations. One of the problems facing researchers is the choice of vectorization approach to representing the solvent and the solute for the estimator model. In this work, 10 different approaches have been investigated for both organic solute and solvent including vectorizers that relied on macroscopic parameters, functional groups classification, molecular graphs, or atomic coordinates. A variation of the Bag of Bonds approach called JustBonds, trained on the MNSol database, showed the best overall performance resulting in RMSD <2 kcal/mol for the blind dataset that contains the solutes not presented in the training subset and <1 kcal/mol on records from Solv@TUM database, which is close to contemporary continuum models. We have also demonstrated that the most important bags usually contain heteroatom and play a key role in the solvation process. Furthermore, the small role of solvent vectorization was demonstrated and revealed that approaches based on functional groups or macroscopic solvent parameters are often enough to efficiently represent solvent media.<br />Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.<br /> (Copyright © 2024. Published by Elsevier Inc.)

Details

Language :
English
ISSN :
1873-4243
Volume :
134
Database :
MEDLINE
Journal :
Journal of molecular graphics & modelling
Publication Type :
Academic Journal
Accession number :
39515275
Full Text :
https://doi.org/10.1016/j.jmgm.2024.108901