1. From mundane to surprising nonadditivity: drivers and impact on ML models.
- Author
-
Guasch L, Maeder N, Cumming JG, and Kramer C
- Subjects
- Structure-Activity Relationship, Humans, Ligands, Drug Discovery methods, Molecular Conformation, Machine Learning
- Abstract
Nonadditivity (NA) in Structure-Activity and Structure-Property Relationship (SAR) data is a rare but very information rich phenomenon. It can indicate conformational flexibility, structural rearrangements, and errors in assay results and structural assignment. While purely ligand-based conformational causes of NA are rather well understood and mundane, other factors are less so and cause surprising NA that has a huge influence on SAR analysis and ML model performance. We here report a systematic analysis across a wide range of properties (20 on-target biological activities and 4 physicochemical ADME-related properties) to understand the frequency of various different phenomena that may lead to NA. A set of novel descriptors were developed to characterize double transformation cycles and identify trends in NA. Double transformation cycles were classified into "surprising" and "mundane" categories, with the majority being classed as mundane. We also examined commonalities among surprising cycles, finding LogP differences to have the most significant impact on NA. A distinct behavior of NA for on-target sets compared to ADME sets was observed. Finally, we show that machine learning models struggle with highly nonadditive data, indicating that a better understanding of NA is an important future research direction., (© 2024. The Author(s), under exclusive licence to Springer Nature Switzerland AG.)
- Published
- 2024
- Full Text
- View/download PDF