1. Enhancing reaction-based de novo design using a multi-label reaction class recommender
- Author
-
Valerie J. Gillet, Gian Marco Ghiandoni, James Webster, Beining Chen, James E. A. Wallace, Dimitar Hristozov, and Michael J. Bodkin
- Subjects
Computer science ,Databases, Pharmaceutical ,Chemistry, Pharmaceutical ,Multi-label classification ,Chemistry Techniques, Synthetic ,computer.software_genre ,01 natural sciences ,Article ,Set (abstract data type) ,Machine Learning ,Small Molecule Libraries ,03 medical and health sciences ,External data ,Drug Discovery ,Reaction class recommender ,Humans ,Computer Simulation ,Physical and Theoretical Chemistry ,030304 developmental biology ,0303 health sciences ,Filter (signal processing) ,Limiting ,Class (biology) ,0104 chemical sciences ,Computer Science Applications ,010404 medicinal & biomolecular chemistry ,Reaction vector ,Transformation (function) ,De novo design ,Drug Design ,Computer-Aided Design ,Data mining ,computer ,Combinatorial explosion ,Algorithms ,Databases, Chemical - Abstract
Reaction-based de novo design refers to the in-silico generation of novel chemical structures by combining reagents using structural transformations derived from known reactions. The driver for using reaction-based transformations is to increase the likelihood of the designed molecules being synthetically accessible. We have previously described a reaction-based de novo design method based on reaction vectors which are transformation rules that are encoded automatically from reaction databases. A limitation of reaction vectors is that they account for structural changes that occur at the core of a reaction only, and they do not consider the presence of competing functionalities that can compromise the reaction outcome. Here, we present the development of a Reaction Class Recommender to enhance the reaction vector framework. The recommender is intended to be used as a filter on the reaction vectors that are applied during de novo design to reduce the combinatorial explosion of in-silico molecules produced while limiting the generated structures to those which are most likely to be synthesisable. The recommender has been validated using an external data set extracted from the recent medicinal chemistry literature and in two simulated de novo design experiments. Results suggest that the use of the recommender drastically reduces the number of solutions explored by the algorithm while preserving the chance of finding relevant solutions and increasing the global synthetic accessibility of the designed molecules.
- Published
- 2020