1. Chemography: Searching for Hidden Treasures
- Author
-
Alexandre Varnek, Dmitriy M. Volochnyuk, Dragos Horvath, Gilles Marcou, Arkadii Lin, Yuliana Zabolotna, Chimie de la matière complexe (CMC), Université de Strasbourg (UNISTRA)-Institut de Chimie du CNRS (INC)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de Chémoinformatique, Université de Strasbourg (UNISTRA)-Institut de Chimie du CNRS (INC)-Centre National de la Recherche Scientifique (CNRS)-Université de Strasbourg (UNISTRA)-Institut de Chimie du CNRS (INC)-Centre National de la Recherche Scientifique (CNRS), Institute of Organic Chemistry of NASU [Kyiv], and National Academy of Sciences of Ukraine (NASU)
- Subjects
Engineering ,010304 chemical physics ,business.industry ,Chemistry, Pharmaceutical ,General Chemical Engineering ,General Chemistry ,Library and Information Sciences ,01 natural sciences ,0104 chemical sciences ,Computer Science Applications ,World Wide Web ,010404 medicinal & biomolecular chemistry ,0103 physical sciences ,business ,[CHIM.CHEM]Chemical Sciences/Cheminformatics - Abstract
International audience; The days when medicinal chemistry was limited to a few series of compounds of therapeutic interest are long gone. Nowadays, no human may succeed to acquire a complete overview of more than a billion existing or feasible compounds within which the potential “blockbuster drugs” are well hidden and yet only a few mouse clicks away. To reach these “hidden treasures”, we adapted the generative topographic mapping method to enable efficient navigation through the chemical space, from a global overview to a structural pattern detection, covering, for the first time, the complete ZINC library of purchasable compounds, relative to 1.6 million biologically relevant ChEMBL molecules. About 40 000 hierarchical maps of the chemical space were constructed. Structural motifs inherent to only one library were identified. Roughly 20 000 off-market ChEMBL compound families represent incentives to enrich commercial catalogs. Alternatively, 125 000 ZINC-specific compound classes, absent in structure–activity bases, are novel paths to explore in medicinal chemistry. The complete list of these chemotypes can be downloaded using the link https://forms.gle/B6bUJj82t9EfmttV6.
- Published
- 2020
- Full Text
- View/download PDF