Back to Search Start Over

RediscMol: Benchmarking Molecular Generation Models in Biological Properties.

Authors :
Weng G
Zhao H
Nie D
Zhang H
Liu L
Hou T
Kang Y
Source :
Journal of medicinal chemistry [J Med Chem] 2024 Jan 25; Vol. 67 (2), pp. 1533-1543. Date of Electronic Publication: 2024 Jan 05.
Publication Year :
2024

Abstract

Deep learning-based molecular generative models have garnered emerging attention for their capability to generate molecules with novel structures and desired physicochemical properties. However, the evaluation of these models, particularly in a biological context, remains insufficient. To address the limitations of existing metrics and emulate practical application scenarios, we construct the RediscMol benchmark that comprises active molecules extracted from 5 kinase and 3 GPCR data sets. A set of rediscovery- and similarity-related metrics are introduced to assess the performance of 8 representative generative models (CharRNN, VAE, Reinvent, AAE, ORGAN, RNNAttn, TransVAE, and GraphAF). Our findings based on the RediscMol benchmark differ from those of previous evaluations. CharRNN, VAE, and Reinvent exhibit a greater ability to reproduce known active molecules, while RNNAttn, TransVAE, and GraphAF struggle in this aspect despite their notable performance on commonly used distribution-learning metrics. Our evaluation framework may provide valuable guidance for advancing generative models in real-world drug design scenarios.

Subjects

Subjects :
Models, Molecular
Drug Design

Details

Language :
English
ISSN :
1520-4804
Volume :
67
Issue :
2
Database :
MEDLINE
Journal :
Journal of medicinal chemistry
Publication Type :
Academic Journal
Accession number :
38181194
Full Text :
https://doi.org/10.1021/acs.jmedchem.3c02051