Back to Search Start Over

Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network.

Authors :
Hooshmand SA
Jamalkandi SA
Alavi SM
Masoudi-Nejad A
Source :
Molecular diversity [Mol Divers] 2021 May; Vol. 25 (2), pp. 827-838. Date of Electronic Publication: 2020 Mar 19.
Publication Year :
2021

Abstract

The advent of computational methods for efficient prediction of the druglikeness of small molecules and their ever-burgeoning applications in the fields of medicinal chemistry and drug industries have been a profound scientific development, since only a few amounts of the small molecule libraries were identified as approvable drugs. In this study, a deep belief network was utilized to construct a druglikeness classification model. For this purpose, small molecules and approved drugs from the ZINC database were selected for the unsupervised pre-training step and supervised training step. Various binary fingerprints such as Macc 166 bit, PubChem 881 bit, and Morgan 2048 bit as data features were investigated. The report revealed that using an unsupervised pre-training phase can lead to a good performance model and generalizability capability. Accuracy, precision, and recall of the model for Macc features were 97%, 96%, and 99%, respectively. For more consideration about the generalizability of the model, the external data by expression and investigational drugs in drug banks as drug data and randomly selected data from the ZINC database as non-drug were created. The results confirmed the good performance and generalizability capability of the model. Also, the outcomes depicted that a large proportion of misclassified non-drug small molecules ascertain the bioavailability conditions and could be investigated as a drug in the future. Furthermore, our model attempted to tap potential opportunities as a drug filter in drug discovery.

Details

Language :
English
ISSN :
1573-501X
Volume :
25
Issue :
2
Database :
MEDLINE
Journal :
Molecular diversity
Publication Type :
Academic Journal
Accession number :
32193758
Full Text :
https://doi.org/10.1007/s11030-020-10065-7