Back to Search
Start Over
In silico prediction of chemical acute oral toxicity using multi-classification methods.
- Source :
-
Journal of chemical information and modeling [J Chem Inf Model] 2014 Apr 28; Vol. 54 (4), pp. 1061-9. Date of Electronic Publication: 2014 Apr 16. - Publication Year :
- 2014
-
Abstract
- Chemical acute oral toxicity is an important end point in drug design and environmental risk assessment. However, it is difficult to determine by experiments, and in silico methods are hence developed as an alternative. In this study, a comprehensive data set containing 12, 204 diverse compounds with median lethal dose (LD₅₀) was compiled. These chemicals were classified into four categories, namely categories I, II, III and IV, based on the criterion of the U.S. Environmental Protection Agency (EPA). Then several multiclassification models were developed using five machine learning methods, including support vector machine (SVM), C4.5 decision tree (C4.5), random forest (RF), κ-nearest neighbor (kNN), and naïve Bayes (NB) algorithms, along with MACCS and FP4 fingerprints. One-against-one (OAO) and binary tree (BT) strategies were employed for SVM multiclassification. Performances were measured by two external validation sets containing 1678 and 375 chemicals, separately. The overall accuracy of the MACCS-SVM(OAO) model was 83.0% and 89.9% for external validation sets I and II, respectively, which showed reliable predictive accuracy for each class. In addition, some representative substructures responsible for acute oral toxicity were identified using information gain and substructure frequency analysis methods, which might be very helpful for further study to avoid the toxicity.
Details
- Language :
- English
- ISSN :
- 1549-960X
- Volume :
- 54
- Issue :
- 4
- Database :
- MEDLINE
- Journal :
- Journal of chemical information and modeling
- Publication Type :
- Academic Journal
- Accession number :
- 24735213
- Full Text :
- https://doi.org/10.1021/ci5000467