Back to Search Start Over

In silico log P prediction for a large data set with support vector machines, radial basis neural networks and multiple linear regression

Authors :
Hai-Feng Chen
Source :
Chemical biologydrug design. 74(2)
Publication Year :
2009

Abstract

Oil/water partition coefficient (log P) is one of the key points for lead compound to be drug. In silico log P models based solely on chemical structures have become an important part of modern drug discovery. Here, we report support vector machines, radial basis function neural networks, and multiple linear regression methods to investigate the correlation between partition coefficient and physico-chemical descriptors for a large data set of compounds. The correlation coefficient r(2) between experimental and predicted log P for training and test sets by support vector machines, radial basis function neural networks, and multiple linear regression is 0.92, 0.90, and 0.88, respectively. The results show that non-linear support vector machines derives statistical models that have better prediction ability than those of radial basis function neural networks and multiple linear regression methods. This indicates that support vector machines can be used as an alternative modeling tool for quantitative structure-property/activity relationships studies.

Details

ISSN :
17470285
Volume :
74
Issue :
2
Database :
OpenAIRE
Journal :
Chemical biologydrug design
Accession number :
edsair.doi.dedup.....bba743136ef4a53f4cefd19a354f3230