Automated information extraction and structure-activity relationship analysis of cytochrome P450 substrates
- Source :
- Journal of chemical information and modeling. 51(2)
- Publication Year :
- 2011
Abstract
- Information on CYP-chemical interactions was comprehensively explored by a text-mining technique to confirm our previous structure-activity relationship model for CYP substrates (Yamashita et al. J. Chem. Inf. Model. 2008, 48, 364-369). The text-mining technique is based on natural language processing and extracts chemical names and their interaction patterns according to sentence context. After chemicals were automatically extracted and classified as CYP substrates, inhibitors, or inducers, 709 substrates were retrieved from the PubChem database and categorized as 216, 145, 136, 217, 156, and 379 substrates for CYP1A2, CYP2C9, CYP2C19, CYP2D6, CYP2E1, and CYP3A4, respectively. Although the previous classification model was developed using data from only 161 compounds, it classified the substrates found by text-mining analysis with reasonable accuracy. This confirmed the validity of both the multi-objective classification model for CYP substrates and the text-mining procedure.
- Subjects :
- CYP2D6
Databases, Factual
Computer science
General Chemical Engineering
Speech recognition
Chemical nomenclature
Context (language use)
Computational biology
Library and Information Sciences
Data modeling
Automation
Structure-Activity Relationship
Cytochrome P-450 Enzyme System
Cytochrome P-450 Enzyme Inhibitors
Data Mining
Humans
Enzyme Inhibitors
biology
Decision Trees
CYP1A2
Cytochrome P450
Reproducibility of Results
General Chemistry
Computer Science Applications
Isoenzymes
Information extraction
Enzyme Induction
PubChem
Details
- ISSN :
- 1549-960X
- Volume :
- 51
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- Journal of chemical information and modeling
- Accession number :
- edsair.doi.dedup.....d9829d2887bb260d05c3724d22bdd0f8