1. Predict collagen hydroxyproline sites using support vector machines.
- Author
-
Yang ZR
- Subjects
- Algorithms, Amino Acid Sequence, Artificial Intelligence, Mathematics, Molecular Sequence Data, Peptides chemistry, Peptides genetics, ROC Curve, Reproducibility of Results, Sensitivity and Specificity, Sequence Analysis, Protein, Collagen chemistry, Hydroxyproline chemistry, Models, Chemical
- Abstract
Collagen hydroxyproline is an important posttranslational modification activity because of its close relationship with various diseases and signaling activities. However, there is no study to date for constructing models for predicting collagen hydroxyproline sites. Support vector machines with two kernel functions (the identity kernel function and the bio-kernel function) have been used for constructing models for predicting collagen hydroxyproline sites in this study. The models are constructed based on 37 sequences collected from NCBI. Peptide data are generated using a sliding window with various sizes to scan the sequences. Fivefold cross-validation is used for model evaluation. The best model has specificity of 70% and sensitivity of 90%.
- Published
- 2009
- Full Text
- View/download PDF