Back to Search
Start Over
C10Pred: A First Machine Learning Based Tool to Predict C10 Family Cysteine Peptidases Using Sequence-Derived Features.
- Source :
-
International journal of molecular sciences [Int J Mol Sci] 2022 Aug 23; Vol. 23 (17). Date of Electronic Publication: 2022 Aug 23. - Publication Year :
- 2022
-
Abstract
- Streptococcus pyogenes , or group A Streptococcus (GAS), a gram-positive bacterium, is implicated in a wide range of clinical manifestations and life-threatening diseases. One of the key virulence factors of GAS is streptopain, a C10 family cysteine peptidase. Since its discovery, various homologs of streptopain have been reported from other bacterial species. With the increased affordability of sequencing, a significant increase in the number of potential C10 family-like sequences in the public databases is anticipated, posing a challenge in classifying such sequences. Sequence-similarity-based tools are the methods of choice to identify such streptopain-like sequences. However, these methods depend on some level of sequence similarity between the existing C10 family and the target sequences. Therefore, in this work, we propose a novel predictor, C10Pred, for the prediction of C10 peptidases using sequence-derived optimal features. C10Pred is a support vector machine (SVM) based model which is efficient in predicting C10 enzymes with an overall accuracy of 92.7% and Matthews' correlation coefficient (MCC) value of 0.855 when tested on an independent dataset. We anticipate that C10Pred will serve as a handy tool to classify novel streptopain-like proteins belonging to the C10 family and offer essential information.
- Subjects :
- Machine Learning
Proteins
Support Vector Machine
Cysteine
Cysteine Proteases
Subjects
Details
- Language :
- English
- ISSN :
- 1422-0067
- Volume :
- 23
- Issue :
- 17
- Database :
- MEDLINE
- Journal :
- International journal of molecular sciences
- Publication Type :
- Academic Journal
- Accession number :
- 36076915
- Full Text :
- https://doi.org/10.3390/ijms23179518