Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment
- Source :
- IEEE/ACM Transactions on Computational Biology and Bioinformatics. 18:2045-2056
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
Abstract
- Glycosylation of proteins in eukaryotic cells is an important and complicated post-translational modification because of its pivotal role in, and association with, crucial physiological functions of most proteins. Identifying glycosylation sites in a polypeptide chain is difficult because of multiple impediments, and analytical identification of these sites is expensive and laborious. There is therefore a pressing need for a reliable computational method that determines such sites precisely and saves researchers time and effort. Herein, we propose a novel predictor, iGlycoS-PseAAC, which integrates Chou's Pseudo Amino Acid Composition (PseAAC) with relative/absolute position-based features. In the self-consistency test on the benchmark dataset, the model predicts O-linked glycosylation at serine sites with an accuracy of 98.8 percent. The overall accuracy under 10-fold cross-validation, combining the positive and negative results, is 97.2 percent. The overall accuracy under the jackknife test, aggregating all the prediction results, is 96.195 percent. The proposed predictor can thus help predict O-linked glycosylated serine sites efficiently and accurately. Overall, the results show that the accuracy of iGlycoS-PseAAC is higher than that of the existing tools.
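The abstract reports accuracy under three evaluation protocols: self-consistency, 10-fold cross-validation, and the jackknife test. As a rough illustration of how these protocols differ in what they train on and what they test on, the sketch below builds the index splits for each. It is not the authors' code and uses no data from the paper. All function names here (`self_consistency_split`, `k_fold_splits`, `jackknife_splits`, `overall_accuracy`, `majority_learner`) are hypothetical, the jackknife test is taken in its usual sense of leave-one-out validation, and folds are assigned by simple striding rather than by shuffling or stratifying.

```python
from typing import Callable, List, Sequence, Tuple

Split = Tuple[List[int], List[int]]  # (training indices, test indices)


def self_consistency_split(n: int) -> List[Split]:
    """Train and test on the same n samples (resubstitution)."""
    idx = list(range(n))
    return [(idx, idx)]


def k_fold_splits(n: int, k: int = 10) -> List[Split]:
    """Partition n samples into k disjoint test folds; train on the rest."""
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= n")
    splits = []
    for f in range(k):
        test = list(range(f, n, k))  # every k-th sample, starting at offset f
        held_out = set(test)
        train = [i for i in range(n) if i not in held_out]
        splits.append((train, test))
    return splits


def jackknife_splits(n: int) -> List[Split]:
    """Leave-one-out: each sample forms the test set exactly once."""
    return k_fold_splits(n, k=n)


def overall_accuracy(
    splits: Sequence[Split],
    labels: Sequence[int],
    fit: Callable[[List[int]], Callable[[int], int]],
) -> float:
    """Pool predictions from every split and return the fraction correct."""
    correct = total = 0
    for train, test in splits:
        predict = fit(train)  # fit on training indices only
        for i in test:
            correct += int(predict(i) == labels[i])
            total += 1
    return correct / total


def majority_learner(labels: Sequence[int]):
    """Toy stand-in for a real classifier: always predicts the majority label."""
    def fit(train: List[int]) -> Callable[[int], int]:
        ones = sum(labels[i] for i in train)
        guess = 1 if 2 * ones >= len(train) else 0
        return lambda i: guess
    return fit
```

To try it, pass a list of 0/1 labels to `majority_learner` and hand the resulting `fit` function to `overall_accuracy` together with one of the three split generators. Because self-consistency tests on the training data itself, it tends to give the most optimistic figure, which is consistent with the ordering of the three accuracies reported in the abstract.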
- Subjects :
- Glycosylation
Computer science
Applied Mathematics
0206 medical engineering
Computational Biology
02 engineering and technology
Computational biology
Cross-validation
Moment (mathematics)
Serine
chemistry.chemical_compound
Identification (information)
chemistry
Position (vector)
Genetics
Benchmark (computing)
Protein Processing, Post-Translational
Pseudo amino acid composition
Algorithms
020602 bioinformatics
Glycoproteins
Biotechnology
Details
- ISSN :
- 2374-0043 and 1545-5963
- Volume :
- 18
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Computational Biology and Bioinformatics
- Accession number :
- edsair.doi.dedup.....005857111d701646c76c165a225481e2