Back to Search
Start Over
m5Cpred-XS: A New Method for Predicting RNA m5C Sites Based on XGBoost and SHAP
- Source :
- Frontiers in Genetics, Vol 13 (2022)
- Publication Year :
- 2022
- Publisher :
- Frontiers Media S.A., 2022.
-
Abstract
- As one of the most important post-transcriptional modifications of RNA, 5-cytosine-methylation (m5C) is reported to closely relate to many chemical reactions and biological functions in cells. Recently, several computational methods have been proposed for identifying m5C sites. However, the accuracy and efficiency are still not satisfactory. In this study, we proposed a new method, m5Cpred-XS, for predicting m5C sites of H. sapiens, M. musculus, and A. thaliana. First, the powerful SHAP method was used to select the optimal feature subset from seven different kinds of sequence-based features. Second, different machine learning algorithms were used to train the models. The results of five-fold cross-validation indicate that the model based on XGBoost achieved the highest prediction accuracy. Finally, our model was compared with other state-of-the-art models, which indicates that m5Cpred-XS is superior to other methods. Moreover, we deployed the model on a web server that can be accessed through http://m5cpred-xs.zhulab.org.cn/, and m5Cpred-XS is expected to be a useful tool for studying m5C sites.
Details
- Language :
- English
- ISSN :
- 16648021
- Volume :
- 13
- Database :
- Directory of Open Access Journals
- Journal :
- Frontiers in Genetics
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.87cdee2bcb734aeea959fc78d305e05a
- Document Type :
- article
- Full Text :
- https://doi.org/10.3389/fgene.2022.853258