Back to Search
Start Over
Analysis of Expression Pattern of snoRNAs in Different Cancer Types with Machine Learning Algorithms
- Source :
- International Journal of Molecular Sciences, Volume 20, Issue 9, International Journal of Molecular Sciences, 20(9):2185. Multidisciplinary Digital Publishing Institute (MDPI), International Journal of Molecular Sciences, Vol 20, Iss 9, p 2185 (2019)
- Publication Year :
- 2019
- Publisher :
- Multidisciplinary Digital Publishing Institute (MDPI), 2019.
-
Abstract
- Small nucleolar RNAs (snoRNAs) are a new type of functional small RNAs involved in the chemical modifications of rRNAs, tRNAs, and small nuclear RNAs. It is reported that they play important roles in tumorigenesis via various regulatory modes. snoRNAs can both participate in the regulation of methylation and pseudouridylation and regulate the expression pattern of their host genes. This research investigated the expression pattern of snoRNAs in eight major cancer types in TCGA via several machine learning algorithms. The expression levels of snoRNAs were first analyzed by a powerful feature selection method, Monte Carlo feature selection (MCFS). A feature list and some informative features were accessed. Then, the incremental feature selection (IFS) was applied to the feature list to extract optimal features/snoRNAs, which can make the support vector machine (SVM) yield best performance. The discriminative snoRNAs included HBII-52-14, HBII-336, SNORD123, HBII-85-29, HBII-420, U3, HBI-43, SNORD116, SNORA73B, SCARNA4, HBII-85-20, etc., on which the SVM can provide a Matthew&rsquo<br />s correlation coefficient (MCC) of 0.881 for predicting these eight cancer types. On the other hand, the informative features were fed into the Johnson reducer and repeated incremental pruning to produce error reduction (RIPPER) algorithms to generate classification rules, which can clearly show different snoRNAs expression patterns in different cancer types. The analysis results indicated that extracted discriminative snoRNAs can be important for identifying cancer samples in different types and the expression pattern of snoRNAs in different cancer types can be partly uncovered by quantitative recognition rules.
- Subjects :
- Computer science
Feature selection
snoRNA
Machine learning
computer.software_genre
Monte Carlo feature selection
Article
Catalysis
lcsh:Chemistry
Machine Learning
Inorganic Chemistry
Discriminative model
SDG 3 - Good Health and Well-being
Neoplasms
Feature (machine learning)
Humans
RNA, Small Nucleolar
support vector machine
Pruning (decision trees)
Physical and Theoretical Chemistry
Small nucleolar RNA
lcsh:QH301-705.5
Molecular Biology
Gene
Spectroscopy
RIPPER algorithm
urogenital system
business.industry
Organic Chemistry
General Medicine
Expression (mathematics)
Computer Science Applications
Gene Expression Regulation, Neoplastic
Support vector machine
lcsh:Biology (General)
lcsh:QD1-999
Artificial intelligence
business
Monte Carlo Method
Algorithm
computer
Algorithms
cancer type
Subjects
Details
- ISSN :
- 14220067 and 16616596
- Volume :
- 20
- Issue :
- 9
- Database :
- OpenAIRE
- Journal :
- International Journal of Molecular Sciences
- Accession number :
- edsair.doi.dedup.....e13bc1b0e49b3304fc5076eaf4a9cf29