Back to Search
Start Over
Software Benchmark—Classification Tree Algorithms for Cell Atlases Annotation Using Single-Cell RNA-Sequencing Data
- Source :
- Microbiology Research, Vol 12, Iss 22, Pp 317-334 (2021), Microbiology Research, Volume 12, Issue 2, Pages 22-334
- Publication Year :
- 2021
- Publisher :
- MDPI AG, 2021.
-
Abstract
- Classification tree is a widely used machine learning method. It has multiple implementations as R packages<br />rpart, ctree, evtree, tree and C5.0. The details of these implementations are not the same, and hence their performances differ from one application to another. We are interested in their performance in the classification of cells using the single-cell RNA-Sequencing data. In this paper, we conducted a benchmark study using 22 Single-Cell RNA-sequencing data sets. Using cross-validation, we compare packages’ prediction performances based on their Precision, Recall, F1-score, Area Under the Curve (AUC). We also compared the Complexity and Run-time of these R packages. Our study shows that rpart and evtree have the best Precision<br />evtree is the best in Recall, F1-score and AUC<br />C5.0 prefers more complex trees<br />tree is consistently much faster than others, although its complexity is often higher than others.
- Subjects :
- Microbiology (medical)
Sequencing data
recall
Biology
single-cell RNA-Sequencing
Machine learning
computer.software_genre
run-time
Microbiology
03 medical and health sciences
Annotation
benchmark
0302 clinical medicine
Software
Molecular Biology
classification tree
030304 developmental biology
0303 health sciences
F1-score
business.industry
Decision tree learning
QR1-502
Area Under the Curve
Tree (data structure)
Benchmark (computing)
precision
Artificial intelligence
complexity
business
F1 score
computer
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 20367481
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- Microbiology Research
- Accession number :
- edsair.doi.dedup.....3049a37e77d0089a734b8b6874ca43b7