Back to Search Start Over

RFCell: A Gene Selection Approach for scRNA-seq Clustering Based on Permutation and Random Forest.

Authors :
Zhao Y
Fang ZY
Lin CX
Deng C
Xu YP
Li HD
Source :
Frontiers in genetics [Front Genet] 2021 Jul 27; Vol. 12, pp. 665843. Date of Electronic Publication: 2021 Jul 27 (Print Publication: 2021).
Publication Year :
2021

Abstract

In recent years, the application of single cell RNA-seq (scRNA-seq) has become more and more popular in fields such as biology and medical research. Analyzing scRNA-seq data can discover complex cell populations and infer single-cell trajectories in cell development. Clustering is one of the most important methods to analyze scRNA-seq data. In this paper, we focus on improving scRNA-seq clustering through gene selection, which also reduces the dimensionality of scRNA-seq data. Studies have shown that gene selection for scRNA-seq data can improve clustering accuracy. Therefore, it is important to select genes with cell type specificity. Gene selection not only helps to reduce the dimensionality of scRNA-seq data, but also can improve cell type identification in combination with clustering methods. Here, we proposed RFCell, a supervised gene selection method, which is based on permutation and random forest classification. We first use RFCell and three existing gene selection methods to select gene sets on 10 scRNA-seq data sets. Then, three classical clustering algorithms are used to cluster the cells obtained by these gene selection methods. We found that the gene selection performance of RFCell was better than other gene selection methods.<br />Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.<br /> (Copyright © 2021 Zhao, Fang, Lin, Deng, Xu and Li.)

Details

Language :
English
ISSN :
1664-8021
Volume :
12
Database :
MEDLINE
Journal :
Frontiers in genetics
Publication Type :
Academic Journal
Accession number :
34386033
Full Text :
https://doi.org/10.3389/fgene.2021.665843