Back to Search Start Over

Evaluating the performance of random forest and iterative random forest based methods when applied to gene expression data.

Authors :
Walker AM
Cliff A
Romero J
Shah MB
Jones P
Felipe Machado Gazolla JG
Jacobson DA
Kainer D
Source :
Computational and structural biotechnology journal [Comput Struct Biotechnol J] 2022 Jun 22; Vol. 20, pp. 3372-3386. Date of Electronic Publication: 2022 Jun 22 (Print Publication: 2022).
Publication Year :
2022

Abstract

Gene-to-gene networks, such as Gene Regulatory Networks (GRN) and Predictive Expression Networks (PEN) capture relationships between genes and are beneficial for use in downstream biological analyses. There exists multiple network inference tools to produce these gene-to-gene networks from matrices of gene expression data. Random Forest-Leave One Out Prediction (RF-LOOP) is a method that has been shown to be efficient at producing these gene-to-gene networks, frequently known as GEne Network Inference with Ensemble of trees (GENIE3). Random Forest can be replaced in this process by iterative Random Forest (iRF), which performs variable selection and boosting. Here we validate that iterative Random Forest-Leave One Out Prediction (iRF-LOOP) produces higher quality networks than GENIE3 (RF-LOOP). We use both synthetic and empirical networks from the Dialogue for Reverse Engineering Assessment and Methods (DREAM) Challenges by Sage Bionetworks, as well as two additional empirical networks created from Arabidopsis thaliana and Populus trichocarpa expression data.<br />Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.<br /> (© 2022 The Author(s).)

Details

Language :
English
ISSN :
2001-0370
Volume :
20
Database :
MEDLINE
Journal :
Computational and structural biotechnology journal
Publication Type :
Academic Journal
Accession number :
35832622
Full Text :
https://doi.org/10.1016/j.csbj.2022.06.037