Influence of statistical estimators of mutual information and data heterogeneity on the inference of gene regulatory networks.

Authors :
de Matos Simoes R
Emmert-Streib F
Source :
PloS one [PLoS One] 2011; Vol. 6 (12), pp. e29279. Date of Electronic Publication: 2011 Dec 29.
Publication Year :
2011

Abstract

The inference of gene regulatory networks from gene expression data is a difficult problem because the performance of the inference algorithms depends on a multitude of different factors. In this paper we study two of these. First, we investigate the influence of discrete mutual information (MI) estimators on the global and local network inference performance of the C3NET algorithm. More precisely, we study 4 different MI estimators (Empirical, Miller-Madow, Shrink and Schürmann-Grassberger) in combination with 3 discretization methods (equal frequency, equal width and global equal width discretization). We observe the best global and local inference performance of C3NET for the Miller-Madow estimator with an equal width discretization. Second, our numerical analysis can be considered as a systems approach because we simulate gene expression data from an underlying gene regulatory network, instead of making a distributional assumption to sample thereof. We demonstrate that despite the popularity of the latter approach, which is the traditional way of studying MI estimators, this is in fact not supported by simulated and biological expression data because of their heterogeneity. Hence, our study provides guidance for an efficient design of a simulation study in the context of network inference, supporting a systems approach.
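
As a rough illustration of two of the estimators named in the abstract, the following minimal sketch computes the empirical (plug-in) and Miller-Madow mutual information after an equal width discretization. It is not the authors' implementation: the function names, the use of NumPy, the per-variable binning, and the choice of natural logarithms are all assumptions made for this example. The Miller-Madow correction used here is the standard one, adding (m - 1) / (2N) to each plug-in entropy, where m is the number of non-empty bins and N the number of samples.

```python
import numpy as np

def equal_width_bins(x, n_bins):
    """Assign each value to one of n_bins equally wide intervals (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    edges = np.linspace(x.min(), x.max(), n_bins + 1)
    # Use interior edges only, so labels fall in 0 .. n_bins - 1.
    return np.digitize(x, edges[1:-1])

def entropy(labels, miller_madow=False):
    """Plug-in entropy in nats, optionally with the Miller-Madow bias correction."""
    _, counts = np.unique(labels, return_counts=True)
    n = counts.sum()
    p = counts / n
    h = -np.sum(p * np.log(p))
    if miller_madow:
        # len(counts) is the number of non-empty bins.
        h += (len(counts) - 1) / (2 * n)
    return h

def mutual_information(x, y, n_bins, miller_madow=False):
    """MI(X;Y) = H(X) + H(Y) - H(X,Y) on equal width discretized data."""
    bx = equal_width_bins(x, n_bins)
    by = equal_width_bins(y, n_bins)
    joint = bx * n_bins + by  # unique label per (bx, by) cell
    return (entropy(bx, miller_madow)
            + entropy(by, miller_madow)
            - entropy(joint, miller_madow))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=200)
    y = x + rng.normal(scale=0.5, size=200)  # synthetic, correlated with x
    print("empirical   :", mutual_information(x, y, n_bins=5))
    print("Miller-Madow:", mutual_information(x, y, n_bins=5, miller_madow=True))
```

Because the correction is applied to each of the three entropies separately, the Miller-Madow mutual information can fall below the empirical value, and can even be negative, when the joint distribution occupies many more bins than the marginals.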

Details

Language :
English
ISSN :
1932-6203
Volume :
6
Issue :
12
Database :
MEDLINE
Journal :
PloS one
Publication Type :
Academic Journal
Accession number :
22242113
Full Text :
https://doi.org/10.1371/journal.pone.0029279