Back to Search Start Over

An Improved Fuzzy Based Missing Value Estimation in DNA Microarray Validated by Gene Ranking

Authors :
Sujay Saha
Anupam Ghosh
Dibyendu Bikash Seal
Kashi Nath Dey
Source :
Advances in Fuzzy Systems, Vol 2016 (2016)
Publication Year :
2016
Publisher :
Hindawi Limited, 2016.

Abstract

Most of the gene expression data analysis algorithms require the entire gene expression matrix without any missing values. Hence, it is necessary to devise methods which would impute missing data values accurately. There exist a number of imputation algorithms to estimate those missing values. This work starts with a microarray dataset containing multiple missing values. We first apply the modified version of the fuzzy theory based existing method LRFDVImpute to impute multiple missing values of time series gene expression data and then validate the result of imputation by genetic algorithm (GA) based gene ranking methodology along with some regular statistical validation techniques, like RMSE method. Gene ranking, as far as our knowledge, has not been used yet to validate the result of missing value estimation. Firstly, the proposed method has been tested on the very popular Spellman dataset and results show that error margins have been drastically reduced compared to some previous works, which indirectly validates the statistical significance of the proposed method. Then it has been applied on four other 2-class benchmark datasets, like Colorectal Cancer tumours dataset (GDS4382), Breast Cancer dataset (GSE349-350), Prostate Cancer dataset, and DLBCL-FL (Leukaemia) for both missing value estimation and ranking the genes, and the results show that the proposed method can reach 100% classification accuracy with very few dominant genes, which indirectly validates the biological significance of the proposed method.

Details

Language :
English
ISSN :
16877101 and 1687711X
Volume :
2016
Database :
Directory of Open Access Journals
Journal :
Advances in Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
edsdoj.957ded0bd19047d3af4f645364dcb82f
Document Type :
article
Full Text :
https://doi.org/10.1155/2016/6134736