Back to Search Start Over

Evaluation of Gene Expression Classification Studies: Factors Associated with Classification Performance.

Authors :
Novianti, Putri W.
Roes, Kit C. B.
Eijkemans, Marinus J. C.
Source :
PLoS ONE; Apr2014, Vol. 9 Issue 4, p1-9, 9p
Publication Year :
2014

Abstract

Classification methods used in microarray studies for gene expression are diverse in the way they deal with the underlying complexity of the data, as well as in the technique used to build the classification model. The MAQC II study on cancer classification problems has found that performance was affected by factors such as the classification algorithm, cross validation method, number of genes, and gene selection method. In this paper, we study the hypothesis that the disease under study significantly determines which method is optimal, and that additionally sample size, class imbalance, type of medical question (diagnostic, prognostic or treatment response), and microarray platform are potentially influential. A systematic literature review was used to extract the information from 48 published articles on non-cancer microarray classification studies. The impact of the various factors on the reported classification accuracy was analyzed through random-intercept logistic regression. The type of medical question and method of cross validation dominated the explained variation in accuracy among studies, followed by disease category and microarray platform. In total, 42% of the between study variation was explained by all the study specific and problem specific factors that we studied together. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
19326203
Volume :
9
Issue :
4
Database :
Complementary Index
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
95820330
Full Text :
https://doi.org/10.1371/journal.pone.0096063