Back to Search Start Over

Partitioning gene expression data by data-driven Markov chain Monte Carlo.

Authors :
Saraiva, E.F.
Suzuki, A.K.
Louzada, F.
Milan, L.A.
Source :
Journal of Applied Statistics; May2016, Vol. 43 Issue 6, p1155-1173, 19p
Publication Year :
2016

Abstract

In this paper we introduce a Bayesian mixture model with an unknown number of components for partitioning gene expression data. Inferences about all the unknown parameters involved are made by using the proposed data-driven Markov chain Monte Carlo. This algorithm is essentially a Metropolis–Hastings within Gibbs sampling. The Metropolis–Hastings is performed to change the number of partitions k in the neighborhood and using a pair of split-merge moves. Our strategy for splitting is based on data in which allocation probabilities are calculated based on marginal likelihood function from the previously allocated observations. Conditional on k, the partitions labels are updated via Gibbs sampling. The two main advantages of the proposed algorithm is that it is easy to be implemented and the acceptance probability for split-merge movements depends only on the observed data. We examine the performance of the proposed algorithm on simulated data and then analyze two publicly available gene expression data sets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02664763
Volume :
43
Issue :
6
Database :
Complementary Index
Journal :
Journal of Applied Statistics
Publication Type :
Academic Journal
Accession number :
113083893
Full Text :
https://doi.org/10.1080/02664763.2015.1092113