Back to Search Start Over

Integrative Clustering Analysis with Application in Multi-Source Gene Expression Data.

Authors :
LIUQING YANG
QING PAN
YUNPENG ZHAO
Source :
Journal of Data Science. Jan2022, Vol. 20 Issue 1, p14-33. 20p.
Publication Year :
2022

Abstract

In omics studies, different sources of information about the same set of genes are often available. When the group structure (e.g., gene pathways) within the genes are of interests, we combine the normal hierarchical model with the stochastic block model, through an integrative clustering framework, to model gene expression and gene networks jointly. The integrative framework provides higher accuracy in extensive simulation studies when one or both of the data sources contain noises or when different data sources provide complementary information. An empirical guideline in the choice between integrative versus separate clustering models is proposed. The integrative clustering method is illustrated on the mouse embryo single cell RNAseq and bulk cell microarray data, which identified not only the gene sets shared by both data sources but also the gene sets unique in one data source. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1680743X
Volume :
20
Issue :
1
Database :
Academic Search Index
Journal :
Journal of Data Science
Publication Type :
Academic Journal
Accession number :
156282949
Full Text :
https://doi.org/10.6339/21-JDS1028