Back to Search Start Over

Self-semi-supervised clustering for large scale data with massive null group

Authors :
Soohyun Ahn
Hyungwon Choi
Johan Lim
Kyeong Eun Lee
Source :
Journal of the Korean Statistical Society. 49:161-176
Publication Year :
2020
Publisher :
Springer Science and Business Media LLC, 2020.

Abstract

In this paper, we propose self-semi-supervised clustering, a new clustering method for large scale data with a massive null group. Self-semi-supervised clustering is a two-stage procedure: preselect a part of “null” group from the data in the first stage and apply semi-supervised clustering to the rest of the data in the second stage, allowing them to be assigned to the null group. We evaluate the performance of the proposed method using a simulation study and demonstrate the method in the analysis of time course gene expression data from a longitudinal study of Influenza A virus infection.

Details

ISSN :
20052863 and 12263192
Volume :
49
Database :
OpenAIRE
Journal :
Journal of the Korean Statistical Society
Accession number :
edsair.doi...........d545b3b9a3d8ed87621fd6074e147a8b