Back to Search
Start Over
Self-semi-supervised clustering for large scale data with massive null group
- Source :
- Journal of the Korean Statistical Society. 49:161-176
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- In this paper, we propose self-semi-supervised clustering, a new clustering method for large scale data with a massive null group. Self-semi-supervised clustering is a two-stage procedure: preselect a part of “null” group from the data in the first stage and apply semi-supervised clustering to the rest of the data in the second stage, allowing them to be assigned to the null group. We evaluate the performance of the proposed method using a simulation study and demonstrate the method in the analysis of time course gene expression data from a longitudinal study of Influenza A virus infection.
- Subjects :
- Statistics and Probability
Group (mathematics)
business.industry
05 social sciences
Null (mathematics)
Pattern recognition
Large scale data
Bayesian inference
01 natural sciences
010104 statistics & probability
0502 economics and business
Time course
Artificial intelligence
0101 mathematics
business
Cluster analysis
050205 econometrics
Mathematics
Semi supervised clustering
Subjects
Details
- ISSN :
- 20052863 and 12263192
- Volume :
- 49
- Database :
- OpenAIRE
- Journal :
- Journal of the Korean Statistical Society
- Accession number :
- edsair.doi...........d545b3b9a3d8ed87621fd6074e147a8b