Back to Search Start Over

Unsupervised learning for medical data: A review of probabilistic factorization methods.

Authors :
Neijzen, Dorien
Lunter, Gerton
Source :
Statistics in Medicine. 12/30/2023, Vol. 42 Issue 30, p5541-5554. 14p.
Publication Year :
2023

Abstract

We review popular unsupervised learning methods for the analysis of high‐dimensional data encountered in, for example, genomics, medical imaging, cohort studies, and biobanks. We show that four commonly used methods, principal component analysis, K‐means clustering, nonnegative matrix factorization, and latent Dirichlet allocation, can be written as probabilistic models underpinned by a low‐rank matrix factorization. In addition to highlighting their similarities, this formulation clarifies the various assumptions and restrictions of each approach, which eases identifying the appropriate method for specific applications for applied medical researchers. We also touch upon the most important aspects of inference and model selection for the application of these methods to health data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02776715
Volume :
42
Issue :
30
Database :
Academic Search Index
Journal :
Statistics in Medicine
Publication Type :
Academic Journal
Accession number :
174271225
Full Text :
https://doi.org/10.1002/sim.9924