Back to Search
Start Over
Optimal Recovery of Missing Values for Non-Negative Matrix Factorization
- Source :
- IEEE Open Journal of Signal Processing, Vol 2, Pp 207-216 (2021)
- Publication Year :
- 2021
- Publisher :
- IEEE, 2021.
-
Abstract
- Missing values imputation is often evaluated on some similarity measure between actual and imputed data. However, it may be more meaningful to evaluate downstream algorithm performance after imputation than the imputation itself. We describe a straightforward unsupervised imputation algorithm, a minimax approach based on optimal recovery, and derive probabilistic error bounds on downstream non-negative matrix factorization (NMF). Under certain geometric conditions, we prove upper bounds on NMF relative error, which is the first bound of this type for missing values. We also give probabilistic bounds for the same geometric assumptions. Experiments on image data and biological data show that this theoretically-grounded technique performs as well as or better than other imputation techniques that account for local structure. We also comment on imputation fairness.
Details
- Language :
- English
- ISSN :
- 26441322
- Volume :
- 2
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Open Journal of Signal Processing
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.830f0d6deb46b69b721e4c69ef302f
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/OJSP.2021.3069373