1. Estimation of low-rank matrices via approximate message passing
- Author
-
Andrea Montanari and Ramji Venkataramanan
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Polynomial ,Rank (linear algebra) ,Machine Learning (stat.ML) ,Mathematics - Statistics Theory ,Statistics Theory (math.ST) ,02 engineering and technology ,01 natural sciences ,010104 statistics & probability ,Matrix (mathematics) ,symbols.namesake ,Statistics - Machine Learning ,FOS: Mathematics ,0202 electrical engineering, electronic engineering, information engineering ,approximate message passing ,0101 mathematics ,Eigenvalues and eigenvectors ,Mathematics ,Low-rank matrix estimation ,62E20 ,spectral initialization ,Estimator ,020206 networking & telecommunications ,Gaussian noise ,Outlier ,symbols ,Statistics, Probability and Uncertainty ,62F15 ,Algorithm ,Random matrix ,62H99 - Abstract
Consider the problem of estimating a low-rank matrix when its entries are perturbed by Gaussian noise. If the empirical distribution of the entries of the spikes is known, optimal estimators that exploit this knowledge can substantially outperform simple spectral approaches. Recent work characterizes the asymptotic accuracy of Bayes-optimal estimators in the high-dimensional limit. In this paper we present a practical algorithm that can achieve Bayes-optimal accuracy above the spectral threshold. A bold conjecture from statistical physics posits that no polynomial-time algorithm achieves optimal error below the same threshold (unless the best estimator is trivial). Our approach uses Approximate Message Passing (AMP) in conjunction with a spectral initialization. AMP algorithms have proved successful in a variety of statistical estimation tasks, and are amenable to exact asymptotic analysis via state evolution. Unfortunately, state evolution is uninformative when the algorithm is initialized near an unstable fixed point, as often happens in low-rank matrix estimation. We develop a new analysis of AMP that allows for spectral initializations. Our main theorem is general and applies beyond matrix estimation. However, we use it to derive detailed predictions for the problem of estimating a rank-one matrix in noise. Special cases of this problem are closely related---via universality arguments---to the network community detection problem for two asymmetric communities. For general rank-one models, we show that AMP can be used to construct confidence intervals and control false discovery rate. We provide illustrations of the general methodology by considering the cases of sparse low-rank matrices and of block-constant low-rank matrices with symmetric blocks (we refer to the latter as to the `Gaussian Block Model')., 76 pages, 6 pdf figures; Version 4 expands the introductory material and the applications to statistical inference
- Published
- 2021