1. Proof of the Achievability Conjectures for the General Stochastic Block Model
- Author
-
Emmanuel Abbe and Colin Sandon
- Subjects
Discrete mathematics ,Conjecture ,Applied Mathematics ,General Mathematics ,020206 networking & telecommunications ,02 engineering and technology ,Belief propagation ,01 natural sciences ,Algebra ,010104 statistics & probability ,Operator (computer programming) ,Power iteration ,Stochastic block model ,0202 electrical engineering, electronic engineering, information engineering ,Order (group theory) ,0101 mathematics ,Cluster analysis ,Time complexity ,Mathematics - Abstract
In a paper that initiated the modern study of the stochastic block model (SBM), Decelle, Krzakala, Moore, and Zdeborova, backed by Mossel, Neeman, and Sly, conjectured that detecting clusters in the symmetric SBM in polynomial time is always possible above the Kesten-Stigum (KS) threshold, while it is possible to detect clusters information theoretically (i.e., not necessarily in polynomial time) below the KS threshold when the number of clusters k is at least 4. Massoulie, Mossel et al., and Bordenave, Lelarge, and Massoulie proved that the KS threshold is in fact efficiently achievable for k = 2, while Mossel et al. proved that it cannot be crossed at k = 2. The above conjecture remained open for k ≥ 3. This paper proves the two parts of the conjecture, further extending the results to general SBMs. For the efficient part, an approximate acyclic belief propagation (ABP) algorithm is developed and proved to detect communities for any k down to the KS threshold in quasi-linear time. Achieving this requires showing optimality of ABP in the presence of cycles, a challenge for message-passing algorithms. The paper further connects ABP to a power iteration method on a nonbacktracking operator of generalized order, formalizing the interplay between message passing and spectral methods. For the information-theoretic part, a nonefficient algorithm sampling a typical clustering is shown to break down the KS threshold at k = 4. The emerging gap is shown to be large in some cases, making the SBM a good case study for information-computation gaps. © 2017 Wiley Periodicals, Inc.
- Published
- 2017