Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Miklos Z. Racz

Exact Community Recovery in Correlated Stochastic Block Models

Mar 29, 2022

Julia Gaudio, Miklos Z. Racz, Anirudh Sridhar

Figure 1 for Exact Community Recovery in Correlated Stochastic Block Models

Figure 2 for Exact Community Recovery in Correlated Stochastic Block Models

Figure 3 for Exact Community Recovery in Correlated Stochastic Block Models

Figure 4 for Exact Community Recovery in Correlated Stochastic Block Models

Abstract:We consider the problem of learning latent community structure from multiple correlated networks. We study edge-correlated stochastic block models with two balanced communities, focusing on the regime where the average degree is logarithmic in the number of vertices. Our main result derives the precise information-theoretic threshold for exact community recovery using multiple correlated graphs. This threshold captures the interplay between the community recovery and graph matching tasks. In particular, we uncover and characterize a region of the parameter space where exact community recovery is possible using multiple correlated graphs, even though (1) this is information-theoretically impossible using a single graph and (2) exact graph matching is also information-theoretically impossible. In this regime, we develop a novel algorithm that carefully synthesizes algorithms from the community recovery and graph matching literatures.

* 54 pages, 6 figures

Via

Access Paper or Ask Questions

Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities

Jul 14, 2021

Miklos Z. Racz, Anirudh Sridhar

Figure 1 for Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities

Figure 2 for Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities

Figure 3 for Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities

Figure 4 for Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities

Abstract:We consider the task of learning latent community structure from multiple correlated networks. First, we study the problem of learning the latent vertex correspondence between two edge-correlated stochastic block models, focusing on the regime where the average degree is logarithmic in the number of vertices. We derive the precise information-theoretic threshold for exact recovery: above the threshold there exists an estimator that outputs the true correspondence with probability close to 1, while below it no estimator can recover the true correspondence with probability bounded away from 0. As an application of our results, we show how one can exactly recover the latent communities using multiple correlated graphs in parameter regimes where it is information-theoretically impossible to do so using just a single graph.

* 42 pages, 4 figures

Via

Access Paper or Ask Questions

Approximate Trace Reconstruction

Dec 16, 2020

Sami Davies, Miklos Z. Racz, Cyrus Rashtchian, Benjamin G. Schiffer

Figure 1 for Approximate Trace Reconstruction

Abstract:In the usual trace reconstruction problem, the goal is to exactly reconstruct an unknown string of length $n$ after it passes through a deletion channel many times independently, producing a set of traces (i.e., random subsequences of the string). We consider the relaxed problem of approximate reconstruction. Here, the goal is to output a string that is close to the original one in edit distance while using much fewer traces than is needed for exact reconstruction. We present several algorithms that can approximately reconstruct strings that belong to certain classes, where the estimate is within $n/\mathrm{polylog}(n)$ edit distance, and where we only use $\mathrm{polylog}(n)$ traces (or sometimes just a single trace). These classes contain strings that require a linear number of traces for exact reconstruction and which are quite different from a typical random string. From a technical point of view, our algorithms approximately reconstruct consecutive substrings of the unknown string by aligning dense regions of traces and using a run of a suitable length to approximate each region. To complement our algorithms, we present a general black-box lower bound for approximate reconstruction, building on a lower bound for distinguishing between two candidate input strings in the worst case. In particular, this shows that approximating to within $n^{1/3 - \delta}$ edit distance requires $n^{1 + 3\delta/2}/\mathrm{polylog}(n)$ traces for $0< \delta < 1/3$ in the worst case.

Via

Access Paper or Ask Questions