Abstract:Spectral inference on multiple networks is a rapidly developing subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent network realizations can deliver more accurate estimation than individual spectral decompositions of those same networks. Little attention has been paid, however, to the network correlation that such joint embedding procedures necessarily induce. In this paper, we present a detailed analysis of induced correlation in a {\em generalized omnibus} embedding for multiple networks. We show that our embedding procedure is flexible and robust, and, moreover, we prove a central limit theorem for this embedding and explicitly compute the limiting covariance. We examine how this covariance can impact inference in a network time series, and we construct an appropriately calibrated omnibus embedding that can detect changes in real biological networks that previous embedding procedures could not discern. Our analysis confirms that the effect of induced correlation can be both subtle and transformative, with import in theory and practice.
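A minimal sketch of the classical omnibus construction that the generalized framework above builds on: m symmetric adjacency matrices on a shared vertex set are stacked into an mn-by-mn block matrix whose (i, j) block is the average of the i-th and j-th networks, and the joint embedding is read off the top scaled eigenvectors. The code below is illustrative only (names and defaults are ours, not the authors'), but it makes plain why the m per-network embeddings are correlated: every block of the omnibus matrix shares entries with every other block.

import numpy as np

def omnibus_embedding(adjacencies, d):
    """Classical omnibus embedding sketch: form the mn-x-mn matrix with
    (i, j) block (A_i + A_j) / 2 and return the top-d scaled eigenvectors,
    reshaped into one n-x-d embedding per network."""
    m = len(adjacencies)
    n = adjacencies[0].shape[0]
    omni = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(m):
            omni[i*n:(i+1)*n, j*n:(j+1)*n] = (adjacencies[i] + adjacencies[j]) / 2
    vals, vecs = np.linalg.eigh(omni)
    top = np.argsort(np.abs(vals))[::-1][:d]            # d largest-magnitude eigenvalues
    Xhat = vecs[:, top] * np.sqrt(np.abs(vals[top]))    # scaled eigenvectors
    return Xhat.reshape(m, n, d)                        # one n-x-d embedding per network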
Abstract:Both observed and unobserved vertex heterogeneity can influence block structure in graphs. To assess these effects on block recovery, we present a comparative analysis of two model-based spectral algorithms for clustering vertices in stochastic blockmodel graphs with vertex covariates. The first algorithm directly estimates the induced block assignments by investigating the estimated block connectivity probability matrix including the vertex covariate effect. The second algorithm estimates the vertex covariate effect and then estimates the induced block assignments after accounting for this effect. We employ Chernoff information to analytically compare the algorithms' performance and derive the Chernoff ratio formula for some special models of interest. Analytic results and simulations suggest that, in general, the second algorithm is preferred: we can better estimate the induced block assignments by first estimating the vertex covariate effect. In addition, real data experiments on a diffusion MRI connectome data set indicate that the second algorithm has the advantages of revealing underlying block structure and taking observed vertex heterogeneity into account in real applications. Our findings emphasize the importance of distinguishing between observed and unobserved factors that can affect block structure in graphs.
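For reference, the Chernoff information between two multivariate Gaussians $F_1=\mathcal{N}(\mu_1,\Sigma_1)$ and $F_2=\mathcal{N}(\mu_2,\Sigma_2)$ (the kind of distributional limit that typically arises for spectral embeddings of blockmodels) has the standard closed form
\[
C(F_1,F_2)=\sup_{t\in(0,1)}\left[\frac{t(1-t)}{2}\,(\mu_1-\mu_2)^\top\Sigma_t^{-1}(\mu_1-\mu_2)+\frac{1}{2}\log\frac{|\Sigma_t|}{|\Sigma_1|^{1-t}\,|\Sigma_2|^{t}}\right],
\qquad \Sigma_t=(1-t)\Sigma_1+t\Sigma_2,
\]
and the Chernoff ratio referred to above compares such quantities under the two algorithms, with larger values indicating easier block recovery.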
Abstract:Learning to rank -- producing a ranked list of items specific to a query and with respect to a set of supervisory items -- is a problem of general interest. The setting we consider is one in which no analytic description of what constitutes a good ranking is available. Instead, we have a collection of representations and supervisory information consisting of a (target item, interesting items set) pair. We demonstrate -- analytically, in simulation, and in real data examples -- that learning to rank via combining representations using an integer linear program is effective when the supervision is as light as "these few items are similar to your item of interest." While this nomination task is of general interest, for specificity we present our methodology from the perspective of vertex nomination in graphs. The methodology described herein is model agnostic.
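As a rough illustration of the combining idea, and not the integer linear program actually solved in the paper, the sketch below mixes per-representation similarity scores with convex weights chosen by grid search so that a handful of known "interesting" items land as high as possible in the final ranked list. All names here are hypothetical.

import itertools
import numpy as np

def combine_and_rank(score_lists, interesting_idx, grid=11):
    """score_lists: one length-n similarity array per representation (larger =
    more similar to the item of interest); interesting_idx: indices of the few
    supervised 'interesting' items. Chooses convex weights minimizing the summed
    ranks of the supervised items (a grid-search stand-in for an ILP)."""
    scores = np.vstack(score_lists)                 # shape (num_reps, n)
    best = None
    for w in itertools.product(np.linspace(0, 1, grid), repeat=scores.shape[0]):
        if not np.isclose(sum(w), 1.0):             # keep convex combinations only
            continue
        combined = np.asarray(w) @ scores
        order = np.argsort(-combined)               # descending: best-ranked first
        rank_of = {idx: r for r, idx in enumerate(order)}
        loss = sum(rank_of[i] for i in interesting_idx)
        if best is None or loss < best[0]:
            best = (loss, w, order)
    return best[1], best[2]                         # chosen weights, ranked item list

The point of the sketch is only that very light supervision, a few items known to be similar to the item of interest, already pins down a useful combination of representations.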
Abstract:Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical results provide new understanding of the problem and solutions, and lead us to a 'Two Truths' LSE vs. ASE spectral graph clustering phenomenon convincingly illustrated here via a diffusion MRI connectome data set: the different embedding methods yield different clustering results, with LSE capturing left hemisphere/right hemisphere affinity structure and ASE capturing gray matter/white matter core-periphery structure.
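A bare-bones version of the two pipelines being compared (GMM composed with ASE versus GMM composed with LSE), assuming a symmetric adjacency matrix and a pre-chosen embedding dimension d; this is a sketch under those assumptions, not the code used in the paper.

import numpy as np
from sklearn.mixture import GaussianMixture

def spectral_embed(A, d, laplacian=False):
    """Top-d scaled spectral embedding of the adjacency matrix A (ASE) or of the
    degree-normalized matrix D^{-1/2} A D^{-1/2} (LSE-style)."""
    M = A.astype(float)
    if laplacian:
        deg = np.maximum(M.sum(axis=1), 1e-12)
        inv_sqrt = np.diag(1.0 / np.sqrt(deg))
        M = inv_sqrt @ M @ inv_sqrt
    vals, vecs = np.linalg.eigh(M)
    top = np.argsort(np.abs(vals))[::-1][:d]
    return vecs[:, top] * np.sqrt(np.abs(vals[top]))

def spectral_cluster(A, d, K, laplacian=False):
    """Cluster vertices with a K-component Gaussian mixture in embedding space."""
    X = spectral_embed(A, d, laplacian=laplacian)
    return GaussianMixture(n_components=K).fit_predict(X)

Running spectral_cluster on the same graph with laplacian=False and laplacian=True is exactly the comparison behind the 'Two Truths' phenomenon: the two embeddings can recover different, and differently meaningful, partitions of the same vertex set.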
Abstract:The random dot product graph (RDPG) is an independent-edge random graph that is analytically tractable and, simultaneously, either encompasses or can successfully approximate a wide range of random graphs, from relatively simple stochastic block models to complex latent position graphs. In this survey paper, we describe a comprehensive paradigm for statistical inference on random dot product graphs, a paradigm centered on spectral embeddings of adjacency and Laplacian matrices. We examine the analogues, in graph inference, of several canonical tenets of classical Euclidean inference: in particular, we summarize a body of existing results on the consistency and asymptotic normality of the adjacency and Laplacian spectral embeddings, and the role these spectral embeddings can play in the construction of single- and multi-sample hypothesis tests for graph data. We investigate several real-world applications, including community detection and classification in large social networks and the determination of functional and biologically relevant network properties from an exploratory data analysis of the Drosophila connectome. We outline requisite background and current open problems in spectral graph inference.
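To make the central object concrete: in an RDPG every vertex carries a low-dimensional latent position, and each edge appears independently with probability equal to the inner product of its endpoints' positions; adjacency spectral embedding then recovers those positions up to an orthogonal transformation. A small illustrative simulation (names and parameter values are ours):

import numpy as np

rng = np.random.default_rng(0)

def sample_rdpg(X):
    """Sample an undirected, hollow RDPG: edge i~j appears independently with
    probability X[i] @ X[j] (entries of X @ X.T assumed to lie in [0, 1])."""
    P = X @ X.T
    U = rng.random(P.shape)
    A = np.triu((U < P).astype(float), k=1)
    return A + A.T

def ase(A, d):
    """Adjacency spectral embedding: top-d scaled eigenvectors of A."""
    vals, vecs = np.linalg.eigh(A)
    top = np.argsort(np.abs(vals))[::-1][:d]
    return vecs[:, top] * np.sqrt(np.abs(vals[top]))

# A two-block stochastic blockmodel written as an RDPG: the latent position of
# each vertex is one of two fixed vectors.
X = np.vstack([np.tile([0.8, 0.1], (100, 1)), np.tile([0.1, 0.8], (100, 1))])
Xhat = ase(sample_rdpg(X), d=2)   # approximates X up to an orthogonal rotation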
Abstract:We present semiparametric spectral modeling of the complete larval Drosophila mushroom body connectome. Motivated by a thorough exploratory data analysis of the network via Gaussian mixture modeling (GMM) in the adjacency spectral embedding (ASE) representation space, we introduce the latent structure model (LSM) for network modeling and inference. LSM is a generalization of the stochastic block model (SBM) and a special case of the random dot product graph (RDPG) latent position model, and is amenable to semiparametric GMM in the ASE representation space. The resulting connectome code derived via semiparametric GMM composed with ASE captures latent connectome structure and elucidates biologically relevant neuronal properties.
Abstract:We propose a robust, scalable, integrated methodology for community detection and community comparison in graphs. In our procedure, we first embed a graph into an appropriate Euclidean space to obtain a low-dimensional representation, and then cluster the vertices into communities. We next employ nonparametric graph inference techniques to identify structural similarity among these communities. These two steps are then applied recursively on the communities, allowing us to detect more fine-grained structure. We describe a hierarchical stochastic blockmodel---namely, a stochastic blockmodel with a natural hierarchical structure---and establish conditions under which our algorithm yields consistent estimates of model parameters and motifs, which we define to be stochastically similar groups of subgraphs. Finally, we demonstrate the effectiveness of our algorithm in both simulated and real data. Specifically, we address the problem of locating similar subcommunities in a partially reconstructed Drosophila connectome and in the social network Friendster.
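A schematic of the recursive embed-then-cluster loop described above, with the nonparametric community-comparison (motif) step omitted and all helper names hypothetical:

import numpy as np
from sklearn.mixture import GaussianMixture

def hierarchical_communities(A, d, K, min_size=20, depth=0, max_depth=3):
    """Recursively embed a (sub)graph, cluster its vertices with a Gaussian
    mixture, and descend into each community. Returns (depth, global vertex
    index array) pairs for the leaves of the recursion."""
    n = A.shape[0]
    if n < min_size or depth >= max_depth:
        return [(depth, np.arange(n))]
    vals, vecs = np.linalg.eigh(A.astype(float))
    top = np.argsort(np.abs(vals))[::-1][:d]
    X = vecs[:, top] * np.sqrt(np.abs(vals[top]))      # adjacency spectral embedding
    labels = GaussianMixture(n_components=K).fit_predict(X)
    leaves = []
    for k in range(K):
        idx = np.where(labels == k)[0]
        if idx.size == 0:
            continue
        sub = hierarchical_communities(A[np.ix_(idx, idx)], d, K,
                                       min_size, depth + 1, max_depth)
        leaves.extend([(dep, idx[members]) for dep, members in sub])
    return leaves

The missing ingredient relative to the procedure above is the comparison step: after clustering at a given level, the induced subgraphs would also be compared nonparametrically to identify structurally similar communities (motifs) before recursing.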
Abstract:Vertex clustering in a stochastic blockmodel graph has wide applicability and has been the subject of extensive research. In this paper, we provide a short proof that the adjacency spectral embedding can be used to obtain perfect clustering for the stochastic blockmodel and the degree-corrected stochastic blockmodel. We also show an analogous result for the more general random dot product graph model.
Abstract:We prove a central limit theorem for the components of the eigenvectors corresponding to the largest eigenvalues of the adjacency matrix of a finite-dimensional random dot product graph whose true latent positions are unknown. In particular, we follow the methodology outlined in \citet{sussman2012universally} to construct consistent estimates for the latent positions, and we show that the appropriately scaled differences between the estimated and true latent positions converge to a mixture of Gaussian random variables. As a corollary, we obtain a central limit theorem for the first eigenvector of the adjacency matrix of an Erd\H{o}s--R\'enyi random graph.
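One common way such a limit theorem is stated (informally, and up to the orthogonal alignment and regularity conditions made precise in the paper): if the latent positions $X_1,\dots,X_n$ are drawn i.i.d. from a distribution with second-moment matrix $\Delta = \mathbb{E}[X_1 X_1^\top]$, then conditional on $X_i = x$,
\[
\sqrt{n}\,\bigl(W_n \hat{X}_i - X_i\bigr) \xrightarrow{\ d\ } \mathcal{N}\bigl(0,\Sigma(x)\bigr),
\qquad
\Sigma(x) = \Delta^{-1}\,\mathbb{E}\!\left[\bigl(x^\top X_1 - (x^\top X_1)^2\bigr) X_1 X_1^\top\right]\Delta^{-1},
\]
with $W_n$ an orthogonal matrix; integrating over $x$ gives the mixture-of-Gaussians limit described above. In the Erd\H{o}s--R\'enyi special case with edge probability $p$, the latent position is the scalar $\sqrt{p}$ and the display reduces to $\sqrt{n}(\hat{X}_i - \sqrt{p}) \xrightarrow{d} \mathcal{N}(0,\,1-p)$.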