Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paulo Gonçalves

OCKHAM

PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms

Dec 18, 2024

Etienne Lasalle, Rémi Vaudaine, Titouan Vayer, Pierre Borgnat, Rémi Gribonval, Paulo Gonçalves, Màrton Karsai

Abstract:Clustering the nodes of a graph is a cornerstone of graph analysis and has been extensively studied. However, some popular methods are not suitable for very large graphs: e.g., spectral clustering requires the computation of the spectral decomposition of the Laplacian matrix, which is not applicable for large graphs with a large number of communities. This work introduces PASCO, an overlay that accelerates clustering algorithms. Our method consists of three steps: 1-We compute several independent small graphs representing the input graph by applying an efficient and structure-preserving coarsening algorithm. 2-A clustering algorithm is run in parallel onto each small graph and provides several partitions of the initial graph. 3-These partitions are aligned and combined with an optimal transport method to output the final partition. The PASCO framework is based on two key contributions: a novel global algorithm structure designed to enable parallelization and a fast, empirically validated graph coarsening algorithm that preserves structural properties. We demonstrate the strong performance of 1 PASCO in terms of computational efficiency, structural preservation, and output partition quality, evaluated on both synthetic and real-world graph datasets.

Via

Access Paper or Ask Questions

Compressive Recovery of Sparse Precision Matrices

Nov 08, 2023

Titouan Vayer, Etienne Lasalle, Rémi Gribonval, Paulo Gonçalves

Abstract:We consider the problem of learning a graph modeling the statistical relations of the $d$ variables of a dataset with $n$ samples $X \in \mathbb{R}^{n \times d}$. Standard approaches amount to searching for a precision matrix $\Theta$ representative of a Gaussian graphical model that adequately explains the data. However, most maximum likelihood-based estimators usually require storing the $d^{2}$ values of the empirical covariance matrix, which can become prohibitive in a high-dimensional setting. In this work, we adopt a compressive viewpoint and aim to estimate a sparse $\Theta$ from a sketch of the data, i.e. a low-dimensional vector of size $m \ll d^{2}$ carefully designed from $X$ using nonlinear random features. Under certain assumptions on the spectrum of $\Theta$ (or its condition number), we show that it is possible to estimate it from a sketch of size $m=\Omega((d+2k)\log(d))$ where $k$ is the maximal number of edges of the underlying graph. These information-theoretic guarantees are inspired by compressed sensing theory and involve restricted isometry properties and instance optimal decoders. We investigate the possibility of achieving practical recovery with an iterative algorithm based on the graphical lasso, viewed as a specific denoiser. We compare our approach and graphical lasso on synthetic datasets, demonstrating its favorable performance even when the dataset is compressed.

Via

Access Paper or Ask Questions

Implicit Differentiation for Hyperparameter Tuning the Weighted Graphical Lasso

Jul 05, 2023

Can Pouliquen, Paulo Gonçalves, Mathurin Massias, Titouan Vayer

Figure 1 for Implicit Differentiation for Hyperparameter Tuning the Weighted Graphical Lasso

Figure 2 for Implicit Differentiation for Hyperparameter Tuning the Weighted Graphical Lasso

Figure 3 for Implicit Differentiation for Hyperparameter Tuning the Weighted Graphical Lasso

Abstract:We provide a framework and algorithm for tuning the hyperparameters of the Graphical Lasso via a bilevel optimization problem solved with a first-order method. In particular, we derive the Jacobian of the Graphical Lasso solution with respect to its regularization hyperparameters.

Via

Access Paper or Ask Questions

Fast Multiscale Diffusion on Graphs

Apr 29, 2021

Sibylle Marcotte, Amélie Barbe, Rémi Gribonval, Titouan Vayer, Marc Sebban, Pierre Borgnat, Paulo Gonçalves

Figure 1 for Fast Multiscale Diffusion on Graphs

Abstract:Diffusing a graph signal at multiple scales requires computing the action of the exponential of several multiples of the Laplacian matrix. We tighten a bound on the approximation error of truncated Chebyshev polynomial approximations of the exponential, hence significantly improving a priori estimates of the polynomial order for a prescribed error. We further exploit properties of these approximations to factorize the computation of the action of the diffusion operator over multiple scales, thus reducing drastically its computational cost.

Via

Access Paper or Ask Questions

Solving NMF with smoothness and sparsity constraints using PALM

Oct 31, 2019

Raimon Fabregat, Nelly Pustelnik, Paulo Gonçalves, Pierre Borgnat

Figure 1 for Solving NMF with smoothness and sparsity constraints using PALM

Figure 2 for Solving NMF with smoothness and sparsity constraints using PALM

Figure 3 for Solving NMF with smoothness and sparsity constraints using PALM

Abstract:Non-negative matrix factorization is a problem of dimensionality reduction and source separation of data that has been widely used in many fields since it was studied in depth in 1999 by Lee and Seung, including in compression of data, document clustering, processing of audio spectrograms and astronomy. In this work we have adapted a minimization scheme for convex functions with non-differentiable constraints called PALM to solve the NMF problem with solutions that can be smooth and/or sparse, two properties frequently desired.

Via

Access Paper or Ask Questions

$L^γ$-PageRank for Semi-Supervised Learning

Mar 11, 2019

Esteban Bautista, Patrice Abry, Paulo Gonçalves

Figure 1 for $L^γ$-PageRank for Semi-Supervised Learning

Figure 2 for $L^γ$-PageRank for Semi-Supervised Learning

Figure 3 for $L^γ$-PageRank for Semi-Supervised Learning

Figure 4 for $L^γ$-PageRank for Semi-Supervised Learning

Abstract:PageRank for Semi-Supervised Learning has shown to leverage data structures and limited tagged examples to yield meaningful classification. Despite successes, classification performance can still be improved, particularly in cases of fuzzy graphs or unbalanced labeled data. To address such limitations, a novel approach based on powers of the Laplacian matrix $L^\gamma$ ($\gamma > 0$), referred to as $L^\gamma$-PageRank, is proposed. Its theoretical study shows that it operates on signed graphs, where nodes belonging to one same class are more likely to share positive edges while nodes from different classes are more likely to be connected with negative edges. It is shown that by selecting an optimal $\gamma$, classification performance can be significantly enhanced. A procedure for the automated estimation of the optimal $\gamma$, from a unique observation of data, is devised and assessed. Experiments on several datasets demonstrate the effectiveness of both $L^\gamma$-PageRank classification and the optimal $\gamma$ estimation.

* Submitted to Applied Network Science (special issue on machine learning with graphs)

Via

Access Paper or Ask Questions