Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pierre Vandergheynst

Return of ChebNet: Understanding and Improving an Overlooked GNN on Long Range Tasks

Jun 09, 2025

Ali Hariri, Álvaro Arroyo, Alessio Gravina, Moshe Eliasof, Carola-Bibiane Schönlieb, Davide Bacciu, Kamyar Azizzadenesheli, Xiaowen Dong, Pierre Vandergheynst

Abstract:ChebNet, one of the earliest spectral GNNs, has largely been overshadowed by Message Passing Neural Networks (MPNNs), which gained popularity for their simplicity and effectiveness in capturing local graph structure. Despite their success, MPNNs are limited in their ability to capture long-range dependencies between nodes. This has led researchers to adapt MPNNs through rewiring or make use of Graph Transformers, which compromises the computational efficiency that characterized early spatial message-passing architectures, and typically disregards the graph structure. Almost a decade after its original introduction, we revisit ChebNet to shed light on its ability to model distant node interactions. We find that out-of-box, ChebNet already shows competitive advantages relative to classical MPNNs and GTs on long-range benchmarks, while maintaining good scalability properties for high-order polynomials. However, we uncover that this polynomial expansion leads ChebNet to an unstable regime during training. To address this limitation, we cast ChebNet as a stable and non-dissipative dynamical system, which we coin Stable-ChebNet. Our Stable-ChebNet model allows for stable information propagation, and has controllable dynamics which do not require the use of eigendecompositions, positional encodings, or graph rewiring. Across several benchmarks, Stable-ChebNet achieves near state-of-the-art performance.

Via

Access Paper or Ask Questions

Implicit Gaussian process representation of vector fields over arbitrary latent manifolds

Sep 28, 2023

Robert L. Peach, Matteo Vinao-Carl, Nir Grossman, Michael David, Emma Mallas, David Sharp, Paresh A. Malhotra, Pierre Vandergheynst, Adam Gosztolai

Abstract:Gaussian processes (GPs) are popular nonparametric statistical models for learning unknown functions and quantifying the spatiotemporal uncertainty in data. Recent works have extended GPs to model scalar and vector quantities distributed over non-Euclidean domains, including smooth manifolds appearing in numerous fields such as computer vision, dynamical systems, and neuroscience. However, these approaches assume that the manifold underlying the data is known, limiting their practical utility. We introduce RVGP, a generalisation of GPs for learning vector signals over latent Riemannian manifolds. Our method uses positional encoding with eigenfunctions of the connection Laplacian, associated with the tangent bundle, readily derived from common graph-based approximation of data. We demonstrate that RVGP possesses global regularity over the manifold, which allows it to super-resolve and inpaint vector fields while preserving singularities. Furthermore, we use RVGP to reconstruct high-density neural dynamics derived from low-density EEG recordings in healthy individuals and Alzheimer's patients. We show that vector field singularities are important disease markers and that their reconstruction leads to a comparable classification accuracy of disease states to high-density recordings. Thus, our method overcomes a significant practical limitation in experimental and clinical applications.

Via

Access Paper or Ask Questions

Interpretable statistical representations of neural population dynamics and geometry

Apr 06, 2023

Adam Gosztolai, Robert L. Peach, Alexis Arnaudon, Mauricio Barahona, Pierre Vandergheynst

Figure 1 for Interpretable statistical representations of neural population dynamics and geometry

Figure 2 for Interpretable statistical representations of neural population dynamics and geometry

Figure 3 for Interpretable statistical representations of neural population dynamics and geometry

Figure 4 for Interpretable statistical representations of neural population dynamics and geometry

Abstract:The dynamics of neuron populations during diverse tasks often evolve on low-dimensional manifolds. However, it remains challenging to discern the contributions of geometry and dynamics for encoding relevant behavioural variables. Here, we introduce an unsupervised geometric deep learning framework for representing non-linear dynamical systems based on statistical distributions of local phase portrait features. Our method provides robust geometry-aware or geometry-agnostic representations for the unbiased comparison of dynamics based on measured trajectories. We demonstrate that our statistical representation can generalise across neural network instances to discriminate computational mechanisms, obtain interpretable embeddings of neural dynamics in a primate reaching task with geometric correspondence to hand kinematics, and develop a decoding algorithm with state-of-the-art accuracy. Our results highlight the importance of using the intrinsic manifold structure over temporal information to develop better decoding algorithms and assimilate data across experiments.

Via

Access Paper or Ask Questions

Generalised Implicit Neural Representations

May 31, 2022

Daniele Grattarola, Pierre Vandergheynst

Figure 1 for Generalised Implicit Neural Representations

Figure 2 for Generalised Implicit Neural Representations

Figure 3 for Generalised Implicit Neural Representations

Figure 4 for Generalised Implicit Neural Representations

Abstract:We consider the problem of learning implicit neural representations (INRs) for signals on non-Euclidean domains. In the Euclidean case, INRs are trained on a discrete sampling of a signal over a regular lattice. Here, we assume that the continuous signal exists on some unknown topological space from which we sample a discrete graph. In the absence of a coordinate system to identify the sampled nodes, we propose approximating their location with a spectral embedding of the graph. This allows us to train INRs without knowing the underlying continuous domain, which is the case for most graph signals in nature, while also making the INRs equivariant under the symmetry group of the domain. We show experiments with our method on various real-world signals on non-Euclidean domains.

Via

Access Paper or Ask Questions

Some limitations of norm based generalization bounds in deep neural networks

May 23, 2019

Konstantinos Pitas, Andreas Loukas, Mike Davies, Pierre Vandergheynst

Figure 1 for Some limitations of norm based generalization bounds in deep neural networks

Figure 2 for Some limitations of norm based generalization bounds in deep neural networks

Figure 3 for Some limitations of norm based generalization bounds in deep neural networks

Figure 4 for Some limitations of norm based generalization bounds in deep neural networks

Abstract:Deep convolutional neural networks have been shown to be able to fit a labeling over random data while still being able to generalize well on normal datasets. Describing deep convolutional neural network capacity through the measure of spectral complexity has been recently proposed to tackle this apparent paradox. Spectral complexity correlates with GE and can distinguish networks trained on normal and random labels. We propose the first GE bound based on spectral complexity for deep convolutional neural networks and provide tighter bounds by orders of magnitude from the previous estimate. We then investigate theoretically and empirically the insensitivity of spectral complexity to invariances of modern deep convolutional neural networks, and show several limitations of spectral complexity that occur as a result.

Via

Access Paper or Ask Questions

Revisiting hard thresholding for DNN pruning

May 21, 2019

Konstantinos Pitas, Mike Davies, Pierre Vandergheynst

Figure 1 for Revisiting hard thresholding for DNN pruning

Figure 2 for Revisiting hard thresholding for DNN pruning

Figure 3 for Revisiting hard thresholding for DNN pruning

Figure 4 for Revisiting hard thresholding for DNN pruning

Abstract:The most common method for DNN pruning is hard thresholding of network weights, followed by retraining to recover any lost accuracy. Recently developed smart pruning algorithms use the DNN response over the training set for a variety of cost functions to determine redundant network weights, leading to less accuracy degradation and possibly less retraining time. For experiments on the total pruning time (pruning time + retraining time) we show that hard thresholding followed by retraining remains the most efficient way of reducing the number of network parameters. However smart pruning algorithms still have advantages when retraining is not possible. In this context we propose a novel smart pruning algorithm based on difference of convex functions optimisation and show that it is often orders of magnitude faster than competing approaches while achieving the lowest classification accuracy degradation. Furthermore we investigate theoretically the effect of hard thresholding on DNN accuracy. We show that accuracy degradation increases with remaining network depth from the pruned layer. We also discover a link between the latent dimensionality of the training data manifold and network robustness to hard thresholding.

* arXiv admin note: substantial text overlap with arXiv:1803.04239

Via

Access Paper or Ask Questions

A Graph-structured Dataset for Wikipedia Research

Mar 20, 2019

Nicolas Aspert, Volodymyr Miz, Benjamin Ricaud, Pierre Vandergheynst

Figure 1 for A Graph-structured Dataset for Wikipedia Research

Figure 2 for A Graph-structured Dataset for Wikipedia Research

Figure 3 for A Graph-structured Dataset for Wikipedia Research

Figure 4 for A Graph-structured Dataset for Wikipedia Research

Abstract:Wikipedia is a rich and invaluable source of information. Its central place on the Web makes it a particularly interesting object of study for scientists. Researchers from different domains used various complex datasets related to Wikipedia to study language, social behavior, knowledge organization, and network theory. While being a scientific treasure, the large size of the dataset hinders pre-processing and may be a challenging obstacle for potential new studies. This issue is particularly acute in scientific domains where researchers may not be technically and data processing savvy. On one hand, the size of Wikipedia dumps is large. It makes the parsing and extraction of relevant information cumbersome. On the other hand, the API is straightforward to use but restricted to a relatively small number of requests. The middle ground is at the mesoscopic scale when researchers need a subset of Wikipedia ranging from thousands to hundreds of thousands of pages but there exists no efficient solution at this scale. In this work, we propose an efficient data structure to make requests and access subnetworks of Wikipedia pages and categories. We provide convenient tools for accessing and filtering viewership statistics or "pagecounts" of Wikipedia web pages. The dataset organization leverages principles of graph databases that allows rapid and intuitive access to subgraphs of Wikipedia articles and categories. The dataset and deployment guidelines are available on the LTS2 website \url{https://lts2.epfl.ch/Datasets/Wikipedia/}.

Via

Access Paper or Ask Questions

Anomaly detection in the dynamics of web and social networks

Jan 22, 2019

Volodymyr Miz, Benjamin Ricaud, Kirell Benzi, Pierre Vandergheynst

Figure 1 for Anomaly detection in the dynamics of web and social networks

Figure 2 for Anomaly detection in the dynamics of web and social networks

Figure 3 for Anomaly detection in the dynamics of web and social networks

Figure 4 for Anomaly detection in the dynamics of web and social networks

Abstract:In this work, we propose a new, fast and scalable method for anomaly detection in large time-evolving graphs. It may be a static graph with dynamic node attributes (e.g. time-series), or a graph evolving in time, such as a temporal network. We define an anomaly as a localized increase in temporal activity in a cluster of nodes. The algorithm is unsupervised. It is able to detect and track anomalous activity in a dynamic network despite the noise from multiple interfering sources. We use the Hopfield network model of memory to combine the graph and time information. We show that anomalies can be spotted with a good precision using a memory network. The presented approach is scalable and we provide a distributed implementation of the algorithm. To demonstrate its efficiency, we apply it to two datasets: Enron Email dataset and Wikipedia page views. We show that the anomalous spikes are triggered by the real-world events that impact the network dynamics. Besides, the structure of the clusters and the analysis of the time evolution associated with the detected events reveals interesting facts on how humans interact, exchange and search for information, opening the door to new quantitative studies on collective and social behavior on large and dynamic datasets.

* The Web Conference 2019, 10 pages, 7 figures

Via

Access Paper or Ask Questions

Geodesic convolutional neural networks on Riemannian manifolds

Jun 08, 2018

Jonathan Masci, Davide Boscaini, Michael M. Bronstein, Pierre Vandergheynst

Figure 1 for Geodesic convolutional neural networks on Riemannian manifolds

Figure 2 for Geodesic convolutional neural networks on Riemannian manifolds

Figure 3 for Geodesic convolutional neural networks on Riemannian manifolds

Figure 4 for Geodesic convolutional neural networks on Riemannian manifolds

Abstract:Feature descriptors play a crucial role in a wide range of geometry analysis and processing applications, including shape correspondence, retrieval, and segmentation. In this paper, we introduce Geodesic Convolutional Neural Networks (GCNN), a generalization of the convolutional networks (CNN) paradigm to non-Euclidean manifolds. Our construction is based on a local geodesic system of polar coordinates to extract "patches", which are then passed through a cascade of filters and linear and non-linear operators. The coefficients of the filters and linear combination weights are optimization variables that are learned to minimize a task-specific cost function. We use GCNN to learn invariant shape features, allowing to achieve state-of-the-art performance in problems such as shape description, retrieval, and correspondence.

Via

Access Paper or Ask Questions

PAC-Bayesian Margin Bounds for Convolutional Neural Networks

Apr 20, 2018

Konstantinos Pitas, Mike Davies, Pierre Vandergheynst

Figure 1 for PAC-Bayesian Margin Bounds for Convolutional Neural Networks

Figure 2 for PAC-Bayesian Margin Bounds for Convolutional Neural Networks

Figure 3 for PAC-Bayesian Margin Bounds for Convolutional Neural Networks

Figure 4 for PAC-Bayesian Margin Bounds for Convolutional Neural Networks

Abstract:Recently the generalization error of deep neural networks has been analyzed through the PAC-Bayesian framework, for the case of fully connected layers. We adapt this approach to the convolutional setting.

* arXiv admin note: text overlap with arXiv:1707.09564 by other authors

Via

Access Paper or Ask Questions