Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hans Riess

Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare

Oct 09, 2024

Xenia Konti, Hans Riess, Manos Giannopoulos, Yi Shen, Michael J. Pencina, Nicoleta J. Economou-Zavlanos, Michael M. Zavlanos

Figure 1 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare

Figure 2 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare

Figure 3 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare

Abstract:In this paper, we address the challenge of heterogeneous data distributions in cross-silo federated learning by introducing a novel algorithm, which we term Cross-silo Robust Clustered Federated Learning (CS-RCFL). Our approach leverages the Wasserstein distance to construct ambiguity sets around each client's empirical distribution that capture possible distribution shifts in the local data, enabling evaluation of worst-case model performance. We then propose a model-agnostic integer fractional program to determine the optimal distributionally robust clustering of clients into coalitions so that possible biases in the local models caused by statistically heterogeneous client datasets are avoided, and analyze our method for linear and logistic regression models. Finally, we discuss a federated learning protocol that ensures the privacy of client distributions, a critical consideration, for instance, when clients are healthcare institutions. We evaluate our algorithm on synthetic and real-world healthcare data.

* 8 pages, 3 figures, Accepted to IEEE CDC 2024

Via

Access Paper or Ask Questions

Path Signatures and Graph Neural Networks for Slow Earthquake Analysis: Better Together?

Feb 05, 2024

Hans Riess, Manolis Veveakis, Michael M. Zavlanos

Abstract:The path signature, having enjoyed recent success in the machine learning community, is a theoretically-driven method for engineering features from irregular paths. On the other hand, graph neural networks (GNN), neural architectures for processing data on graphs, excel on tasks with irregular domains, such as sensor networks. In this paper, we introduce a novel approach, Path Signature Graph Convolutional Neural Networks (PS-GCNN), integrating path signatures into graph convolutional neural networks (GCNN), and leveraging the strengths of both path signatures, for feature extraction, and GCNNs, for handling spatial interactions. We apply our method to analyze slow earthquake sequences, also called slow slip events (SSE), utilizing data from GPS timeseries, with a case study on a GPS sensor network on the east coast of New Zealand's north island. We also establish benchmarks for our method on simulated stochastic differential equations, which model similar reaction-diffusion phenomenon. Our methodology shows promise for future advancement in earthquake prediction and sensor network analysis.

Via

Access Paper or Ask Questions

Lattice Theory in Multi-Agent Systems

Apr 05, 2023

Hans Riess

Figure 1 for Lattice Theory in Multi-Agent Systems

Figure 2 for Lattice Theory in Multi-Agent Systems

Figure 3 for Lattice Theory in Multi-Agent Systems

Figure 4 for Lattice Theory in Multi-Agent Systems

Abstract:In this thesis, we argue that (order-) lattice-based multi-agent information systems constitute a broad class of networked multi-agent systems in which relational data is passed between nodes. Mathematically modeled as lattice-valued sheaves, we initiate a discrete Hodge theory with a Laplace operator, analogous to the graph Laplacian and the graph connection Laplacian, acting on assignments of data to the nodes of a Tarski sheaf. The Hodge-Tarski theorem (the main theorem) relates the fixed point theory of this operator, called the Tarski Laplacian in deference to the Tarski Fixed Point Theorem, to the global sections (consistent global states) of the sheaf. We present novel applications to signal processing and multi-agent semantics and supply a plethora of examples throughout.

* Ph.D. Thesis

Via

Access Paper or Ask Questions

Tangent Bundle Convolutional Learning: from Manifolds to Cellular Sheaves and Back

Mar 20, 2023

Claudio Battiloro, Zhiyang Wang, Hans Riess, Paolo Di Lorenzo, Alejandro Ribeiro

Figure 1 for Tangent Bundle Convolutional Learning: from Manifolds to Cellular Sheaves and Back

Figure 2 for Tangent Bundle Convolutional Learning: from Manifolds to Cellular Sheaves and Back

Figure 3 for Tangent Bundle Convolutional Learning: from Manifolds to Cellular Sheaves and Back

Figure 4 for Tangent Bundle Convolutional Learning: from Manifolds to Cellular Sheaves and Back

Abstract:In this work we introduce a convolution operation over the tangent bundle of Riemann manifolds in terms of exponentials of the Connection Laplacian operator. We define tangent bundle filters and tangent bundle neural networks (TNNs) based on this convolution operation, which are novel continuous architectures operating on tangent bundle signals, i.e. vector fields over the manifolds. Tangent bundle filters admit a spectral representation that generalizes the ones of scalar manifold filters, graph filters and standard convolutional filters in continuous time. We then introduce a discretization procedure, both in the space and time domains, to make TNNs implementable, showing that their discrete counterpart is a novel principled variant of the very recently introduced sheaf neural networks. We formally prove that this discretized architecture converges to the underlying continuous TNN. Finally, we numerically evaluate the effectiveness of the proposed architecture on various learning tasks, both on synthetic and real data.

* arXiv admin note: substantial text overlap with arXiv:2210.15058

Via

Access Paper or Ask Questions

Tangent Bundle Filters and Neural Networks: from Manifolds to Cellular Sheaves and Back

Nov 18, 2022

Claudio Battiloro, Zhiyang Wang, Hans Riess, Paolo Di Lorenzo, Alejandro Ribeiro

Abstract:In this work we introduce a convolution operation over the tangent bundle of Riemannian manifolds exploiting the Connection Laplacian operator. We use the convolution to define tangent bundle filters and tangent bundle neural networks (TNNs), novel continuous architectures operating on tangent bundle signals, i.e. vector fields over manifolds. We discretize TNNs both in space and time domains, showing that their discrete counterpart is a principled variant of the recently introduced Sheaf Neural Networks. We formally prove that this discrete architecture converges to the underlying continuous TNN. We numerically evaluate the effectiveness of the proposed architecture on a denoising task of a tangent vector field over the unit 2-sphere.

Via

Access Paper or Ask Questions

Stable and Transferable Hyper-Graph Neural Networks

Nov 11, 2022

Mikhail Hayhoe, Hans Riess, Victor M. Preciado, Alejandro Ribeiro

Figure 1 for Stable and Transferable Hyper-Graph Neural Networks

Figure 2 for Stable and Transferable Hyper-Graph Neural Networks

Figure 3 for Stable and Transferable Hyper-Graph Neural Networks

Figure 4 for Stable and Transferable Hyper-Graph Neural Networks

Abstract:We introduce an architecture for processing signals supported on hypergraphs via graph neural networks (GNNs), which we call a Hyper-graph Expansion Neural Network (HENN), and provide the first bounds on the stability and transferability error of a hypergraph signal processing model. To do so, we provide a framework for bounding the stability and transferability error of GNNs across arbitrary graphs via spectral similarity. By bounding the difference between two graph shift operators (GSOs) in the positive semi-definite sense via their eigenvalue spectrum, we show that this error depends only on the properties of the GNN and the magnitude of spectral similarity of the GSOs. Moreover, we show that existing transferability results that assume the graphs are small perturbations of one another, or that the graphs are random and drawn from the same distribution or sampled from the same graphon can be recovered using our approach. Thus, both GNNs and our HENNs (trained using normalized Laplacians as graph shift operators) will be increasingly stable and transferable as the graphs become larger. Experimental results illustrate the importance of considering multiple graph representations in HENN, and show its superior performance when transferability is desired.

* 13 pages, 1 figure

Via

Access Paper or Ask Questions

Multidimensional Persistence Module Classification via Lattice-Theoretic Convolutions

Nov 28, 2020

Hans Riess, Jakob Hansen

Figure 1 for Multidimensional Persistence Module Classification via Lattice-Theoretic Convolutions

Figure 2 for Multidimensional Persistence Module Classification via Lattice-Theoretic Convolutions

Abstract:Multiparameter persistent homology has been largely neglected as an input to machine learning algorithms. We consider the use of lattice-based convolutional neural network layers as a tool for the analysis of features arising from multiparameter persistence modules. We find that these show promise as an alternative to convolutions for the classification of multidimensional persistence modules.

Via

Access Paper or Ask Questions

Quiver Signal Processing (QSP)

Oct 22, 2020

Alejandro Parada-Mayorga, Hans Riess, Alejandro Ribeiro, Robert Ghrist

Figure 1 for Quiver Signal Processing (QSP)

Abstract:In this paper we state the basics for a signal processing framework on quiver representations. A quiver is a directed graph and a quiver representation is an assignment of vector spaces to the nodes of the graph and of linear maps between the vector spaces associated to the nodes. Leveraging the tools from representation theory, we propose a signal processing framework that allows us to handle heterogeneous multidimensional information in networks. We provide a set of examples where this framework provides a natural set of tools to understand apparently hidden structure in information. We remark that the proposed framework states the basis for building graph neural networks where information can be processed and handled in alternative ways.

Via

Access Paper or Ask Questions