Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Irene Cannistraci

Detecting and Approximating Redundant Computational Blocks in Neural Networks

Oct 07, 2024

Irene Cannistraci, Emanuele Rodolà, Bastian Rieck

Abstract:Deep neural networks often learn similar internal representations, both across different models and within their own layers. While inter-network similarities have enabled techniques such as model stitching and merging, intra-network similarities present new opportunities for designing more efficient architectures. In this paper, we investigate the emergence of these internal similarities across different layers in diverse neural architectures, showing that similarity patterns emerge independently of the datataset used. We introduce a simple metric, Block Redundancy, to detect redundant blocks, providing a foundation for future architectural optimization methods. Building on this, we propose Redundant Blocks Approximation (RBA), a general framework that identifies and approximates one or more redundant computational blocks using simpler transformations. We show that the transformation $\mathcal{T}$ between two representations can be efficiently computed in closed-form, and it is enough to replace the redundant blocks from the network. RBA reduces model parameters and time complexity while maintaining good performance. We validate our method on classification tasks in the vision domain using a variety of pretrained foundational models and datasets.

* 9 pages, 10 figures, 7 tables

Via

Access Paper or Ask Questions

From Charts to Atlas: Merging Latent Spaces into One

Nov 11, 2023

Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

Figure 1 for From Charts to Atlas: Merging Latent Spaces into One

Figure 2 for From Charts to Atlas: Merging Latent Spaces into One

Figure 3 for From Charts to Atlas: Merging Latent Spaces into One

Figure 4 for From Charts to Atlas: Merging Latent Spaces into One

Abstract:Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative representations, and then aggregates them via a simple mean. We carefully divide a classification problem into a series of learning tasks under three different settings: sharing samples, classes, or neither. We then train a model on each task and aggregate the resulting latent spaces. We compare the aggregated space with that derived from an end-to-end model trained over all tasks and show that the two spaces are similar. We then observe that the aggregated space is better suited for classification, and empirically demonstrate that it is due to the unique imprints left by task-specific embedders within the representations. We finally test our framework in scenarios where no shared region exists and show that it can still be used to merge the spaces, albeit with diminished benefits over naive merging.

* To appear in the NeurReps workshop @ NeurIPS 2023

Via

Access Paper or Ask Questions

From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Oct 02, 2023

Irene Cannistraci, Luca Moschella, Marco Fumero, Valentino Maiorca, Emanuele Rodolà

Figure 1 for From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Figure 2 for From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Figure 3 for From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Figure 4 for From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Abstract:It has been observed that representations learned by distinct neural networks conceal structural similarities when the models are trained under similar inductive biases. From a geometric perspective, identifying the classes of transformations and the related invariances that connect these representations is fundamental to unlocking applications, such as merging, stitching, and reusing different neural modules. However, estimating task-specific transformations a priori can be challenging and expensive due to several factors (e.g., weights initialization, training hyperparameters, or data modality). To this end, we introduce a versatile method to directly incorporate a set of invariances into the representations, constructing a product space of invariant components on top of the latent representations without requiring prior knowledge about the optimal invariance to infuse. We validate our solution on classification and reconstruction tasks, observing consistent latent similarity and downstream performance improvements in a zero-shot stitching setting. The experimental analysis comprises three modalities (vision, text, and graphs), twelve pretrained foundational models, eight benchmarks, and several architectures trained from scratch.

* 38 pages, 10 figures and 28 tables

Via

Access Paper or Ask Questions

Bootstrapping Parallel Anchors for Relative Representations

Mar 01, 2023

Irene Cannistraci, Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Emanuele Rodolà

Figure 1 for Bootstrapping Parallel Anchors for Relative Representations

Figure 2 for Bootstrapping Parallel Anchors for Relative Representations

Figure 3 for Bootstrapping Parallel Anchors for Relative Representations

Figure 4 for Bootstrapping Parallel Anchors for Relative Representations

Abstract:The use of relative representations for latent embeddings has shown potential in enabling latent space communication and zero-shot model stitching across a wide range of applications. Nevertheless, relative representations rely on a certain amount of parallel anchors to be given as input, which can be impractical to obtain in certain scenarios. To overcome this limitation, we propose an optimization-based method to discover new parallel anchors from a limited number of seeds. Our approach can be used to find semantic correspondence between different domains, align their relative spaces, and achieve competitive results in several tasks.

* 9 pages, 7 tables

Via

Access Paper or Ask Questions

AI-based Data Preparation and Data Analytics in Healthcare: The Case of Diabetes

Jun 13, 2022

Marianna Maranghi, Aris Anagnostopoulos, Irene Cannistraci, Ioannis Chatzigiannakis, Federico Croce, Giulia Di Teodoro, Michele Gentile, Giorgio Grani, Maurizio Lenzerini, Stefano Leonardi(+6 more)

Figure 1 for AI-based Data Preparation and Data Analytics in Healthcare: The Case of Diabetes

Abstract:The Associazione Medici Diabetologi (AMD) collects and manages one of the largest worldwide-available collections of diabetic patient records, also known as the AMD database. This paper presents the initial results of an ongoing project whose focus is the application of Artificial Intelligence and Machine Learning techniques for conceptualizing, cleaning, and analyzing such an important and valuable dataset, with the goal of providing predictive insights to better support diabetologists in their diagnostic and therapeutic choices.

* The work has been presented at the conference Ital-IA 2022 (https://www.ital-ia2022.it/)

Via

Access Paper or Ask Questions