Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Olivier Teste

IRIT-SIG, IRIT, UT2J, UT

A Survey on Cluster-based Federated Learning

Jan 29, 2025

Omar El-Rifai, Michael Ben Ali, Imen Megdiche, André Peninou, Olivier Teste

Abstract:As the industrial and commercial use of Federated Learning (FL) has expanded, so has the need for optimized algorithms. In settings were FL clients' data is non-independently and identically distributed (non-IID) and with highly heterogeneous distributions, the baseline FL approach seems to fall short. To tackle this issue, recent studies, have looked into personalized FL (PFL) which relaxes the implicit single-model constraint and allows for multiple hypotheses to be learned from the data or local models. Among the personalized FL approaches, cluster-based solutions (CFL) are particularly interesting whenever it is clear -through domain knowledge -that the clients can be separated into groups. In this paper, we study recent works on CFL, proposing: i) a classification of CFL solutions for personalization; ii) a structured review of literature iii) a review of alternative use cases for CFL. CCS Concepts: $\bullet$ General and reference $\rightarrow$ Surveys and overviews; $\bullet$ Computing methodologies $\rightarrow$ Machine learning; $\bullet$ Information systems $\rightarrow$ Clustering; $\bullet$ Security and privacy $\rightarrow$ Privacy-preserving protocols.

Via

Access Paper or Ask Questions

Comparative Evaluation of Clustered Federated Learning Method

Oct 18, 2024

Michael Ben Ali, Omar El-Rifai, Imen Megdiche, André Peninou, Olivier Teste

Abstract:Over recent years, Federated Learning (FL) has proven to be one of the most promising methods of distributed learning which preserves data privacy. As the method evolved and was confronted to various real-world scenarios, new challenges have emerged. One such challenge is the presence of highly heterogeneous (often referred as non-IID) data distributions among participants of the FL protocol. A popular solution to this hurdle is Clustered Federated Learning (CFL), which aims to partition clients into groups where the distribution are homogeneous. In the literature, state-of-the-art CFL algorithms are often tested using a few cases of data heterogeneities, without systematically justifying the choices. Further, the taxonomy used for differentiating the different heterogeneity scenarios is not always straightforward. In this paper, we explore the performance of two state-of-theart CFL algorithms with respect to a proposed taxonomy of data heterogeneities in federated learning (FL). We work with three image classification datasets and analyze the resulting clusters against the heterogeneity classes using extrinsic clustering metrics. Our objective is to provide a clearer understanding of the relationship between CFL performances and data heterogeneity scenarios.

* The 2nd IEEE International Conference on Federated Learning Technologies and Applications (FLTA24), Sep 2024, Valencia (Espagne), Spain

Via

Access Paper or Ask Questions

DECWA : Density-Based Clustering using Wasserstein Distance

Oct 25, 2023

Nabil El Malki, Robin Cugny, Olivier Teste, Franck Ravat

Abstract:Clustering is a data analysis method for extracting knowledge by discovering groups of data called clusters. Among these methods, state-of-the-art density-based clustering methods have proven to be effective for arbitrary-shaped clusters. Despite their encouraging results, they suffer to find low-density clusters, near clusters with similar densities, and high-dimensional data. Our proposals are a new characterization of clusters and a new clustering algorithm based on spatial density and probabilistic approach. First of all, sub-clusters are built using spatial density represented as probability density function ($p.d.f$) of pairwise distances between points. A method is then proposed to agglomerate similar sub-clusters by using both their density ($p.d.f$) and their spatial distance. The key idea we propose is to use the Wasserstein metric, a powerful tool to measure the distance between $p.d.f$ of sub-clusters. We show that our approach outperforms other state-of-the-art density-based clustering methods on a wide variety of datasets.

* 6 pages, CIKM 2020

Via

Access Paper or Ask Questions

Why Should I Choose You? AutoXAI: A Framework for Selecting and Tuning eXplainable AI Solutions

Oct 10, 2022

Robin Cugny, Julien Aligon, Max Chevalier, Geoffrey Roman Jimenez, Olivier Teste

Figure 1 for Why Should I Choose You? AutoXAI: A Framework for Selecting and Tuning eXplainable AI Solutions

Figure 2 for Why Should I Choose You? AutoXAI: A Framework for Selecting and Tuning eXplainable AI Solutions

Figure 3 for Why Should I Choose You? AutoXAI: A Framework for Selecting and Tuning eXplainable AI Solutions

Figure 4 for Why Should I Choose You? AutoXAI: A Framework for Selecting and Tuning eXplainable AI Solutions

Abstract:In recent years, a large number of XAI (eXplainable Artificial Intelligence) solutions have been proposed to explain existing ML (Machine Learning) models or to create interpretable ML models. Evaluation measures have recently been proposed and it is now possible to compare these XAI solutions. However, selecting the most relevant XAI solution among all this diversity is still a tedious task, especially when meeting specific needs and constraints. In this paper, we propose AutoXAI, a framework that recommends the best XAI solution and its hyperparameters according to specific XAI evaluation metrics while considering the user's context (dataset, ML model, XAI needs and constraints). It adapts approaches from context-aware recommender systems and strategies of optimization and evaluation from AutoML (Automated Machine Learning). We apply AutoXAI to two use cases, and show that it recommends XAI solutions adapted to the user's needs with the best hyperparameters matching the user's constraints.

* 16 pages, 7 figures, to be published in CIKM2022

Via

Access Paper or Ask Questions