Abstract: Foundation models have emerged as robust, label-efficient models in diverse domains. In medical imaging, where labeled data is difficult to obtain, these models can advance medical diagnosis. However, it is unclear whether pre-training on a large amount of unlabeled data that is biased with respect to sensitive attributes influences the fairness of the model. This research examines the bias of the RetFound foundation model when it is fine-tuned on the Brazilian Multilabel Ophthalmological Dataset (BRSET), whose population differs from that of the pre-training dataset. Compared with supervised learning, the foundation model has the potential to reduce the gap between the maximum and minimum AUC across gender and age groups. However, in data-efficient generalization, the model's bias increases as the amount of data decreases. These findings suggest that when deploying a foundation model in real-life scenarios with limited data, the possibility of fairness issues should be considered.
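A minimal sketch of the fairness evaluation described above, computing per-group AUC and the max-min gap across a protected attribute; the labels, scores, and gender groups below are hypothetical placeholders, not BRSET data or RetFound outputs.

    import numpy as np
    from sklearn.metrics import roc_auc_score

    def auc_gap(y_true, y_score, groups):
        """Per-group AUC and the max-min AUC gap for one protected attribute."""
        aucs = {}
        for g in np.unique(groups):
            mask = groups == g
            if len(np.unique(y_true[mask])) == 2:   # AUC needs both classes present
                aucs[g] = roc_auc_score(y_true[mask], y_score[mask])
        return aucs, max(aucs.values()) - min(aucs.values())

    # Hypothetical labels, scores, and gender groups
    rng = np.random.default_rng(0)
    y_true = rng.integers(0, 2, size=1000)
    y_score = np.clip(0.3 * y_true + rng.normal(0.5, 0.25, size=1000), 0, 1)
    groups = rng.choice(["female", "male"], size=1000)
    aucs, gap = auc_gap(y_true, y_score, groups)
    print(aucs, "gap:", gap)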
Abstract: Ensuring consistent performance across diverse populations and incorporating fairness into machine learning models are crucial for advancing medical image diagnostics and promoting equitable healthcare. However, many databases do not provide protected attributes or contain unbalanced representations of demographic groups, complicating the evaluation of model performance across demographics and the application of bias mitigation techniques that rely on these attributes. This study investigates the effectiveness of using the backbone of a foundation model as an embedding extractor for creating groups that represent protected attributes, such as gender and age. We propose utilizing these groups in different stages of bias mitigation, including pre-processing, in-processing, and evaluation. Using databases in in-distribution and out-of-distribution scenarios, we find that the method can create groups that represent gender in both databases, reducing the performance difference across the gender attribute by 4.44% in-distribution and by 6.16% out-of-distribution. However, the model lacks robustness in handling the age attribute, underscoring the need for fundamentally fairer and more robust foundation models. These findings suggest a role for this approach in promoting fairness assessment in scenarios where protected attributes are unknown, contributing to the development of more equitable medical diagnostics.
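A hedged sketch of the core idea: treat a frozen backbone as an embedding extractor and cluster the embeddings into proxy groups for a protected attribute. The synthetic embeddings, cluster count, and choice of KMeans are illustrative assumptions; the paper's actual backbone and grouping procedure may differ.

    import numpy as np
    from sklearn.cluster import KMeans
    from sklearn.metrics import adjusted_rand_score

    def proxy_groups(embeddings, n_groups=2, seed=0):
        """Cluster backbone embeddings into proxy groups for a protected attribute."""
        return KMeans(n_clusters=n_groups, n_init=10, random_state=seed).fit_predict(embeddings)

    # Synthetic stand-ins for embeddings from a frozen foundation-model backbone
    rng = np.random.default_rng(0)
    emb = np.vstack([rng.normal(0, 1, (500, 128)), rng.normal(1.5, 1, (500, 128))])
    gender = np.array([0] * 500 + [1] * 500)   # known here only to check agreement

    groups = proxy_groups(emb, n_groups=2)
    print("ARI vs. gender:", adjusted_rand_score(gender, groups))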
Abstract: In order to promote the use of machine learning in 5G, the International Telecommunication Union (ITU) proposed in 2021 the second edition of the ITU AI/ML in 5G Challenge, with over 1600 participants from 82 countries. This work details the second-place solution overall, which is also the winning solution of the Graph Neural Networking Challenge 2021. We tackle the problem of generalization when applying a model to a 5G network that may have longer paths and larger link capacities than those observed in training. To achieve this, we propose to first extract robust features related to Queueing Theory (QT), and then fine-tune the analytical baseline prediction using a modification of the RouteNet Graph Neural Network (GNN) model. The proposed solution generalizes much better than using RouteNet alone, and reduces the analytical baseline's mean absolute percentage error from 10.42 to 1.45 (1.27 with an ensemble). This suggests that making small changes to an approximate model that is known to be robust can be an effective way to improve accuracy without compromising generalization.
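The correction scheme can be sketched as learning an adjustment to the analytical QT baseline rather than predicting the delay from scratch. The toy numbers below are only a stand-in for the GNN and the network data; MAPE is the metric quoted above.

    import numpy as np

    def mape(y_true, y_pred):
        """Mean absolute percentage error, the metric used in the challenge."""
        return 100.0 * np.mean(np.abs((y_pred - y_true) / y_true))

    # Hypothetical delays: a QT baseline that is roughly right but biased,
    # and a learned multiplicative correction (a stand-in for the GNN output).
    rng = np.random.default_rng(0)
    true_delay = rng.uniform(1.0, 10.0, size=1000)
    qt_baseline = true_delay * rng.normal(1.1, 0.08, size=1000)  # ~10% off
    # Pretend the GNN recovers the true/baseline ratio up to small noise
    gnn_ratio = (true_delay / qt_baseline) * rng.normal(1.0, 0.015, size=1000)
    refined = qt_baseline * gnn_ratio  # baseline times learned correction factor

    print("baseline MAPE:", mape(true_delay, qt_baseline))
    print("refined MAPE: ", mape(true_delay, refined))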
Abstract: Semi-supervised learning has received attention from researchers, as it allows one to exploit the structure of unlabeled data to achieve competitive classification results with far fewer labels than supervised approaches. The Local and Global Consistency (LGC) algorithm is one of the most well-known graph-based semi-supervised learning (GSSL) classifiers. Notably, its solution can be written as a linear combination of the known labels. The coefficients of this linear combination depend on a parameter $\alpha$, which determines the decay of the reward over time when a random walk reaches labeled vertices. In this work, we discuss how removing the self-influence of a labeled instance may be beneficial, and how it relates to leave-one-out error. Moreover, we propose to minimize this leave-one-out loss with automatic differentiation. Within this framework, we propose methods to estimate label reliability and diffusion rate, and show that the diffusion rate is more efficiently optimized with a spectral representation. Results show that the label reliability approach competes with robust $\ell_1$-norm methods and that removing diagonal entries reduces the risk of overfitting and leads to suitable criteria for parameter selection.
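A compact sketch of LGC and the diagonal-removal idea: the closed-form solution is $F = (I - \alpha S)^{-1} Y$ for the normalized affinity $S$, and zeroing each labeled point's self-influence yields a leave-one-out-style prediction. The RBF graph construction and parameters here are common choices, not necessarily the paper's.

    import numpy as np
    from sklearn.metrics.pairwise import rbf_kernel

    def lgc_propagation(X, Y, alpha=0.9, gamma=0.5):
        """Closed-form LGC: F = (I - alpha*S)^{-1} Y, with S the normalized affinity."""
        W = rbf_kernel(X, gamma=gamma)
        np.fill_diagonal(W, 0.0)                       # no self-loops in the graph
        d = W.sum(axis=1)
        S = W / np.sqrt(np.outer(d, d))                # D^{-1/2} W D^{-1/2}
        P = np.linalg.inv(np.eye(len(X)) - alpha * S)  # propagation matrix
        return P, P @ Y

    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(-1, 0.3, (30, 2)), rng.normal(1, 0.3, (30, 2))])
    Y = np.zeros((60, 2)); Y[:5, 0] = 1; Y[30:35, 1] = 1   # 10 labels, 2 classes
    P, F = lgc_propagation(X, Y)
    # Zeroing diag(P) cancels each labeled point's self-influence, so row i of
    # (P - diag(P)) @ Y is a leave-one-out prediction for labeled instance i.
    F_loo = (P - np.diag(np.diag(P))) @ Y
    print(F[:5].argmax(1), F_loo[:5].argmax(1))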
Abstract: In machine learning, one must acquire labels to help supervise a model that will be able to generalize to unseen data. However, the labeling process can be tedious, long, costly, and error-prone, and it is often the case that most of the data is unlabeled. Semi-supervised learning (SSL) alleviates this by making strong assumptions about the relation between the labels and the input data distribution. This paradigm has been successful in practice, but most SSL algorithms end up fully trusting the few available labels. In real life, both humans and automated systems are prone to mistakes; it is essential that our algorithms be able to work with labels that are both few and unreliable. Our work performs an extensive empirical evaluation of existing graph-based semi-supervised algorithms, such as Gaussian Fields and Harmonic Functions, Local and Global Consistency, Laplacian Eigenmaps, and Graph Transduction Through Alternating Minimization. To do so, we compare the accuracy of classifiers while varying the amount of labeled data and label noise across many different samples. Our results show that, if the dataset is consistent with SSL assumptions, we are able to detect the noisiest instances, although this becomes harder when the number of available labels decreases. Also, the Laplacian Eigenmaps algorithm performed better than label propagation when the data came from high-dimensional clusters.
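A sketch of the experimental protocol: hold out most labels, flip a fraction of the remaining ones at random, and measure transductive accuracy. scikit-learn's LabelSpreading stands in for the graph-based classifiers listed above; the dataset and parameters are illustrative.

    import numpy as np
    from sklearn.datasets import make_moons
    from sklearn.semi_supervised import LabelSpreading

    def run_trial(n_labels=20, noise_rate=0.2, seed=0):
        """One benchmark run: few labels, a fraction of them flipped at random."""
        rng = np.random.default_rng(seed)
        X, y = make_moons(n_samples=500, noise=0.1, random_state=seed)
        y_semi = np.full_like(y, -1)                   # -1 marks unlabeled points
        labeled = rng.choice(len(y), n_labels, replace=False)
        y_semi[labeled] = y[labeled]
        flip = rng.choice(labeled, int(noise_rate * n_labels), replace=False)
        y_semi[flip] = 1 - y_semi[flip]                # symmetric label noise
        model = LabelSpreading(kernel="rbf", gamma=20).fit(X, y_semi)
        return (model.transduction_ == y).mean()

    for rate in (0.0, 0.2, 0.4):
        print(f"noise={rate:.1f}  accuracy={run_trial(noise_rate=rate):.3f}")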
Abstract: Obtaining data with meaningful labels is often costly and error-prone. In this situation, semi-supervised learning (SSL) approaches are interesting, as they leverage assumptions about the unlabeled data to make up for the limited amount of labels. However, in real-world situations, we cannot assume that the labeling process is infallible, and the accuracy of many SSL classifiers decreases significantly in the presence of label noise. In this work, we introduce LGC_LVOF, a leave-one-out filtering approach based on the Local and Global Consistency (LGC) algorithm. Our method aims to detect and remove wrong labels, and thus can be used as a preprocessing step for any SSL classifier. Given the propagation matrix, detecting noisy labels takes $O(cl)$ per step, where $c$ is the number of classes and $l$ the number of labels. Moreover, one does not need to compute the whole propagation matrix, but only an $l$ by $l$ submatrix corresponding to interactions between labeled instances. As a result, our approach is best suited to datasets with a large amount of unlabeled data but few labels. Results are provided for a number of datasets, including MNIST and ISOLET. LGC_LVOF appears to be equally or more precise than the adapted gradient-based filter. We show that the best-case accuracy of embedding LGC_LVOF into LGC yields performance comparable to the best case of $\ell_1$-based classifiers designed to be robust to label noise. We also provide a heuristic to choose the number of removed instances.
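A minimal sketch of the filtering idea under the notation above: restrict the propagation matrix to the $l$ by $l$ block over labeled instances, remove its diagonal, and flag labels whose leave-one-out prediction disagrees most with the given label. For clarity the full matrix is inverted here, although, as stated above, only the labeled submatrix is needed; the graph construction and parameters are illustrative.

    import numpy as np
    from sklearn.metrics.pairwise import rbf_kernel

    def lvof_scores(X, Y, labeled_idx, alpha=0.9, gamma=0.5):
        """Leave-one-out disagreement scores over the l x l labeled submatrix."""
        W = rbf_kernel(X, gamma=gamma)
        np.fill_diagonal(W, 0.0)
        d = W.sum(axis=1)
        S = W / np.sqrt(np.outer(d, d))
        P = np.linalg.inv(np.eye(len(X)) - alpha * S)   # full inverse for clarity only
        P_ll = P[np.ix_(labeled_idx, labeled_idx)].copy()
        np.fill_diagonal(P_ll, 0.0)                     # drop each label's self-influence
        F_loo = P_ll @ Y[labeled_idx]                   # LOO predictions for labeled points
        given = Y[labeled_idx].argmax(1)
        # Large margin means the LOO prediction prefers another class: likely noise
        return F_loo.max(1) - F_loo[np.arange(len(given)), given]

    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(-1, 0.3, (40, 2)), rng.normal(1, 0.3, (40, 2))])
    Y = np.zeros((80, 2)); lab = np.arange(0, 80, 8)
    Y[lab, (lab >= 40).astype(int)] = 1
    Y[lab[0]] = [0, 1]                       # inject one wrong label
    print(lvof_scores(X, Y, lab).round(2))   # the flipped label should score highest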
Abstract: Homicide mortality is a worldwide concern and has occupied the agenda of researchers and public managers. In Brazil, homicide is the third leading cause of death in the general population and the first in the 15-39 age group. In South America, Brazil has the third highest homicide mortality, behind Venezuela and Colombia. To measure the impacts of violence, it is important to assess health systems and criminal justice, as well as other areas. In this paper, we analyze the spatial distribution of homicide mortality in the state of Goias, Center-West Brazil, where the homicide rate increased from 24.5 per 100,000 in 2002 to 42.6 per 100,000 in 2014; moreover, the state ranked fifth in homicides in Brazil in 2014. We considered socio-demographic variables for the state, performed correlation analysis, and employed three clustering algorithms: K-means, density-based, and hierarchical. The results indicate that homicide rates are higher in cities neighboring large urban centers, although these cities have the best socioeconomic indicators.
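A sketch of the clustering step on standardized socio-demographic indicators, using the three algorithm families named above; the synthetic city table, feature count, and parameters are placeholders, not the study's actual variables.

    import numpy as np
    from sklearn.preprocessing import StandardScaler
    from sklearn.cluster import KMeans, DBSCAN, AgglomerativeClustering

    # Hypothetical per-city table: homicide rate plus socio-demographic indicators
    rng = np.random.default_rng(0)
    cities = np.vstack([rng.normal(0, 1, (150, 4)), rng.normal(3, 1, (96, 4))])
    X = StandardScaler().fit_transform(cities)   # scale before distance-based clustering

    labels = {
        "k-means": KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(X),
        "density-based": DBSCAN(eps=0.9, min_samples=5).fit_predict(X),
        "hierarchical": AgglomerativeClustering(n_clusters=4).fit_predict(X),
    }
    for name, lab in labels.items():
        print(name, np.unique(lab))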
Abstract: In image processing, edge detection is a valuable tool for extracting features from an image. This detection reduces the amount of information to be processed, since redundant information (considered less relevant) can be discarded. Edge detection consists of determining the points of a digital image whose intensity changes sharply; these changes are due, for example, to discontinuities in surface orientation. A well-known edge detection method is the Difference of Gaussians (DoG), which subtracts two Gaussian kernels, one with a smaller standard deviation than the other. Convolving this difference of kernels with the input image yields the edges of the image. This paper introduces a method for extracting edges using DoG with kernels based on the q-Gaussian probability distribution, derived from the q-statistics proposed by Constantino Tsallis. To demonstrate the method's potential, we compare it with the traditional DoG using Gaussian kernels. The results show that the proposed method can extract edges with more accurate detail.
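A hedged sketch of the method: build kernels from the q-Gaussian (the Tsallis q-exponential applied to a quadratic argument, which reduces to the ordinary Gaussian as q approaches 1), subtract two of them, and convolve the difference with the image. Kernel size, normalization, and the value of q are illustrative choices, not the paper's settings.

    import numpy as np
    from scipy.signal import convolve2d

    def q_gaussian_kernel(size, sigma, q):
        """2D q-Gaussian kernel; recovers the ordinary Gaussian as q -> 1."""
        ax = np.arange(size) - size // 2
        xx, yy = np.meshgrid(ax, ax)
        u = -(xx**2 + yy**2) / (2 * sigma**2)
        if abs(q - 1.0) < 1e-8:
            k = np.exp(u)
        else:
            k = np.maximum(1 + (1 - q) * u, 0.0) ** (1 / (1 - q))  # Tsallis q-exponential
        return k / k.sum()

    def q_dog(image, sigma1=1.0, sigma2=1.6, q=1.3, size=9):
        """Difference of q-Gaussians: subtract the kernels, then convolve once."""
        dog = q_gaussian_kernel(size, sigma1, q) - q_gaussian_kernel(size, sigma2, q)
        return convolve2d(image, dog, mode="same", boundary="symm")

    # Usage on a toy image with a single vertical edge
    img = np.zeros((32, 32)); img[:, 16:] = 1.0
    edges = q_dog(img)
    print(edges.max())   # the response concentrates around the edge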