Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lalana Kagal

Massachusetts Institute of Technology

Towards Resource Efficient and Interpretable Bias Mitigation in Large Language Models

Dec 02, 2024

Schrasing Tong, Eliott Zemour, Rawisara Lohanimit, Lalana Kagal

Abstract:Although large language models (LLMs) have demonstrated their effectiveness in a wide range of applications, they have also been observed to perpetuate unwanted biases present in the training data, potentially leading to harm for marginalized communities. In this paper, we mitigate bias by leveraging small biased and anti-biased expert models to obtain a debiasing signal that will be added to the LLM output at decoding-time. This approach combines resource efficiency with interpretability and can be optimized for mitigating specific types of bias, depending on the target use case. Experiments on mitigating gender, race, and religion biases show a reduction in bias on several local and global bias metrics while preserving language model performance.

* 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Safe Generative AI Workshop

Via

Access Paper or Ask Questions

Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Jun 17, 2021

Vaikkunth Mugunthan, Pawan Goyal, Lalana Kagal

Figure 1 for Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Figure 2 for Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Figure 3 for Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Figure 4 for Multi-VFL: A Vertical Federated Learning System for Multiple Data and Label Owners

Abstract:Vertical Federated Learning (VFL) refers to the collaborative training of a model on a dataset where the features of the dataset are split among multiple data owners, while label information is owned by a single data owner. In this paper, we propose a novel method, Multi Vertical Federated Learning (Multi-VFL), to train VFL models when there are multiple data and label owners. Our approach is the first to consider the setting where $D$-data owners (across which features are distributed) and $K$-label owners (across which labels are distributed) exist. This proposed configuration allows different entities to train and learn optimal models without having to share their data. Our framework makes use of split learning and adaptive federated optimizers to solve this problem. For empirical evaluation, we run experiments on the MNIST and FashionMNIST datasets. Our results show that using adaptive optimizers for model aggregation fastens convergence and improves accuracy.

Via

Access Paper or Ask Questions

Bias-Free FedGAN

Mar 17, 2021

Vaikkunth Mugunthan, Vignesh Gokul, Lalana Kagal, Shlomo Dubnov

Abstract:Federated Generative Adversarial Network (FedGAN) is a communication-efficient approach to train a GAN across distributed clients without clients having to share their sensitive training data. In this paper, we experimentally show that FedGAN generates biased data points under non-independent-and-identically-distributed (non-iid) settings. Also, we propose Bias-Free FedGAN, an approach to generate bias-free synthetic datasets using FedGAN. Bias-Free FedGAN has the same communication cost as that of FedGAN. Experimental results on image datasets (MNIST and FashionMNIST) validate our claims.

Via

Access Paper or Ask Questions

Investigating Bias in Image Classification using Model Explanations

Dec 10, 2020

Schrasing Tong, Lalana Kagal

Figure 1 for Investigating Bias in Image Classification using Model Explanations

Figure 2 for Investigating Bias in Image Classification using Model Explanations

Figure 3 for Investigating Bias in Image Classification using Model Explanations

Figure 4 for Investigating Bias in Image Classification using Model Explanations

Abstract:We evaluated whether model explanations could efficiently detect bias in image classification by highlighting discriminating features, thereby removing the reliance on sensitive attributes for fairness calculations. To this end, we formulated important characteristics for bias detection and observed how explanations change as the degree of bias in models change. The paper identifies strengths and best practices for detecting bias using explanations, as well as three main weaknesses: explanations poorly estimate the degree of bias, could potentially introduce additional bias into the analysis, and are sometimes inefficient in terms of human effort involved.

Via

Access Paper or Ask Questions

DPD-InfoGAN: Differentially Private Distributed InfoGAN

Oct 24, 2020

Vaikkunth Mugunthan, Vignesh Gokul, Lalana Kagal, Shlomo Dubnov

Figure 1 for DPD-InfoGAN: Differentially Private Distributed InfoGAN

Figure 2 for DPD-InfoGAN: Differentially Private Distributed InfoGAN

Figure 3 for DPD-InfoGAN: Differentially Private Distributed InfoGAN

Figure 4 for DPD-InfoGAN: Differentially Private Distributed InfoGAN

Abstract:Generative Adversarial Networks (GANs) are deep learning architectures capable of generating synthetic datasets. Despite producing high-quality synthetic images, the default GAN has no control over the kinds of images it generates. The Information Maximizing GAN (InfoGAN) is a variant of the default GAN that introduces feature-control variables that are automatically learned by the framework, hence providing greater control over the different kinds of images produced. Due to the high model complexity of InfoGAN, the generative distribution tends to be concentrated around the training data points. This is a critical problem as the models may inadvertently expose the sensitive and private information present in the dataset. To address this problem, we propose a differentially private version of InfoGAN (DP-InfoGAN). We also extend our framework to a distributed setting (DPD-InfoGAN) to allow clients to learn different attributes present in other clients' datasets in a privacy-preserving manner. In our experiments, we show that both DP-InfoGAN and DPD-InfoGAN can synthesize high-quality images with flexible control over image attributes while preserving privacy.

Via

Access Paper or Ask Questions

BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning

Jul 08, 2020

Vaikkunth Mugunthan, Ravi Rahman, Lalana Kagal

Figure 1 for BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning

Figure 2 for BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning

Figure 3 for BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning

Figure 4 for BlockFLow: An Accountable and Privacy-Preserving Solution for Federated Learning

Abstract:Federated learning enables the development of a machine learning model among collaborating agents without requiring them to share their underlying data. However, malicious agents who train on random data, or worse, on datasets with the result classes inverted, can weaken the combined model. BlockFLow is an accountable federated learning system that is fully decentralized and privacy-preserving. Its primary goal is to reward agents proportional to the quality of their contribution while protecting the privacy of the underlying datasets and being resilient to malicious adversaries. Specifically, BlockFLow incorporates differential privacy, introduces a novel auditing mechanism for model contribution, and uses Ethereum smart contracts to incentivize good behavior. Unlike existing auditing and accountability methods for federated learning systems, our system does not require a centralized test dataset, sharing of datasets between the agents, or one or more trusted auditors; it is fully decentralized and resilient up to a 50% collusion attack in a malicious trust model. When run on the public Ethereum blockchain, BlockFLow uses the results from the audit to reward parties with cryptocurrency based on the quality of their contribution. We evaluated BlockFLow on two datasets that offer classification tasks solvable via logistic regression models. Our results show that the resultant auditing scores reflect the quality of the honest agents' datasets. Moreover, the scores from dishonest agents are statistically lower than those from the honest agents. These results, along with the reasonable blockchain costs, demonstrate the effectiveness of BlockFLow as an accountable federated learning system.

Via

Access Paper or Ask Questions

PrivacyFL: A simulator for privacy-preserving and secure federated learning

Feb 19, 2020

Vaikkunth Mugunthan, Anton Peraire-Bueno, Lalana Kagal

Figure 1 for PrivacyFL: A simulator for privacy-preserving and secure federated learning

Figure 2 for PrivacyFL: A simulator for privacy-preserving and secure federated learning

Figure 3 for PrivacyFL: A simulator for privacy-preserving and secure federated learning

Figure 4 for PrivacyFL: A simulator for privacy-preserving and secure federated learning

Abstract:Federated learning is a technique that enables distributed clients to collaboratively learn a shared machine learning model while keeping their training data localized. This reduces data privacy risks, however, privacy concerns still exist since it is possible to leak information about the training dataset from the trained model's weights or parameters. Setting up a federated learning environment, especially with security and privacy guarantees, is a time-consuming process with numerous configurations and parameters that can be manipulated. In order to help clients ensure that collaboration is feasible and to check that it improves their model accuracy, a real-world simulator for privacy-preserving and secure federated learning is required. In this paper, we introduce PrivacyFL, which is an extensible, easily configurable and scalable simulator for federated learning environments. Its key features include latency simulation, robustness to client departure, support for both centralized and decentralized learning, and configurable privacy and security mechanisms based on differential privacy and secure multiparty computation. In this paper, we motivate our research, describe the architecture of the simulator and associated protocols, and discuss its evaluation in numerous scenarios that highlight its wide range of functionality and its advantages. Our paper addresses a significant real-world problem: checking the feasibility of participating in a federated learning environment under a variety of circumstances. It also has a strong practical impact because organizations such as hospitals, banks, and research institutes, which have large amounts of sensitive data and would like to collaborate, would greatly benefit from having a system that enables them to do so in a privacy-preserving and secure manner.

* 15 pages

Via

Access Paper or Ask Questions

Explaining Explanations: An Approach to Evaluating Interpretability of Machine Learning

Jun 04, 2018

Leilani H. Gilpin, David Bau, Ben Z. Yuan, Ayesha Bajwa, Michael Specter, Lalana Kagal

Figure 1 for Explaining Explanations: An Approach to Evaluating Interpretability of Machine Learning

Figure 2 for Explaining Explanations: An Approach to Evaluating Interpretability of Machine Learning

Abstract:There has recently been a surge of work in explanatory artificial intelligence (XAI). This research area tackles the important problem that complex machines and algorithms often cannot provide insights into their behavior and thought processes. XAI allows users and parts of the internal system to be more transparent, providing explanations of their decisions in some level of detail. These explanations are important to ensure algorithmic fairness, identify potential bias/problems in the training data, and to ensure that the algorithms perform as expected. However, explanations produced by these systems is neither standardized nor systematically assessed. In an effort to create best practices and identify open challenges, we provide our definition of explainability and show how it can be used to classify existing literature. We discuss why current approaches to explanatory methods especially for deep neural networks are insufficient. Finally, based on our survey, we conclude with suggested future research directions for explanatory artificial intelligence.

* Edited author email

Via

Access Paper or Ask Questions

Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Nov 15, 2016

Julius Adebayo, Lalana Kagal

Figure 1 for Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Abstract:Predictive models are increasingly deployed for the purpose of determining access to services such as credit, insurance, and employment. Despite potential gains in productivity and efficiency, several potential problems have yet to be addressed, particularly the potential for unintentional discrimination. We present an iterative procedure, based on orthogonal projection of input attributes, for enabling interpretability of black-box predictive models. Through our iterative procedure, one can quantify the relative dependence of a black-box model on its input attributes.The relative significance of the inputs to a predictive model can then be used to assess the fairness (or discriminatory extent) of such a model.

Via

Access Paper or Ask Questions