Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kurtis Evan David

Debiasing Convolutional Neural Networks via Meta Orthogonalization

Nov 15, 2020

Kurtis Evan David, Qiang Liu, Ruth Fong

Figure 1 for Debiasing Convolutional Neural Networks via Meta Orthogonalization

Figure 2 for Debiasing Convolutional Neural Networks via Meta Orthogonalization

Figure 3 for Debiasing Convolutional Neural Networks via Meta Orthogonalization

Figure 4 for Debiasing Convolutional Neural Networks via Meta Orthogonalization

Abstract:While deep learning models often achieve strong task performance, their successes are hampered by their inability to disentangle spurious correlations from causative factors, such as when they use protected attributes (e.g., race, gender, etc.) to make decisions. In this work, we tackle the problem of debiasing convolutional neural networks (CNNs) in such instances. Building off of existing work on debiasing word embeddings and model interpretability, our Meta Orthogonalization method encourages the CNN representations of different concepts (e.g., gender and class labels) to be orthogonal to one another in activation space while maintaining strong downstream task performance. Through a variety of experiments, we systematically test our method and demonstrate that it significantly mitigates model bias and is competitive against current adversarial debiasing methods.

* Accepted to NeuRIPS 2020 Workshop on Algorithmic Fairness through the Lens of Causality and Interpretability (AFCI). Supplemental materials provided at: https://drive.google.com/drive/folders/1klIAqZDgg3sCVmzFjLw5Y_T-GTc2E3oh?usp=sharing

Via

Access Paper or Ask Questions

GANchors: Realistic Image Perturbation Distributions for Anchors Using Generative Models

Jun 01, 2019

Kurtis Evan David, Harrison Keane, Jun Min Noh

Figure 1 for GANchors: Realistic Image Perturbation Distributions for Anchors Using Generative Models

Figure 2 for GANchors: Realistic Image Perturbation Distributions for Anchors Using Generative Models

Figure 3 for GANchors: Realistic Image Perturbation Distributions for Anchors Using Generative Models

Figure 4 for GANchors: Realistic Image Perturbation Distributions for Anchors Using Generative Models

Abstract:We extend and improve the work of Model Agnostic Anchors for explanations on image classification through the use of generative adversarial networks (GANs). Using GANs, we generate samples from a more realistic perturbation distribution, by optimizing under a lower dimensional latent space. This increases the trust in an explanation, as results now come from images that are more likely to be found in the original training set of a classifier, rather than an overlay of random images. A large drawback to our method is the computational complexity of sampling through optimization; to address this, we implement more efficient algorithms, including a diverse encoder. Lastly, we share results from the MNIST and CelebA datasets, and note that our explanations can lead to smaller and higher precision anchors.

* Final project for the Fair and Transparent Machine Learning course at UT Austin -- taught by Dr. Joydeep Ghosh

Via

Access Paper or Ask Questions