Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vitali Petsiuk

Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models

Apr 21, 2024

Vitali Petsiuk, Kate Saenko

Abstract:Motivated by ethical and legal concerns, the scientific community is actively developing methods to limit the misuse of Text-to-Image diffusion models for reproducing copyrighted, violent, explicit, or personal information in the generated images. Simultaneously, researchers put these newly developed safety measures to the test by assuming the role of an adversary to find vulnerabilities and backdoors in them. We use compositional property of diffusion models, which allows to leverage multiple prompts in a single image generation. This property allows us to combine other concepts, that should not have been affected by the inhibition, to reconstruct the vector, responsible for target concept generation, even though the direct computation of this vector is no longer accessible. We provide theoretical and empirical evidence why the proposed attacks are possible and discuss the implications of these findings for safe model deployment. We argue that it is essential to consider all possible approaches to image generation with diffusion models that can be employed by an adversary. Our work opens up the discussion about the implications of concept arithmetics and compositional inference for safety mechanisms in diffusion models. Content Advisory: This paper contains discussions and model-generated content that may be considered offensive. Reader discretion is advised. Project page: https://cs-people.bu.edu/vpetsiuk/arc

Via

Access Paper or Ask Questions

Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

Nov 22, 2022

Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret(+4 more)

Figure 1 for Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

Figure 2 for Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

Figure 3 for Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

Figure 4 for Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

Abstract:We provide a new multi-task benchmark for evaluating text-to-image models. We perform a human evaluation comparing the most common open-source (Stable Diffusion) and commercial (DALL-E 2) models. Twenty computer science AI graduate students evaluated the two models, on three tasks, at three difficulty levels, across ten prompts each, providing 3,600 ratings. Text-to-image generation has seen rapid progress to the point that many recent models have demonstrated their ability to create realistic high-resolution images for various prompts. However, current text-to-image methods and the broader body of research in vision-language understanding still struggle with intricate text prompts that contain many objects with multiple attributes and relationships. We introduce a new text-to-image benchmark that contains a suite of thirty-two tasks over multiple applications that capture a model's ability to handle different features of a text prompt. For example, asking a model to generate a varying number of the same object to measure its ability to count or providing a text prompt with several objects that each have a different attribute to identify its ability to match objects and attributes correctly. Rather than subjectively evaluating text-to-image results on a set of prompts, our new multi-task benchmark consists of challenge tasks at three difficulty levels (easy, medium, and hard) and human ratings for each generated image.

* NeurIPS 2022 Workshop on Human Evaluation of Generative Models (HEGM)

Via

Access Paper or Ask Questions

Black-box Explanation of Object Detectors via Saliency Maps

Jun 05, 2020

Vitali Petsiuk, Rajiv Jain, Varun Manjunatha, Vlad I. Morariu, Ashutosh Mehra, Vicente Ordonez, Kate Saenko

Figure 1 for Black-box Explanation of Object Detectors via Saliency Maps

Figure 2 for Black-box Explanation of Object Detectors via Saliency Maps

Figure 3 for Black-box Explanation of Object Detectors via Saliency Maps

Figure 4 for Black-box Explanation of Object Detectors via Saliency Maps

Abstract:We propose D-RISE, a method for generating visual explanations for the predictions of object detectors. D-RISE can be considered "black-box" in the software testing sense, it only needs access to the inputs and outputs of an object detector. Compared to gradient-based methods, D-RISE is more general and agnostic to the particular type of object detector being tested as it does not need to know about the inner workings of the model. We show that D-RISE can be easily applied to different object detectors including one-stage detectors such as YOLOv3 and two-stage detectors such as Faster-RCNN. We present a detailed analysis of the generated visual explanations to highlight the utilization of context and the possible biases learned by object detectors.

Via

Access Paper or Ask Questions

Why do These Match? Explaining the Behavior of Image Similarity Models

May 26, 2019

Bryan A. Plummer, Mariya I. Vasileva, Vitali Petsiuk, Kate Saenko, David Forsyth

Figure 1 for Why do These Match? Explaining the Behavior of Image Similarity Models

Figure 2 for Why do These Match? Explaining the Behavior of Image Similarity Models

Figure 3 for Why do These Match? Explaining the Behavior of Image Similarity Models

Figure 4 for Why do These Match? Explaining the Behavior of Image Similarity Models

Abstract:Explaining a deep learning model can help users understand its behavior and allow researchers to discern its shortcomings. Recent work has primarily focused on explaining models for tasks like image classification or visual question answering. In this paper, we introduce an explanation approach for image similarity models, where a model's output is a semantic feature representation rather than a classification. In this task, an explanation depends on both of the input images, so standard methods do not apply. We propose an explanation method that pairs a saliency map identifying important image regions with an attribute that best explains the match. We find that our explanations are more human-interpretable than saliency maps alone, and can also improve performance on the classic task of attribute recognition. The ability of our approach to generalize is demonstrated on two datasets from very different domains, Polyvore Outfits and Animals with Attributes 2.

Via

Access Paper or Ask Questions

Guided Zoom: Questioning Network Evidence for Fine-grained Classification

Dec 06, 2018

Sarah Adel Bargal, Andrea Zunino, Vitali Petsiuk, Jianming Zhang, Kate Saenko, Vittorio Murino, Stan Sclaroff

Figure 1 for Guided Zoom: Questioning Network Evidence for Fine-grained Classification

Figure 2 for Guided Zoom: Questioning Network Evidence for Fine-grained Classification

Figure 3 for Guided Zoom: Questioning Network Evidence for Fine-grained Classification

Figure 4 for Guided Zoom: Questioning Network Evidence for Fine-grained Classification

Abstract:We propose Guided Zoom, an approach that utilizes spatial grounding to make more informed predictions. It does so by making sure the model has "the right reasons" for a prediction, being defined as reasons that are coherent with those used to make similar correct decisions at training time. The reason/evidence upon which a deep neural network makes a prediction is defined to be the spatial grounding, in the pixel space, for a specific class conditional probability in the model output. Guided Zoom questions how reasonable the evidence used to make a prediction is. In state-of-the-art deep single-label classification models, the top-k (k = 2, 3, 4, ...) accuracy is usually significantly higher than the top-1 accuracy. This is more evident in fine-grained datasets, where differences between classes are quite subtle. We show that Guided Zoom results in the refinement of a model's classification accuracy on three finegrained classification datasets. We also explore the complementarity of different grounding techniques, by comparing their ensemble to an adversarial erasing approach that iteratively reveals the next most discriminative evidence.

Via

Access Paper or Ask Questions

RISE: Randomized Input Sampling for Explanation of Black-box Models

Sep 25, 2018

Vitali Petsiuk, Abir Das, Kate Saenko

Figure 1 for RISE: Randomized Input Sampling for Explanation of Black-box Models

Figure 2 for RISE: Randomized Input Sampling for Explanation of Black-box Models

Figure 3 for RISE: Randomized Input Sampling for Explanation of Black-box Models

Figure 4 for RISE: Randomized Input Sampling for Explanation of Black-box Models

Abstract:Deep neural networks are being used increasingly to automate data analysis and decision making, yet their decision-making process is largely unclear and is difficult to explain to the end users. In this paper, we address the problem of Explainable AI for deep neural networks that take images as input and output a class probability. We propose an approach called RISE that generates an importance map indicating how salient each pixel is for the model's prediction. In contrast to white-box approaches that estimate pixel importance using gradients or other internal network state, RISE works on black-box models. It estimates importance empirically by probing the model with randomly masked versions of the input image and obtaining the corresponding outputs. We compare our approach to state-of-the-art importance extraction methods using both an automatic deletion/insertion metric and a pointing metric based on human-annotated object segments. Extensive experiments on several benchmark datasets show that our approach matches or exceeds the performance of other methods, including white-box approaches. Project page: http://cs-people.bu.edu/vpetsiuk/rise/

Via

Access Paper or Ask Questions