Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jurgen P. Schulze

Improving Users' Mental Model with Attention-directed Counterfactual Edits

Oct 15, 2021

Kamran Alipour, Arijit Ray, Xiao Lin, Michael Cogswell, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

Figure 1 for Improving Users' Mental Model with Attention-directed Counterfactual Edits

Figure 2 for Improving Users' Mental Model with Attention-directed Counterfactual Edits

Figure 3 for Improving Users' Mental Model with Attention-directed Counterfactual Edits

Figure 4 for Improving Users' Mental Model with Attention-directed Counterfactual Edits

Abstract:In the domain of Visual Question Answering (VQA), studies have shown improvement in users' mental model of the VQA system when they are exposed to examples of how these systems answer certain Image-Question (IQ) pairs. In this work, we show that showing controlled counterfactual image-question examples are more effective at improving the mental model of users as compared to simply showing random examples. We compare a generative approach and a retrieval-based approach to show counterfactual examples. We use recent advances in generative adversarial networks (GANs) to generate counterfactual images by deleting and inpainting certain regions of interest in the image. We then expose users to changes in the VQA system's answer on those altered images. To select the region of interest for inpainting, we experiment with using both human-annotated attention maps and a fully automatic method that uses the VQA system's attention values. Finally, we test the user's mental model by asking them to predict the model's performance on a test counterfactual image. We note an overall improvement in users' accuracy to predict answer change when shown counterfactual explanations. While realistic retrieved counterfactuals obviously are the most effective at improving the mental model, we show that a generative approach can also be equally effective.

* Accepted for publication in Applied AI Letters

Via

Access Paper or Ask Questions

The Impact of Explanations on AI Competency Prediction in VQA

Jul 02, 2020

Kamran Alipour, Arijit Ray, Xiao Lin, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

Figure 1 for The Impact of Explanations on AI Competency Prediction in VQA

Figure 2 for The Impact of Explanations on AI Competency Prediction in VQA

Figure 3 for The Impact of Explanations on AI Competency Prediction in VQA

Figure 4 for The Impact of Explanations on AI Competency Prediction in VQA

Abstract:Explainability is one of the key elements for building trust in AI systems. Among numerous attempts to make AI explainable, quantifying the effect of explanations remains a challenge in conducting human-AI collaborative tasks. Aside from the ability to predict the overall behavior of AI, in many applications, users need to understand an AI agent's competency in different aspects of the task domain. In this paper, we evaluate the impact of explanations on the user's mental model of AI agent competency within the task of visual question answering (VQA). We quantify users' understanding of competency, based on the correlation between the actual system performance and user rankings. We introduce an explainable VQA system that uses spatial and object features and is powered by the BERT language model. Each group of users sees only one kind of explanation to rank the competencies of the VQA model. The proposed model is evaluated through between-subject experiments to probe explanations' impact on the user's perception of competency. The comparison between two VQA models shows BERT based explanations and the use of object features improve the user's prediction of the model's competencies.

* Submitted to HCCAI 2020

Via

Access Paper or Ask Questions

Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging

Apr 19, 2020

Ali Hariri, Kamran Alipour, Yash Mantri, Jurgen P. Schulze, Jesse V. Jokerst

Figure 1 for Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging

Figure 2 for Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging

Figure 3 for Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging

Figure 4 for Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging

Abstract:Low fluence illumination sources can facilitate clinical transition of photoacoustic imaging because they are rugged, portable, affordable, and safe. However, these sources also decrease image quality due to their low fluence. Here, we propose a denoising method using a multi-level wavelet-convolutional neural network to map low fluence illumination source images to its corresponding high fluence excitation map. Quantitative and qualitative results show a significant potential to remove the background noise and preserve the structures of target. Substantial improvements up to 2.20, 2.25, and 4.3-fold for PSNR, SSIM, and CNR metrics were observed, respectively. We also observed enhanced contrast (up to 1.76-fold) in an in vivo application using our proposed methods. We suggest that this tool can improve the value of such sources in photoacoustic imaging.

* submitted to Biomedical Optics Express journal

Via

Access Paper or Ask Questions

A Study on Multimodal and Interactive Explanations for Visual Question Answering

Mar 01, 2020

Kamran Alipour, Jurgen P. Schulze, Yi Yao, Avi Ziskind, Giedrius Burachas

Figure 1 for A Study on Multimodal and Interactive Explanations for Visual Question Answering

Figure 2 for A Study on Multimodal and Interactive Explanations for Visual Question Answering

Figure 3 for A Study on Multimodal and Interactive Explanations for Visual Question Answering

Figure 4 for A Study on Multimodal and Interactive Explanations for Visual Question Answering

Abstract:Explainability and interpretability of AI models is an essential factor affecting the safety of AI. While various explainable AI (XAI) approaches aim at mitigating the lack of transparency in deep networks, the evidence of the effectiveness of these approaches in improving usability, trust, and understanding of AI systems are still missing. We evaluate multimodal explanations in the setting of a Visual Question Answering (VQA) task, by asking users to predict the response accuracy of a VQA agent with and without explanations. We use between-subjects and within-subjects experiments to probe explanation effectiveness in terms of improving user prediction accuracy, confidence, and reliance, among other factors. The results indicate that the explanations help improve human prediction accuracy, especially in trials when the VQA system's answer is inaccurate. Furthermore, we introduce active attention, a novel method for evaluating causal attentional effects through intervention by editing attention maps. User explanation ratings are strongly correlated with human prediction accuracy and suggest the efficacy of these explanations in human-machine AI collaboration tasks.

* Proceedings of the Workshop on Artificial Intelligence Safety (SafeAI 2020) co-located with 34th AAAI Conference on Artificial Intelligence (AAAI 2020), New York, USA, Feb 7, 2020
* http://ceur-ws.org/Vol-2560/paper44.pdf

Via

Access Paper or Ask Questions