Andrea Storås

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Sep 02, 2024

Sushant Gautam, Andrea Storås, Cise Midoglu, Steven A. Hicks, Vajira Thambawita, Pål Halvorsen, Michael A. Riegler

Abstract: We introduce Kvasir-VQA, an extended dataset derived from the HyperKvasir and Kvasir-Instrument datasets, augmented with question-and-answer annotations to facilitate advanced machine learning tasks in gastrointestinal (GI) diagnostics. This dataset comprises 6,500 annotated images spanning various GI tract conditions and surgical instruments, and it supports multiple question types including yes/no, choice, location, and numerical count. The dataset is intended for applications such as image captioning, Visual Question Answering (VQA), text-based generation of synthetic medical images, object detection, and classification. Our experiments demonstrate the dataset's effectiveness in training models for three selected tasks, showcasing significant applications in medical image analysis and diagnostics. We also present evaluation metrics for each task, highlighting the usability and versatility of our dataset. The dataset and supporting artifacts are available at https://datasets.simula.no/kvasir-vqa.

* to be published in VLM4Bio 2024, part of the ACM Multimedia (ACM MM) conference 2024

Visual explanations for polyp detection: How medical doctors assess intrinsic versus extrinsic explanations

Mar 23, 2022

Steven Hicks, Andrea Storås, Michael Riegler, Cise Midoglu, Malek Hammou, Thomas de Lange, Sravanthi Parasa, Pål Halvorsen, Inga Strümke

Abstract: Deep learning has in recent years achieved immense success in all areas of computer vision and has the potential to assist medical doctors in analyzing visual content for disease and other abnormalities. However, the current state of deep learning is very much a black box, making medical professionals highly skeptical about integrating these methods into clinical practice. Several methods have been proposed to shine some light onto these black boxes, but there is no consensus on the opinion of the medical doctors who will consume these explanations. This paper presents a study asking medical doctors about their opinion of current state-of-the-art explainable artificial intelligence methods when applied to a gastrointestinal disease detection use case. We compare two different categories of explanation methods, intrinsic and extrinsic, and gauge their opinion of the current value of these explanations. The results indicate that intrinsic explanations are preferred and that explanation.
