Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Medical visual question answering using joint self-supervised learning

Feb 25, 2023

Yuan Zhou, Jing Mei, Yiqin Yu, Tanveer Syeda-Mahmood

Figure 1 for Medical visual question answering using joint self-supervised learning

Figure 2 for Medical visual question answering using joint self-supervised learning

Figure 3 for Medical visual question answering using joint self-supervised learning

Figure 4 for Medical visual question answering using joint self-supervised learning

Share this with someone who'll enjoy it:

Abstract:Visual Question Answering (VQA) becomes one of the most active research problems in the medical imaging domain. A well-known VQA challenge is the intrinsic diversity between the image and text modalities, and in the medical VQA task, there is another critical problem relying on the limited size of labelled image-question-answer data. In this study we propose an encoder-decoder framework that leverages the image-text joint representation learned from large-scaled medical image-caption data and adapted to the small-sized medical VQA task. The encoder embeds across the image-text dual modalities with self-attention mechanism and is independently pre-trained on the large-scaled medical image-caption dataset by multiple self-supervised learning tasks. Then the decoder is connected to the top of the encoder and fine-tuned using the small-sized medical VQA dataset. The experiment results present that our proposed method achieves better performance comparing with the baseline and SOTA methods.

View paper on

Share this with someone who'll enjoy it:

Title:Medical visual question answering using joint self-supervised learning

Paper and Code