Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Dec 03, 2022

Christopher Beckham, Martin Weiss, Florian Golemo, Sina Honari, Derek Nowrouzezahrai, Christopher Pal

Figure 1 for Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Figure 2 for Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Figure 3 for Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Figure 4 for Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Share this with someone who'll enjoy it:

Abstract:Different types of mental rotation tests have been used extensively in psychology to understand human visual reasoning and perception. Understanding what an object or visual scene would look like from another viewpoint is a challenging problem that is made even harder if it must be performed from a single image. We explore a controlled setting whereby questions are posed about the properties of a scene if that scene was observed from another viewpoint. To do this we have created a new version of the CLEVR dataset that we call CLEVR Mental Rotation Tests (CLEVR-MRT). Using CLEVR-MRT we examine standard methods, show how they fall short, then explore novel neural architectures that involve inferring volumetric representations of a scene. These volumes can be manipulated via camera-conditioned transformations to answer the question. We examine the efficacy of different model variants through rigorous ablations and demonstrate the efficacy of volumetric representations.

* Accepted for publication to Pattern Recognition journal

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

Paper and Code