Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Jul 10, 2021

Xuejiao Tang, Xin Huang, Wenbin Zhang, Travers B. Child, Qiong Hu, Zhen Liu, Ji Zhang

Figure 1 for Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Figure 2 for Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Figure 3 for Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Figure 4 for Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Share this with someone who'll enjoy it:

Abstract:Visual Commonsense Reasoning (VCR) predicts an answer with corresponding rationale, given a question-image input. VCR is a recently introduced visual scene understanding task with a wide range of applications, including visual question answering, automated vehicle systems, and clinical decision support. Previous approaches to solving the VCR task generally rely on pre-training or exploiting memory with long dependency relationship encoded models. However, these approaches suffer from a lack of generalizability and prior knowledge. In this paper we propose a dynamic working memory based cognitive VCR network, which stores accumulated commonsense between sentences to provide prior knowledge for inference. Extensive experiments show that the proposed model yields significant improvements over existing methods on the benchmark VCR dataset. Moreover, the proposed model provides intuitive interpretation into visual commonsense reasoning. A Python implementation of our mechanism is publicly available at https://github.com/tanjatang/DMVCR

* DaWaK 2021

View paper on

Share this with someone who'll enjoy it:

Title:Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory

Paper and Code