Venkat Kodali

Recent, rapid advancement in visual question answering architecture: a review

Mar 31, 2022

Venkat Kodali, Daniel Berleant

Abstract: Understanding visual question answering is going to be crucial for numerous human activities. However, it presents major challenges at the heart of the artificial intelligence endeavor. This paper presents an update on the rapid advancements in visual question answering using images that have occurred in the last couple of years. Tremendous growth in research on improving visual question answering system architecture has been published recently, showing the importance of multimodal architectures. Several points on the benefits of visual question answering are mentioned in the review paper by Manmadhan et al. (2020), on which the present article builds, including subsequent updates in the field.

* 11 pages
