Title: FunQA: Towards Surprising Video Comprehension

Jun 26, 2023

Binzhu Xie, Sicheng Zhang, Zitang Zhou, Bo Li, Yuanhan Zhang, Jack Hessel, Jingkang Yang, Ziwei Liu

Abstract: Surprising videos, e.g., funny clips, creative performances, or visual illusions, attract significant attention. Enjoyment of these videos is not simply a response to visual stimuli; rather, it hinges on the human capacity to understand (and appreciate) commonsense violations depicted in these videos. We introduce FunQA, a challenging video question answering (QA) dataset specifically designed to evaluate and enhance the depth of video reasoning based on counter-intuitive and fun videos. Unlike most video QA benchmarks, which focus on less surprising contexts, e.g., cooking or instructional videos, FunQA covers three previously unexplored types of surprising videos: 1) HumorQA, 2) CreativeQA, and 3) MagicQA. For each subset, we establish rigorous QA tasks designed to assess the model's capability in counter-intuitive timestamp localization, detailed video description, and reasoning around counter-intuitiveness. We also pose higher-level tasks, such as attributing a fitting and vivid title to the video, and scoring the video creativity. In total, the FunQA benchmark consists of 312K free-text QA pairs derived from 4.3K video clips, spanning a total of 24 video hours. Extensive experiments with existing VideoQA models reveal significant performance gaps for the FunQA videos across spatial-temporal reasoning, visual-centered reasoning, and free-text generation.

* Ask VLMs about humor, creativity, and magic. Project page: https://funqa-benchmark.github.io/ Codebase: https://github.com/Jingkang50/FunQA
