Picture for Sasha Sheng

Sasha Sheng

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

Add code
Apr 28, 2022
Figure 1 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 2 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 3 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 4 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Viaarxiv icon

Human-Adversarial Visual Question Answering

Add code
Jun 04, 2021
Figure 1 for Human-Adversarial Visual Question Answering
Figure 2 for Human-Adversarial Visual Question Answering
Figure 3 for Human-Adversarial Visual Question Answering
Figure 4 for Human-Adversarial Visual Question Answering
Viaarxiv icon