Qunbo Wang

Boter: Bootstrapping Knowledge Selection and Question Answering for Knowledge-based VQA

Apr 22, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA

Mar 15, 2024

VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

May 29, 2023