Picture for Fanyi Pu

Fanyi Pu

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Add code
Jan 23, 2025
Figure 1 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 2 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 3 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 4 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Viaarxiv icon

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Add code
Jul 17, 2024
Viaarxiv icon

WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning

Add code
May 06, 2024
Figure 1 for WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
Figure 2 for WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
Figure 3 for WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
Figure 4 for WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
Viaarxiv icon

OtterHD: A High-Resolution Multi-modality Model

Add code
Nov 07, 2023
Viaarxiv icon

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Add code
Jun 08, 2023
Figure 1 for MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Figure 2 for MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Figure 3 for MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Figure 4 for MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Viaarxiv icon