Zheqi He

Emu3: Next-Token Prediction is All You Need

Sep 27, 2024
Via arXiv

Evaluating Attribute Comprehension in Large Vision-Language Models

Aug 25, 2024
Via arXiv

CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning

Jan 26, 2024
Via arXiv