Bei Yan

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Jun 27, 2024
Via arXiv

Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models

Jun 24, 2024
Via arXiv