Picture for Bei Yan

Bei Yan

M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

Add code
Dec 30, 2024
Viaarxiv icon

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Add code
Jun 27, 2024
Viaarxiv icon

Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models

Add code
Jun 24, 2024
Viaarxiv icon