Picture for Shaoyang Guo

Shaoyang Guo

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

Add code
Apr 22, 2025
Viaarxiv icon