Picture for Aliasgahr Khani

Aliasgahr Khani

MMLU-Pro+: Evaluating Higher-Order Reasoning and Shortcut Learning in LLMs

Add code
Sep 03, 2024
Viaarxiv icon