Picture for Michael K. Chen

Michael K. Chen

JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models

Add code
Jan 24, 2025
Viaarxiv icon