Picture for Kavish

Kavish

DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models

Add code
Feb 10, 2025
Viaarxiv icon