Ruoxi Ning

NovelQA: A Benchmark for Long-Range Novel Question Answering

Mar 18, 2024
Via arXiv

GLoRE: Evaluating Logical Reasoning of Large Language Models

Oct 13, 2023
Via arXiv

Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4

Apr 20, 2023
Via arXiv