Yash Saxena

REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs

May 03, 2024
Via arXiv

Evaluating Consistency and Reasoning Capabilities of Large Language Models

Apr 25, 2024
Via arXiv

Deploying and Evaluating LLMs to Program Service Mobile Robots

Nov 18, 2023
Via arXiv