Picture for Anni Zou

Anni Zou

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems

Add code
Jul 15, 2024
Viaarxiv icon

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning

Add code
Nov 16, 2023
Viaarxiv icon

Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models

Add code
Oct 11, 2023
Viaarxiv icon

Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact Verification

Add code
May 10, 2023
Viaarxiv icon