Chelsea Zou

Michael Pokorny

CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs

May 10, 2026

Talk is Cheap, Communication is Hard: Dynamic Grounding Failures and Repair in Multi-Agent Negotiation

May 03, 2026

A Unified Definition of Hallucination, Or: It's the World Model, Stupid

Dec 25, 2025

Humanity's Last Exam

Jan 24, 2025

Abstracted Gaussian Prototypes for One-Shot Concept Learning

Aug 30, 2024

ARDIE: AR, Dialogue, and Eye Gaze Policies for Human-Robot Collaboration

May 08, 2023