Caiqi Zhang

Value of Information: A Framework for Human-Agent Communication

Jan 10, 2026
Via arXiv

Agent-as-a-Judge

Jan 08, 2026
Via arXiv

Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective

Jan 06, 2026
Via arXiv

Confidence Estimation for LLMs in Multi-turn Interactions

Jan 05, 2026
Via arXiv

All Roads Lead to Rome: Graph-Based Confidence Estimation for Large Language Model Reasoning

Sep 16, 2025
Via arXiv

Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation

May 29, 2025
Via arXiv

UNCLE: Uncertainty Expressions in Long-Form Generation

May 22, 2025
Via arXiv

Visual Planning: Let's Think Only with Images

May 16, 2025
Via arXiv

A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs

May 13, 2025
Via arXiv

Supposedly Equivalent Facts That Aren't? Entity Frequency in Pre-training Induces Asymmetry in LLMs

Mar 28, 2025
Via arXiv