Picture for Xuandong Zhao

Xuandong Zhao

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Add code
Jan 17, 2026
Viaarxiv icon

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Add code
Jan 02, 2026
Viaarxiv icon

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

Add code
Jul 10, 2025
Viaarxiv icon

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Add code
Jun 17, 2025
Viaarxiv icon

OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models

Add code
May 28, 2025
Viaarxiv icon

Learning to Reason without External Rewards

Add code
May 26, 2025
Figure 1 for Learning to Reason without External Rewards
Figure 2 for Learning to Reason without External Rewards
Figure 3 for Learning to Reason without External Rewards
Figure 4 for Learning to Reason without External Rewards
Viaarxiv icon

Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services

Add code
May 24, 2025
Figure 1 for Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
Figure 2 for Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
Figure 3 for Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
Figure 4 for Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services
Viaarxiv icon

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

Add code
May 22, 2025
Figure 1 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 2 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 3 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 4 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Viaarxiv icon

In-Context Watermarks for Large Language Models

Add code
May 22, 2025
Figure 1 for In-Context Watermarks for Large Language Models
Figure 2 for In-Context Watermarks for Large Language Models
Figure 3 for In-Context Watermarks for Large Language Models
Figure 4 for In-Context Watermarks for Large Language Models
Viaarxiv icon

AgentXploit: End-to-End Redteaming of Black-Box AI Agents

Add code
May 09, 2025
Figure 1 for AgentXploit: End-to-End Redteaming of Black-Box AI Agents
Figure 2 for AgentXploit: End-to-End Redteaming of Black-Box AI Agents
Figure 3 for AgentXploit: End-to-End Redteaming of Black-Box AI Agents
Figure 4 for AgentXploit: End-to-End Redteaming of Black-Box AI Agents
Viaarxiv icon