Zhuohan Long

Strong Reasoning Isn't Enough: Evaluating Evidence Elicitation in Interactive Diagnosis

Jan 27, 2026
Via arXiv

From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking

Jun 21, 2024
Via arXiv

Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

Feb 18, 2024
Via arXiv