Picture for Zhi Li

Zhi Li

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects

Add code
May 31, 2026
Viaarxiv icon

Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control

Add code
May 28, 2026
Viaarxiv icon

MUSE: Benchmarking Manufacturable, Functional, and Assemblable Text-to-CAD Generation

Add code
May 27, 2026
Viaarxiv icon

Random Walk on Point Clouds for Feature Detection

Add code
Apr 22, 2026
Viaarxiv icon

WebWorld: A Large-Scale World Model for Web Agent Training

Add code
Feb 16, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

A Hitchhiker's Guide to Poisson Gradient Estimation

Add code
Feb 03, 2026
Viaarxiv icon

Human-in-the-Loop Failure Recovery with Adaptive Task Allocation

Add code
Feb 03, 2026
Viaarxiv icon