Picture for Wenhao Chai

Wenhao Chai

UniHPR: Unified Human Pose Representation via Singular Value Contrastive Learning

Add code
Oct 21, 2025
Viaarxiv icon

VideoNSA: Native Sparse Attention Scales Video Understanding

Add code
Oct 02, 2025
Viaarxiv icon

Dense Video Understanding with Gated Residual Tokenization

Add code
Sep 18, 2025
Viaarxiv icon

AuroraLong: Bringing RNNs Back to Efficient Open-Ended Video Understanding

Add code
Jul 03, 2025
Viaarxiv icon

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Add code
Jun 13, 2025
Viaarxiv icon

GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning

Add code
May 29, 2025
Viaarxiv icon

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Add code
May 29, 2025
Viaarxiv icon

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

Add code
May 02, 2025
Viaarxiv icon

Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark

Add code
Apr 20, 2025
Viaarxiv icon

Science-T2I: Addressing Scientific Illusions in Image Synthesis

Add code
Apr 17, 2025
Viaarxiv icon