Picture for Zhihui Li

Zhihui Li

Exploring Adaptive Masked Reconstruction for Self-Supervised Skeleton-Based Action Recognition

Add code
Jun 09, 2026
Viaarxiv icon

CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection

Add code
Jun 02, 2026
Viaarxiv icon

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

Add code
May 28, 2026
Viaarxiv icon

SRC-Flow: Compact Semantic Representations Enable Normalizing Flows for Image Generation

Add code
May 18, 2026
Viaarxiv icon

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Add code
May 13, 2026
Viaarxiv icon

Progressive Online Video Understanding with Evidence-Aligned Timing and Transparent Decisions

Add code
Apr 20, 2026
Viaarxiv icon

LatentPilot: Scene-Aware Vision-and-Language Navigation by Dreaming Ahead with Latent Visual Reasoning

Add code
Mar 31, 2026
Viaarxiv icon

RiskProp: Collision-Anchored Self-Supervised Risk Propagation for Early Accident Anticipation

Add code
Mar 28, 2026
Viaarxiv icon

Beyond Dense Futures: World Models as Structured Planners for Robotic Manipulation

Add code
Mar 13, 2026
Viaarxiv icon

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation

Add code
Mar 10, 2026
Viaarxiv icon