Chen Gao

TetherCache: Stabilizing Autoregressive Long-Form Video Generation with Gated Recall and Trusted Alignment

Jun 11, 2026
Via arXiv

TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation

Jun 10, 2026
Via arXiv

TacForeSight: Force-Guided Tactile World Model for Contact-Rich Manipulation

Jun 09, 2026
Via arXiv

CP4D: Compositional Physics-aware 4D Scene Generation

Jun 08, 2026
Via arXiv

Dreaming when Necessary: Advancing World Action Models with Adaptive Multi-Modal Reasoning

Jun 05, 2026
Via arXiv

WorldFly: A World-Model-Based Vision-Language-Action Model for UAV Navigation

Jun 04, 2026
Via arXiv

Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation

May 28, 2026
Via arXiv

GE-Sim 2.0: A Roadmap Towards Comprehensive Closed-loop Video World Simulators for Robotic Manipulation

May 26, 2026
Via arXiv

EventPrune: Cascaded Event-Assisted Token Pruning for Efficient First-Person Dynamic Spatial Reasoning

May 19, 2026
Via arXiv

ManiSoft: Towards Vision-Language Manipulation for Soft Continuum Robotics

May 18, 2026
Via arXiv