Picture for Shanghang Zhang

Shanghang Zhang

TOPS: First-Principles Visual Token Pruning via Constructing Token Optimal Preservation Sets for Efficient MLLM Inference

Add code
Jun 25, 2026
Viaarxiv icon

FORCE: Efficient VLA Reinforcement Fine-Tuning via Value-Calibrated Warm-up and Self-Distillation

Add code
Jun 24, 2026
Viaarxiv icon

LaST-HD: Learning Latent Physical Reasoning from Scalable Human Data for Robot Manipulation

Add code
Jun 22, 2026
Viaarxiv icon

WAM-RL: World-Action Model Reinforcement Learning with Reconstruction Rewards and Online Video SFT

Add code
Jun 16, 2026
Viaarxiv icon

WAM4D: Fast 4D World Action Model via Spatial Register Tokens

Add code
Jun 12, 2026
Viaarxiv icon

Vector Map as Language: Toward Unified Remote Sensing Vector Mapping

Add code
Jun 09, 2026
Viaarxiv icon

Efficient-WAM: A 1B-Parameter World-Action Model with Low-Cost Future Imagination

Add code
Jun 08, 2026
Viaarxiv icon

Dream-Tac: A Unified Tactile World Action Model for Contact-Rich Robot Manipulation

Add code
Jun 07, 2026
Viaarxiv icon

SparseStreet: Sparse Gaussian Splatting for Real-Time Street Scene Simulation

Add code
Jun 02, 2026
Viaarxiv icon

Demo-JEPA: Joint-Embedding Predictive Architecture for One-shot Cross-Embodiment Imitation

Add code
May 20, 2026
Viaarxiv icon