Picture for Zhiheng Ma

Zhiheng Ma

ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs

Add code
May 25, 2026
Viaarxiv icon

Dance Across Shifts: Forward-Facilitation Continual Test-Time Adaptation through Dynamic Style Bridging

Add code
May 18, 2026
Viaarxiv icon

Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models

Add code
May 12, 2026
Viaarxiv icon

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

Add code
May 12, 2026
Viaarxiv icon

ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

Add code
May 11, 2026
Viaarxiv icon

Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration

Add code
May 07, 2026
Viaarxiv icon

ABot-Claw: A Foundation for Persistent, Cooperative, and Self-Evolving Robotic Agents

Add code
Apr 11, 2026
Viaarxiv icon

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Add code
Mar 24, 2026
Viaarxiv icon

HumanOmni-Speaker: Identifying Who said What and When

Add code
Mar 23, 2026
Viaarxiv icon