Picture for Zhizheng Zhang

Zhizheng Zhang

Southeast University, China

NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation

Add code
Mar 16, 2026
Viaarxiv icon

SPAN-Nav: Generalized Spatial Awareness for Versatile Vision-Language Navigation

Add code
Mar 10, 2026
Viaarxiv icon

Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning

Add code
Mar 10, 2026
Viaarxiv icon

SimRecon: SimReady Compositional Scene Reconstruction from Real Videos

Add code
Mar 03, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds

Add code
Jan 31, 2026
Viaarxiv icon

StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision

Add code
Dec 26, 2025
Figure 1 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 2 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 3 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Figure 4 for StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
Viaarxiv icon

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Add code
Oct 08, 2025
Viaarxiv icon

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Add code
Jul 03, 2025
Viaarxiv icon

TrackVLA: Embodied Visual Tracking in the Wild

Add code
May 29, 2025
Figure 1 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 2 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 3 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 4 for TrackVLA: Embodied Visual Tracking in the Wild
Viaarxiv icon