Picture for Jiazhao Zhang

Jiazhao Zhang

SPAN-Nav: Generalized Spatial Awareness for Versatile Vision-Language Navigation

Add code
Mar 10, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Add code
Jan 18, 2026
Viaarxiv icon

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Add code
Oct 08, 2025
Viaarxiv icon

RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction

Add code
Jul 23, 2025
Viaarxiv icon

BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion

Add code
Jun 18, 2025
Viaarxiv icon

OctoNav: Towards Generalist Embodied Navigation

Add code
Jun 11, 2025
Viaarxiv icon

TrackVLA: Embodied Visual Tracking in the Wild

Add code
May 29, 2025
Figure 1 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 2 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 3 for TrackVLA: Embodied Visual Tracking in the Wild
Figure 4 for TrackVLA: Embodied Visual Tracking in the Wild
Viaarxiv icon

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Add code
Apr 26, 2025
Figure 1 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 2 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 3 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Figure 4 for RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Viaarxiv icon

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Add code
Mar 03, 2025
Viaarxiv icon