Picture for Shanghang Zhang

Shanghang Zhang

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation

Add code
Feb 19, 2025
Viaarxiv icon

CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World

Add code
Feb 12, 2025
Viaarxiv icon

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information

Add code
Feb 04, 2025
Viaarxiv icon

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation

Add code
Jan 28, 2025
Viaarxiv icon

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders

Add code
Jan 03, 2025
Figure 1 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 2 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 3 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Figure 4 for MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Viaarxiv icon

SCBench: A Sports Commentary Benchmark for Video LLMs

Add code
Dec 23, 2024
Viaarxiv icon

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation

Add code
Dec 18, 2024
Viaarxiv icon

GaussianAD: Gaussian-Centric End-to-End Autonomous Driving

Add code
Dec 13, 2024
Figure 1 for GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
Figure 2 for GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
Figure 3 for GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
Figure 4 for GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
Viaarxiv icon

GPD-1: Generative Pre-training for Driving

Add code
Dec 11, 2024
Figure 1 for GPD-1: Generative Pre-training for Driving
Figure 2 for GPD-1: Generative Pre-training for Driving
Figure 3 for GPD-1: Generative Pre-training for Driving
Figure 4 for GPD-1: Generative Pre-training for Driving
Viaarxiv icon

ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance

Add code
Dec 09, 2024
Viaarxiv icon