Picture for Songen Gu

Songen Gu

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Add code
Oct 14, 2024
Viaarxiv icon

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Add code
Oct 07, 2024
Viaarxiv icon

GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping

Add code
Mar 14, 2024
Figure 1 for GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Figure 2 for GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Figure 3 for GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Figure 4 for GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Viaarxiv icon

Text2Street: Controllable Text-to-image Generation for Street Views

Add code
Feb 07, 2024
Viaarxiv icon

ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation

Add code
Nov 10, 2023
Viaarxiv icon