Bu Jin

Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving

Dec 03, 2024
Via arXiv

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Oct 14, 2024
Via arXiv

Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving

Sep 10, 2024
Via arXiv

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Sep 04, 2024
Via arXiv

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

Jun 04, 2024
Via arXiv

TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

Mar 28, 2024
Via arXiv

GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping

Mar 14, 2024
Via arXiv

MonoOcc: Digging into Monocular Semantic Occupancy Prediction

Mar 13, 2024
Via arXiv

STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation

Feb 02, 2023
Via arXiv

ADAPT: Action-aware Driving Caption Transformer

Feb 01, 2023
Via arXiv