Picture for Wei Yin

Wei Yin

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

Add code
Dec 30, 2024
Viaarxiv icon

DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT

Add code
Dec 27, 2024
Viaarxiv icon

RoMeO: Robust Metric Visual Odometry

Add code
Dec 16, 2024
Viaarxiv icon

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Add code
Nov 26, 2024
Figure 1 for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Figure 2 for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Figure 3 for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Figure 4 for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Viaarxiv icon

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Add code
Oct 29, 2024
Figure 1 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 2 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 3 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 4 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Viaarxiv icon

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Add code
Oct 14, 2024
Figure 1 for DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
Figure 2 for DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
Figure 3 for DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
Figure 4 for DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
Viaarxiv icon

Depth Any Video with Scalable Synthetic Data

Add code
Oct 14, 2024
Figure 1 for Depth Any Video with Scalable Synthetic Data
Figure 2 for Depth Any Video with Scalable Synthetic Data
Figure 3 for Depth Any Video with Scalable Synthetic Data
Figure 4 for Depth Any Video with Scalable Synthetic Data
Viaarxiv icon

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Add code
Oct 07, 2024
Viaarxiv icon

OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity

Add code
Sep 30, 2024
Figure 1 for OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
Figure 2 for OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
Figure 3 for OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
Figure 4 for OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
Viaarxiv icon

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Add code
Sep 26, 2024
Viaarxiv icon