Wei Yin

RoMeO: Robust Metric Visual Odometry

Dec 16, 2024
Via arXiv

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Nov 26, 2024
Via arXiv

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Oct 29, 2024
Via arXiv

Depth Any Video with Scalable Synthetic Data

Oct 14, 2024
Via arXiv

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Oct 14, 2024
Via arXiv

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Oct 07, 2024
Via arXiv

OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity

Sep 30, 2024
Via arXiv

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Sep 26, 2024
Via arXiv

DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos

May 29, 2024
Via arXiv

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

Mar 21, 2024
Via arXiv