Wei Yin

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Oct 29, 2024

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Oct 14, 2024

Depth Any Video with Scalable Synthetic Data

Oct 14, 2024

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Oct 07, 2024

OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity

Sep 30, 2024

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Sep 26, 2024

DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos

May 29, 2024

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

Mar 21, 2024

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Mar 18, 2024

Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving

Mar 12, 2024