Wenzhao Zheng

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Mar 18, 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation

Jan 28, 2025

GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Jan 26, 2025

Preventing Local Pitfalls in Vector Quantization via Optimal Transport

Dec 19, 2024

GaussianAD: Gaussian-Centric End-to-End Autonomous Driving

Dec 13, 2024

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

Dec 13, 2024

DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving

Dec 12, 2024

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Dec 12, 2024

Owl-1: Omni World Model for Consistent Long Video Generation

Dec 12, 2024

GPD-1: Generative Pre-training for Driving

Dec 11, 2024