Yunpeng Zhang

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Feb 19, 2025

DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving

Dec 12, 2024

GPD-1: Generative Pre-training for Driving

Dec 11, 2024

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Dec 06, 2024

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Dec 06, 2024

Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Dec 05, 2024

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models

Aug 15, 2024

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

May 27, 2024

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

Apr 10, 2024

GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving

Apr 07, 2024