Picture for Wenzhao Zheng

Wenzhao Zheng

Training-free Regional Prompting for Diffusion Transformers

Add code
Nov 04, 2024
Viaarxiv icon

PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views

Add code
Oct 24, 2024
Viaarxiv icon

UniDrive: Towards Universal Driving Perception Across Camera Configurations

Add code
Oct 17, 2024
Figure 1 for UniDrive: Towards Universal Driving Perception Across Camera Configurations
Figure 2 for UniDrive: Towards Universal Driving Perception Across Camera Configurations
Figure 3 for UniDrive: Towards Universal Driving Perception Across Camera Configurations
Figure 4 for UniDrive: Towards Universal Driving Perception Across Camera Configurations
Viaarxiv icon

V2M: Visual 2-Dimensional Mamba for Image Representation Learning

Add code
Oct 14, 2024
Viaarxiv icon

GlobalMamba: Global Image Serialization for Vision Mamba

Add code
Oct 14, 2024
Viaarxiv icon

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

Add code
Oct 06, 2024
Figure 1 for SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference
Figure 2 for SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference
Figure 3 for SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference
Figure 4 for SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference
Viaarxiv icon

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models

Add code
Aug 15, 2024
Figure 1 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 2 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 3 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 4 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Viaarxiv icon

LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation

Add code
Jun 28, 2024
Viaarxiv icon

Instruct Large Language Models to Drive like Humans

Add code
Jun 11, 2024
Viaarxiv icon

$\textit{S}^3$Gaussian: Self-Supervised Street Gaussians for Autonomous Driving

Add code
May 30, 2024
Viaarxiv icon