Picture for Jiwen Lu

Jiwen Lu

EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models

Add code
Mar 19, 2025
Viaarxiv icon

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Add code
Mar 18, 2025
Viaarxiv icon

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Add code
Mar 13, 2025
Viaarxiv icon

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Add code
Feb 06, 2025
Figure 1 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 2 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 3 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 4 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Viaarxiv icon

GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Add code
Jan 26, 2025
Viaarxiv icon

Preventing Local Pitfalls in Vector Quantization via Optimal Transport

Add code
Dec 19, 2024
Figure 1 for Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Figure 2 for Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Figure 3 for Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Figure 4 for Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Viaarxiv icon

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

Add code
Dec 13, 2024
Figure 1 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 2 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 3 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 4 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Viaarxiv icon

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Add code
Dec 12, 2024
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Figure 1 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 2 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 3 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 4 for Owl-1: Omni World Model for Consistent Long Video Generation
Viaarxiv icon

GPD-1: Generative Pre-training for Driving

Add code
Dec 11, 2024
Figure 1 for GPD-1: Generative Pre-training for Driving
Figure 2 for GPD-1: Generative Pre-training for Driving
Figure 3 for GPD-1: Generative Pre-training for Driving
Figure 4 for GPD-1: Generative Pre-training for Driving
Viaarxiv icon