Picture for Jiwen Lu

Jiwen Lu

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Add code
Feb 06, 2025
Figure 1 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 2 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 3 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Figure 4 for Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Viaarxiv icon

GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Add code
Jan 26, 2025
Viaarxiv icon

Preventing Local Pitfalls in Vector Quantization via Optimal Transport

Add code
Dec 19, 2024
Viaarxiv icon

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

Add code
Dec 13, 2024
Figure 1 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 2 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 3 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Figure 4 for GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Viaarxiv icon

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Add code
Dec 12, 2024
Viaarxiv icon

GPD-1: Generative Pre-training for Driving

Add code
Dec 11, 2024
Figure 1 for GPD-1: Generative Pre-training for Driving
Figure 2 for GPD-1: Generative Pre-training for Driving
Figure 3 for GPD-1: Generative Pre-training for Driving
Figure 4 for GPD-1: Generative Pre-training for Driving
Viaarxiv icon

Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving

Add code
Dec 09, 2024
Figure 1 for Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
Figure 2 for Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
Figure 3 for Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
Figure 4 for Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
Viaarxiv icon

Bridging the Divide: Reconsidering Softmax and Linear Attention

Add code
Dec 09, 2024
Figure 1 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 2 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 3 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 4 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Viaarxiv icon

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Add code
Dec 06, 2024
Viaarxiv icon