Jie Zhou

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

Dec 16, 2024
Via arXiv

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction

Dec 13, 2024
Via arXiv

Doe-1: Closed-Loop Autonomous Driving with Large World Model

Dec 12, 2024
Via arXiv

Owl-1: Omni World Model for Consistent Long Video Generation

Dec 12, 2024
Via arXiv

GPD-1: Generative Pre-training for Driving

Dec 11, 2024
Via arXiv

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Dec 11, 2024
Via arXiv

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Dec 06, 2024
Via arXiv

Compound Gaussian Radar Clutter Model With Positive Tempered Alpha-Stable Texture

Dec 06, 2024
Via arXiv

Retrieval-Augmented Machine Translation with Unstructured Knowledge

Dec 05, 2024
Via arXiv

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

Dec 05, 2024
Via arXiv