Picture for Guosheng Zhao

Guosheng Zhao

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Add code
Nov 13, 2024
Viaarxiv icon

DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation

Add code
Oct 17, 2024
Figure 1 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 2 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 3 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 4 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Viaarxiv icon

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Add code
Apr 08, 2024
Viaarxiv icon

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Add code
Mar 11, 2024
Figure 1 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 2 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 3 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 4 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Viaarxiv icon