Zheng Zhu

Tencent, WeChat Pay

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Nov 13, 2024

A Modulo Sampling Hardware Prototype and Reconstruction Algorithm Evaluation

Oct 25, 2024

Ubiquitous Field Transportation Robots with Robust Wheel-Leg Transformable Modules

Oct 24, 2024

DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation

Oct 17, 2024

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Jul 15, 2024

The SkatingVerse Workshop & Challenge: Methods and Results

May 27, 2024

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving

May 13, 2024

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

May 07, 2024

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

May 06, 2024

Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments

Apr 21, 2024