Picture for Xingang Wang

Xingang Wang

ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration

Add code
Nov 29, 2024
Viaarxiv icon

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Add code
Nov 13, 2024
Figure 1 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 2 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 3 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Figure 4 for EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Viaarxiv icon

DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation

Add code
Oct 17, 2024
Figure 1 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 2 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 3 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Figure 4 for DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Viaarxiv icon

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Add code
Apr 08, 2024
Viaarxiv icon

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Add code
Mar 11, 2024
Figure 1 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 2 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 3 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 4 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Viaarxiv icon

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

Add code
Dec 11, 2023
Viaarxiv icon

Inferring Attracting Basins of Power System with Machine Learning

Add code
May 20, 2023
Viaarxiv icon

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

Add code
Mar 30, 2023
Figure 1 for FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Figure 2 for FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Figure 3 for FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Figure 4 for FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
Viaarxiv icon

DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception

Add code
Mar 15, 2023
Viaarxiv icon

OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

Add code
Mar 07, 2023
Figure 1 for OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Figure 2 for OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Figure 3 for OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Figure 4 for OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Viaarxiv icon