Picture for Yingcong Chen

Yingcong Chen

TransPixar: Advancing Text-to-Video Generation with Transparency

Add code
Jan 06, 2025
Viaarxiv icon

Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion

Add code
Dec 19, 2024
Figure 1 for Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion
Figure 2 for Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion
Figure 3 for Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion
Figure 4 for Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion
Viaarxiv icon

DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving

Add code
Dec 12, 2024
Figure 1 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 2 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 3 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 4 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Viaarxiv icon

Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning

Add code
Nov 30, 2024
Figure 1 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 2 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 3 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 4 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Viaarxiv icon

LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images

Add code
Oct 21, 2024
Figure 1 for LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images
Figure 2 for LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images
Figure 3 for LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images
Figure 4 for LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images
Viaarxiv icon

From Bird's-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model

Add code
Sep 02, 2024
Viaarxiv icon

SEED-Story: Multimodal Long Story Generation with Large Language Model

Add code
Jul 11, 2024
Viaarxiv icon

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

Add code
May 10, 2024
Viaarxiv icon

Resolve Domain Conflicts for Generalizable Remote Physiological Measurement

Add code
Apr 11, 2024
Viaarxiv icon

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

Add code
Apr 10, 2024
Figure 1 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 2 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 3 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 4 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Viaarxiv icon