Picture for Qifeng Chen

Qifeng Chen

VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

Add code
Mar 20, 2025
Viaarxiv icon

Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation

Add code
Mar 14, 2025
Viaarxiv icon

EEdit : Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing

Add code
Mar 13, 2025
Viaarxiv icon

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Add code
Mar 13, 2025
Viaarxiv icon

Generative Artificial Intelligence in Robotic Manipulation: A Survey

Add code
Mar 05, 2025
Viaarxiv icon

MangaNinja: Line Art Colorization with Precise Reference Following

Add code
Jan 14, 2025
Viaarxiv icon

Edicho: Consistent Image Editing in the Wild

Add code
Dec 30, 2024
Viaarxiv icon

ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement

Add code
Dec 25, 2024
Viaarxiv icon

DepthLab: From Partial to Complete

Add code
Dec 24, 2024
Figure 1 for DepthLab: From Partial to Complete
Figure 2 for DepthLab: From Partial to Complete
Figure 3 for DepthLab: From Partial to Complete
Figure 4 for DepthLab: From Partial to Complete
Viaarxiv icon

Large Motion Video Autoencoding with Cross-modal Video VAE

Add code
Dec 23, 2024
Figure 1 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 2 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 3 for Large Motion Video Autoencoding with Cross-modal Video VAE
Figure 4 for Large Motion Video Autoencoding with Cross-modal Video VAE
Viaarxiv icon