Xiaodong Cun

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Dec 27, 2024
Via arXiv

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Dec 24, 2024
Via arXiv

CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training

Dec 23, 2024
Via arXiv

DEIM: DETR with Improved Matching for Fast Convergence

Dec 05, 2024
Via arXiv

AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation

Nov 26, 2024
Via arXiv

ForgeryTTT: Zero-Shot Image Manipulation Localization with Test-Time Training

Oct 05, 2024
Via arXiv

Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach

Oct 04, 2024
Via arXiv

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos

Sep 11, 2024
Via arXiv

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Sep 03, 2024
Via arXiv

Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models

Jul 14, 2024
Via arXiv