Sergey Tulyakov

Visual Personalization Turing Test

Jan 30, 2026

S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation

Jan 19, 2026

SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

Jan 13, 2026

Tuning-free Visual Effect Transfer across Videos

Jan 13, 2026

Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning

Jan 07, 2026

EasyV2V: A High-quality Instruction-based Video Editing Framework

Dec 18, 2025

AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation

Dec 11, 2025

Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization

Dec 11, 2025

OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Dec 11, 2025

AlphaFlow: Understanding and Improving MeanFlow Models

Oct 23, 2025