Sergey Tulyakov

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Dec 12, 2024
Via arXiv

Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Dec 12, 2024
Via arXiv

Video Motion Transfer with Diffusion Transformers

Dec 10, 2024
Via arXiv

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Dec 06, 2024
Via arXiv

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Dec 05, 2024
Via arXiv

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Dec 02, 2024
Via arXiv

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation

Nov 07, 2024
Via arXiv

DELTA: Dense Efficient Long-range 3D Tracking for any video

Oct 31, 2024
Via arXiv

Scalable Ranked Preference Optimization for Text-to-Image Generation

Oct 23, 2024
Via arXiv

ControlMM: Controllable Masked Motion Generation

Oct 14, 2024
Via arXiv