Picture for Sergey Tulyakov

Sergey Tulyakov

Multi-subject Open-set Personalization in Video Generation

Add code
Jan 10, 2025
Viaarxiv icon

Nested Attention: Semantic-aware Attention Values for Concept Personalization

Add code
Jan 02, 2025
Viaarxiv icon

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Add code
Dec 19, 2024
Figure 1 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 2 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 3 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Figure 4 for AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
Viaarxiv icon

Wonderland: Navigating 3D Scenes from a Single Image

Add code
Dec 16, 2024
Viaarxiv icon

SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device

Add code
Dec 13, 2024
Viaarxiv icon

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon

Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Add code
Dec 12, 2024
Viaarxiv icon

Video Motion Transfer with Diffusion Transformers

Add code
Dec 10, 2024
Viaarxiv icon

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Add code
Dec 06, 2024
Viaarxiv icon

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Add code
Dec 05, 2024
Figure 1 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 2 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 3 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Figure 4 for 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Viaarxiv icon