Picture for Songwei Ge

Songwei Ge

Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors

Add code
Dec 12, 2024
Viaarxiv icon

Rethinking Score Distillation as a Bridge Between Image Distributions

Add code
Jun 13, 2024
Viaarxiv icon

Coherent Zero-Shot Visual Instruction Generation

Add code
Jun 06, 2024
Viaarxiv icon

On the Content Bias in Fréchet Video Distance

Add code
Apr 18, 2024
Viaarxiv icon

Grounded Text-to-Image Synthesis with Attention Refocusing

Add code
Jun 08, 2023
Viaarxiv icon

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

Add code
May 17, 2023
Viaarxiv icon

Expressive Text-to-Image Generation with Rich Text

Add code
Apr 13, 2023
Viaarxiv icon

Text-driven Visual Synthesis with Latent Diffusion Prior

Add code
Feb 16, 2023
Figure 1 for Text-driven Visual Synthesis with Latent Diffusion Prior
Figure 2 for Text-driven Visual Synthesis with Latent Diffusion Prior
Figure 3 for Text-driven Visual Synthesis with Latent Diffusion Prior
Figure 4 for Text-driven Visual Synthesis with Latent Diffusion Prior
Viaarxiv icon

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

Add code
Dec 01, 2022
Viaarxiv icon

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

Add code
Apr 28, 2022
Figure 1 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 2 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 3 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Figure 4 for MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
Viaarxiv icon