Picture for Jiatao Gu

Jiatao Gu

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Add code
Nov 02, 2024
Figure 1 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 2 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 3 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 4 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Viaarxiv icon

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Add code
Oct 10, 2024
Figure 1 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 2 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 3 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 4 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Viaarxiv icon

Improving GFlowNets for Text-to-Image Diffusion Alignment

Add code
Jun 02, 2024
Figure 1 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 2 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 3 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 4 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Viaarxiv icon

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling

Add code
May 31, 2024
Viaarxiv icon

GECO: Generative Image-to-3D within a SECOnd

Add code
May 30, 2024
Viaarxiv icon

Many-to-many Image Generation with Auto-regressive Diffusion Models

Add code
Apr 03, 2024
Viaarxiv icon

How Far Are We from Intelligent Visual Deductive Reasoning?

Add code
Mar 08, 2024
Figure 1 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 2 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 3 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 4 for How Far Are We from Intelligent Visual Deductive Reasoning?
Viaarxiv icon

Divide-or-Conquer? Which Part Should You Distill Your LLM?

Add code
Feb 22, 2024
Figure 1 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 2 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 3 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 4 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Viaarxiv icon

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Add code
Dec 26, 2023
Figure 1 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 2 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 3 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Figure 4 for Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Viaarxiv icon

Diffusion Models Without Attention

Add code
Nov 30, 2023
Figure 1 for Diffusion Models Without Attention
Figure 2 for Diffusion Models Without Attention
Figure 3 for Diffusion Models Without Attention
Figure 4 for Diffusion Models Without Attention
Viaarxiv icon