Picture for Josh Susskind

Josh Susskind

Normalizing Flows are Capable Generative Models

Add code
Dec 10, 2024
Viaarxiv icon

Coordinate In and Value Out: Training Flow Transformers in Ambient Space

Add code
Dec 05, 2024
Figure 1 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 2 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 3 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 4 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Viaarxiv icon

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Add code
Nov 02, 2024
Figure 1 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 2 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 3 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 4 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Viaarxiv icon

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Add code
Oct 31, 2024
Viaarxiv icon

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Add code
Oct 10, 2024
Figure 1 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 2 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 3 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 4 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Viaarxiv icon

On the benefits of pixel-based hierarchical policies for task generalization

Add code
Jul 27, 2024
Viaarxiv icon

Improving GFlowNets for Text-to-Image Diffusion Alignment

Add code
Jun 02, 2024
Figure 1 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 2 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 3 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 4 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Viaarxiv icon

How Far Are We from Intelligent Visual Deductive Reasoning?

Add code
Mar 08, 2024
Figure 1 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 2 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 3 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 4 for How Far Are We from Intelligent Visual Deductive Reasoning?
Viaarxiv icon

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Add code
Jan 29, 2024
Viaarxiv icon

What Algorithms can Transformers Learn? A Study in Length Generalization

Add code
Oct 24, 2023
Figure 1 for What Algorithms can Transformers Learn? A Study in Length Generalization
Figure 2 for What Algorithms can Transformers Learn? A Study in Length Generalization
Figure 3 for What Algorithms can Transformers Learn? A Study in Length Generalization
Figure 4 for What Algorithms can Transformers Learn? A Study in Length Generalization
Viaarxiv icon