Picture for Josh Susskind

Josh Susskind

3D Shape Tokenization

Add code
Dec 24, 2024
Viaarxiv icon

Normalizing Flows are Capable Generative Models

Add code
Dec 10, 2024
Viaarxiv icon

Coordinate In and Value Out: Training Flow Transformers in Ambient Space

Add code
Dec 05, 2024
Figure 1 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 2 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 3 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Figure 4 for Coordinate In and Value Out: Training Flow Transformers in Ambient Space
Viaarxiv icon

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Add code
Nov 02, 2024
Figure 1 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 2 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 3 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 4 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Viaarxiv icon

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Add code
Oct 31, 2024
Viaarxiv icon

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Add code
Oct 10, 2024
Figure 1 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 2 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 3 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 4 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Viaarxiv icon

On the benefits of pixel-based hierarchical policies for task generalization

Add code
Jul 27, 2024
Figure 1 for On the benefits of pixel-based hierarchical policies for task generalization
Figure 2 for On the benefits of pixel-based hierarchical policies for task generalization
Figure 3 for On the benefits of pixel-based hierarchical policies for task generalization
Figure 4 for On the benefits of pixel-based hierarchical policies for task generalization
Viaarxiv icon

Improving GFlowNets for Text-to-Image Diffusion Alignment

Add code
Jun 02, 2024
Figure 1 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 2 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 3 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 4 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Viaarxiv icon

How Far Are We from Intelligent Visual Deductive Reasoning?

Add code
Mar 08, 2024
Figure 1 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 2 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 3 for How Far Are We from Intelligent Visual Deductive Reasoning?
Figure 4 for How Far Are We from Intelligent Visual Deductive Reasoning?
Viaarxiv icon

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Add code
Jan 29, 2024
Viaarxiv icon