Picture for Navdeep Jaitly

Navdeep Jaitly

Normalizing Flows are Capable Generative Models

Add code
Dec 10, 2024
Viaarxiv icon

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis

Add code
Nov 26, 2024
Figure 1 for Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis
Figure 2 for Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis
Figure 3 for Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis
Figure 4 for Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis
Viaarxiv icon

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Add code
Nov 02, 2024
Figure 1 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 2 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 3 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 4 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Viaarxiv icon

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Add code
Oct 31, 2024
Viaarxiv icon

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Add code
Oct 10, 2024
Figure 1 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 2 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 3 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 4 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Viaarxiv icon

Achieving Human Level Competitive Robot Table Tennis

Add code
Aug 07, 2024
Figure 1 for Achieving Human Level Competitive Robot Table Tennis
Figure 2 for Achieving Human Level Competitive Robot Table Tennis
Figure 3 for Achieving Human Level Competitive Robot Table Tennis
Figure 4 for Achieving Human Level Competitive Robot Table Tennis
Viaarxiv icon

dMel: Speech Tokenization made Simple

Add code
Jul 22, 2024
Figure 1 for dMel: Speech Tokenization made Simple
Figure 2 for dMel: Speech Tokenization made Simple
Figure 3 for dMel: Speech Tokenization made Simple
Figure 4 for dMel: Speech Tokenization made Simple
Viaarxiv icon

Improving GFlowNets for Text-to-Image Diffusion Alignment

Add code
Jun 02, 2024
Figure 1 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 2 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 3 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Figure 4 for Improving GFlowNets for Text-to-Image Diffusion Alignment
Viaarxiv icon

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling

Add code
May 31, 2024
Viaarxiv icon

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Add code
May 24, 2024
Figure 1 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 2 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 3 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Figure 4 for Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Viaarxiv icon