
Rithesh Kumar

DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization

Oct 14, 2024
Via arXiv

VampNet: Music Generation via Masked Acoustic Token Modeling

Jul 12, 2023
Via arXiv

High-Fidelity Audio Compression with Improved RVQGAN

Jun 11, 2023
Via arXiv

Chunked Autoregressive GAN for Conditional Waveform Synthesis

Oct 19, 2021
Via arXiv

NU-GAN: High resolution neural upsampling with GAN

Oct 22, 2020
Via arXiv

MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

Oct 28, 2019
Via arXiv

Maximum Entropy Generators for Energy-Based Models

Jan 24, 2019
Via arXiv

Harmonic Recomposition using Conditional Autoregressive Modeling

Nov 18, 2018
Via arXiv

ObamaNet: Photo-realistic lip-sync from text

Dec 06, 2017
Via arXiv

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

Feb 11, 2017
Via arXiv