Picture for Jordi Pons

Jordi Pons

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Add code
Nov 29, 2024
Viaarxiv icon

Stable Audio Open

Add code
Jul 19, 2024
Viaarxiv icon

Long-form music generation with latent diffusion

Add code
Apr 16, 2024
Viaarxiv icon

Fast Timing-Conditioned Latent Audio Diffusion

Add code
Feb 08, 2024
Viaarxiv icon

GASS: Generalizing Audio Source Separation with Large-scale Data

Add code
Sep 29, 2023
Viaarxiv icon

Mono-to-stereo through parametric stereo generation

Add code
Jun 26, 2023
Figure 1 for Mono-to-stereo through parametric stereo generation
Figure 2 for Mono-to-stereo through parametric stereo generation
Figure 3 for Mono-to-stereo through parametric stereo generation
Figure 4 for Mono-to-stereo through parametric stereo generation
Viaarxiv icon

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models

Add code
Jun 16, 2023
Viaarxiv icon

Towards Robust Image-in-Audio Deep Steganography

Add code
Mar 14, 2023
Viaarxiv icon

Full-band General Audio Synthesis with Score-based Diffusion

Add code
Oct 26, 2022
Viaarxiv icon

PodcastMix: A dataset for separating music and speech in podcasts

Add code
Jul 15, 2022
Figure 1 for PodcastMix: A dataset for separating music and speech in podcasts
Figure 2 for PodcastMix: A dataset for separating music and speech in podcasts
Figure 3 for PodcastMix: A dataset for separating music and speech in podcasts
Figure 4 for PodcastMix: A dataset for separating music and speech in podcasts
Viaarxiv icon