Picture for Robin San Roman

Robin San Roman

Large Concept Models: Language Modeling in a Sentence Representation Space

Add code
Dec 11, 2024
Viaarxiv icon

Latent Watermarking of Audio Generative Models

Add code
Sep 04, 2024
Viaarxiv icon

Proactive Detection of Voice Cloning with Localized Watermarking

Add code
Jan 30, 2024
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

Add code
Aug 02, 2023
Figure 1 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 2 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 3 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 4 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Viaarxiv icon

Denoising Diffusion Gamma Models

Add code
Oct 10, 2021
Figure 1 for Denoising Diffusion Gamma Models
Figure 2 for Denoising Diffusion Gamma Models
Figure 3 for Denoising Diffusion Gamma Models
Viaarxiv icon

Non Gaussian Denoising Diffusion Models

Add code
Jun 14, 2021
Figure 1 for Non Gaussian Denoising Diffusion Models
Figure 2 for Non Gaussian Denoising Diffusion Models
Figure 3 for Non Gaussian Denoising Diffusion Models
Figure 4 for Non Gaussian Denoising Diffusion Models
Viaarxiv icon