Picture for Yuhta Takida

Yuhta Takida

Music Foundation Model as Generic Booster for Music Downstream Tasks

Add code
Nov 05, 2024
Viaarxiv icon

Mitigating Embedding Collapse in Diffusion Models for Categorical Data

Add code
Oct 18, 2024
Viaarxiv icon

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

Add code
Oct 12, 2024
Viaarxiv icon

Distillation of Discrete Diffusion through Dimensional Correlations

Add code
Oct 11, 2024
Figure 1 for Distillation of Discrete Diffusion through Dimensional Correlations
Figure 2 for Distillation of Discrete Diffusion through Dimensional Correlations
Viaarxiv icon

$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models

Add code
Oct 10, 2024
Viaarxiv icon

Variable Bitrate Residual Vector Quantization for Audio Coding

Add code
Oct 08, 2024
Viaarxiv icon

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation

Add code
Aug 20, 2024
Viaarxiv icon

MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training

Add code
Jun 04, 2024
Figure 1 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 2 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 3 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 4 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Viaarxiv icon

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

Add code
May 28, 2024
Viaarxiv icon

PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher

Add code
May 23, 2024
Viaarxiv icon