Picture for Yuki Mitsufuji

Yuki Mitsufuji

Music Foundation Model as Generic Booster for Music Downstream Tasks

Add code
Nov 05, 2024
Viaarxiv icon

OpenMU: Your Swiss Army Knife for Music Understanding

Add code
Oct 21, 2024
Viaarxiv icon

Mitigating Embedding Collapse in Diffusion Models for Categorical Data

Add code
Oct 18, 2024
Viaarxiv icon

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

Add code
Oct 12, 2024
Viaarxiv icon

Distillation of Discrete Diffusion through Dimensional Correlations

Add code
Oct 11, 2024
Figure 1 for Distillation of Discrete Diffusion through Dimensional Correlations
Figure 2 for Distillation of Discrete Diffusion through Dimensional Correlations
Viaarxiv icon

$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models

Add code
Oct 10, 2024
Viaarxiv icon

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

Add code
Oct 08, 2024
Viaarxiv icon

Variable Bitrate Residual Vector Quantization for Audio Coding

Add code
Oct 08, 2024
Viaarxiv icon

Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning

Add code
Oct 07, 2024
Viaarxiv icon

Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space

Add code
Oct 02, 2024
Figure 1 for Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Figure 2 for Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Figure 3 for Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Figure 4 for Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Viaarxiv icon