Picture for Takashi Shibuya

Takashi Shibuya

CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation

Add code
Jan 06, 2025
Viaarxiv icon

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Add code
Dec 19, 2024
Viaarxiv icon

SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation

Add code
Dec 18, 2024
Figure 1 for SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
Figure 2 for SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
Figure 3 for SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
Figure 4 for SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
Viaarxiv icon

TraSCE: Trajectory Steering for Concept Erasure

Add code
Dec 10, 2024
Figure 1 for TraSCE: Trajectory Steering for Concept Erasure
Viaarxiv icon

Classifier-Free Guidance inside the Attraction Basin May Cause Memorization

Add code
Nov 23, 2024
Viaarxiv icon

Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning

Add code
Oct 07, 2024
Viaarxiv icon

Embedded Topic Models Enhanced by Wikification

Add code
Oct 03, 2024
Viaarxiv icon

A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation

Add code
Sep 26, 2024
Figure 1 for A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Figure 2 for A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Figure 3 for A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Figure 4 for A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Viaarxiv icon

SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond

Add code
Jun 26, 2024
Figure 1 for SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Figure 2 for SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Figure 3 for SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Figure 4 for SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
Viaarxiv icon

MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training

Add code
Jun 04, 2024
Figure 1 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 2 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 3 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 4 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Viaarxiv icon