Picture for Kengo Uchida

Kengo Uchida

MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training

Add code
Jun 04, 2024
Figure 1 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 2 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 3 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Figure 4 for MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Viaarxiv icon

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

Add code
Dec 31, 2023
Viaarxiv icon

Zero- and Few-shot Sound Event Localization and Detection

Add code
Sep 17, 2023
Figure 1 for Zero- and Few-shot Sound Event Localization and Detection
Figure 2 for Zero- and Few-shot Sound Event Localization and Detection
Figure 3 for Zero- and Few-shot Sound Event Localization and Detection
Figure 4 for Zero- and Few-shot Sound Event Localization and Detection
Viaarxiv icon

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Add code
Jun 15, 2023
Figure 1 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 2 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 3 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 4 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Viaarxiv icon