Picture for Yuichiro Koyama

Yuichiro Koyama

Music Foundation Model as Generic Booster for Music Downstream Tasks

Add code
Nov 05, 2024
Figure 1 for Music Foundation Model as Generic Booster for Music Downstream Tasks
Figure 2 for Music Foundation Model as Generic Booster for Music Downstream Tasks
Figure 3 for Music Foundation Model as Generic Booster for Music Downstream Tasks
Figure 4 for Music Foundation Model as Generic Booster for Music Downstream Tasks
Viaarxiv icon

Zero- and Few-shot Sound Event Localization and Detection

Add code
Sep 17, 2023
Figure 1 for Zero- and Few-shot Sound Event Localization and Detection
Figure 2 for Zero- and Few-shot Sound Event Localization and Detection
Figure 3 for Zero- and Few-shot Sound Event Localization and Detection
Figure 4 for Zero- and Few-shot Sound Event Localization and Detection
Viaarxiv icon

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

Add code
Jun 15, 2023
Figure 1 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 2 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 3 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Figure 4 for STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Viaarxiv icon

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders

Add code
May 18, 2023
Viaarxiv icon

Diffusion-based Signal Refiner for Speech Separation

Add code
May 12, 2023
Figure 1 for Diffusion-based Signal Refiner for Speech Separation
Figure 2 for Diffusion-based Signal Refiner for Speech Separation
Figure 3 for Diffusion-based Signal Refiner for Speech Separation
Figure 4 for Diffusion-based Signal Refiner for Speech Separation
Viaarxiv icon

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

Add code
Jun 04, 2022
Figure 1 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 2 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 3 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 4 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Viaarxiv icon

Removing Distortion Effects in Music Using Deep Neural Networks

Add code
Feb 03, 2022
Figure 1 for Removing Distortion Effects in Music Using Deep Neural Networks
Figure 2 for Removing Distortion Effects in Music Using Deep Neural Networks
Figure 3 for Removing Distortion Effects in Music Using Deep Neural Networks
Figure 4 for Removing Distortion Effects in Music Using Deep Neural Networks
Viaarxiv icon

Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training

Add code
Oct 14, 2021
Figure 1 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 2 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 3 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 4 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Viaarxiv icon

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection

Add code
Oct 13, 2021
Figure 1 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 2 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 3 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 4 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Viaarxiv icon

Music Source Separation with Deep Equilibrium Models

Add code
Oct 13, 2021
Figure 1 for Music Source Separation with Deep Equilibrium Models
Figure 2 for Music Source Separation with Deep Equilibrium Models
Figure 3 for Music Source Separation with Deep Equilibrium Models
Viaarxiv icon