Picture for Romain Serizel

Romain Serizel

MULTISPEECH

Angular Distance Distribution Loss for Audio Classification

Add code
Oct 31, 2024
Viaarxiv icon

A decade of DCASE: Achievements, practices, evaluations and future challenges

Add code
Oct 07, 2024
Viaarxiv icon

Diffusion-based Unsupervised Audio-visual Speech Enhancement

Add code
Oct 04, 2024
Viaarxiv icon

Domain-Invariant Representation Learning of Bird Sounds

Add code
Sep 16, 2024
Figure 1 for Domain-Invariant Representation Learning of Bird Sounds
Figure 2 for Domain-Invariant Representation Learning of Bird Sounds
Viaarxiv icon

Energy Consumption Trends in Sound Event Detection Systems

Add code
Sep 13, 2024
Viaarxiv icon

Normalizing Energy Consumption for Hardware-Independent Evaluation

Add code
Sep 09, 2024
Viaarxiv icon

From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems

Add code
Sep 08, 2024
Viaarxiv icon

Latent Watermarking of Audio Generative Models

Add code
Sep 04, 2024
Viaarxiv icon

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels

Add code
Jun 12, 2024
Viaarxiv icon

Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds

Add code
Mar 14, 2024
Viaarxiv icon