Picture for Slim Essid

Slim Essid

IDS, S2A, LTCI

A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning

Add code
Nov 06, 2024
Viaarxiv icon

An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment

Add code
Oct 08, 2024
Viaarxiv icon

SALT: Standardized Audio event Label Taxonomy

Add code
Sep 18, 2024
Viaarxiv icon

Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing

Add code
Jul 22, 2024
Figure 1 for Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
Figure 2 for Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
Figure 3 for Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
Figure 4 for Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
Viaarxiv icon

Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations

Add code
Jun 30, 2024
Viaarxiv icon

Winner-takes-all learners are geometry-aware conditional density estimators

Add code
Jun 07, 2024
Figure 1 for Winner-takes-all learners are geometry-aware conditional density estimators
Figure 2 for Winner-takes-all learners are geometry-aware conditional density estimators
Figure 3 for Winner-takes-all learners are geometry-aware conditional density estimators
Figure 4 for Winner-takes-all learners are geometry-aware conditional density estimators
Viaarxiv icon

A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2

Add code
Apr 11, 2024
Viaarxiv icon

Online speaker diarization of meetings guided by speech separation

Add code
Jan 30, 2024
Figure 1 for Online speaker diarization of meetings guided by speech separation
Figure 2 for Online speaker diarization of meetings guided by speech separation
Figure 3 for Online speaker diarization of meetings guided by speech separation
Figure 4 for Online speaker diarization of meetings guided by speech separation
Viaarxiv icon

On the choice of the optimal temporal support for audio classification with Pre-trained embeddings

Add code
Dec 21, 2023
Viaarxiv icon

Collaborating Foundation models for Domain Generalized Semantic Segmentation

Add code
Dec 15, 2023
Viaarxiv icon