Samuele Cornell

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Feb 21, 2025

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Sep 14, 2024

Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition

Aug 17, 2024

The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization

Jul 23, 2024

Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement

Jun 19, 2024

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels

Jun 12, 2024

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement

Jun 07, 2024

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Oct 27, 2023

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

Oct 02, 2023

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Jul 28, 2023