Picture for Matteo Negri

Matteo Negri

Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection

Add code
Dec 16, 2024
Viaarxiv icon

Findings of the IWSLT 2024 Evaluation Campaign

Add code
Nov 07, 2024
Viaarxiv icon

SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation

Add code
Nov 03, 2024
Figure 1 for SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation
Figure 2 for SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation
Figure 3 for SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation
Figure 4 for SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation
Viaarxiv icon

What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study

Add code
Oct 01, 2024
Viaarxiv icon

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Add code
Oct 01, 2024
Figure 1 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 2 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 3 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Figure 4 for MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Viaarxiv icon

Self-attention as an attractor network: transient memories without backpropagation

Add code
Sep 24, 2024
Viaarxiv icon

Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond

Add code
Aug 07, 2024
Viaarxiv icon

Random Features Hopfield Networks generalize retrieval to previously unseen examples

Add code
Jul 08, 2024
Figure 1 for Random Features Hopfield Networks generalize retrieval to previously unseen examples
Figure 2 for Random Features Hopfield Networks generalize retrieval to previously unseen examples
Figure 3 for Random Features Hopfield Networks generalize retrieval to previously unseen examples
Viaarxiv icon

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation

Add code
Jun 20, 2024
Viaarxiv icon

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection

Add code
Jun 10, 2024
Viaarxiv icon