Picture for Mirco Ravanelli

Mirco Ravanelli

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?

Add code
Oct 08, 2024
Figure 1 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 2 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 3 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 4 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Viaarxiv icon

Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming

Add code
Oct 07, 2024
Viaarxiv icon

What Are They Doing? Joint Audio-Speech Co-Reasoning

Add code
Sep 22, 2024
Viaarxiv icon

LMAC-TD: Producing Time Domain Explanations for Audio Classifiers

Add code
Sep 13, 2024
Viaarxiv icon

ProGRes: Prompted Generative Rescoring on ASR n-Best

Add code
Aug 30, 2024
Viaarxiv icon

Open-Source Conversational AI with SpeechBrain 1.0

Add code
Jul 02, 2024
Figure 1 for Open-Source Conversational AI with SpeechBrain 1.0
Figure 2 for Open-Source Conversational AI with SpeechBrain 1.0
Viaarxiv icon

DASB -- Discrete Audio and Speech Benchmark

Add code
Jun 20, 2024
Figure 1 for DASB -- Discrete Audio and Speech Benchmark
Figure 2 for DASB -- Discrete Audio and Speech Benchmark
Figure 3 for DASB -- Discrete Audio and Speech Benchmark
Figure 4 for DASB -- Discrete Audio and Speech Benchmark
Viaarxiv icon

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?

Add code
Jun 15, 2024
Viaarxiv icon

Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice

Add code
Jun 14, 2024
Viaarxiv icon

Listenable Maps for Zero-Shot Audio Classifiers

Add code
May 27, 2024
Viaarxiv icon