Jarod Duret

LIA

MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition

Jul 08, 2024

Open-Source Conversational AI with SpeechBrain 1.0

Jul 02, 2024

DASB -- Discrete Audio and Speech Benchmark

Jun 20, 2024

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?

Jun 15, 2024

Enhancing expressivity transfer in textless speech-to-speech translation

Oct 11, 2023

Direct Text to Speech Translation System using Acoustic Units

Sep 14, 2023

Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data

Jun 29, 2023

End-to-end model for named entity recognition from speech without paired training data

Apr 02, 2022

Study on the temporal pooling used in deep neural networks for speaker verification

May 10, 2021