Picture for Morgane Rivière

Morgane Rivière

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Apr 11, 2024
Figure 1 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 2 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 3 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 4 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

Add code
Oct 27, 2022
Viaarxiv icon

Textless Speech Emotion Conversion using Decomposed and Discrete Representations

Add code
Nov 14, 2021
Figure 1 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 2 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 3 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 4 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Viaarxiv icon

Text-Free Prosody-Aware Generative Spoken Language Modeling

Add code
Sep 07, 2021
Figure 1 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 2 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 3 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 4 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Viaarxiv icon

The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling

Add code
Apr 29, 2021
Figure 1 for The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling
Figure 2 for The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling
Viaarxiv icon

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Add code
Jan 02, 2021
Figure 1 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 2 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 3 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 4 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Viaarxiv icon

The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling

Add code
Dec 01, 2020
Figure 1 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 2 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 3 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Figure 4 for The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Viaarxiv icon

Data Augmenting Contrastive Learning of Speech Representations in the Time Domain

Add code
Jul 02, 2020
Figure 1 for Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Figure 2 for Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Figure 3 for Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Figure 4 for Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Viaarxiv icon