Alexei Baevski

Jack

The Llama 3 Herd of Models

Jul 31, 2024

Toward Joint Language Modeling for Speech Units and Text

Oct 12, 2023

Scaling Speech Technology to 1,000+ Languages

May 22, 2023

OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

Mar 14, 2023

AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations

Feb 10, 2023

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

Dec 14, 2022

Introducing Semantics into Speech Encoders

Nov 15, 2022

Masked Autoencoders that Listen

Jul 13, 2022

Wav2Vec-Aug: Improved self-supervised training with limited data

Jun 27, 2022

Offline Visual Representation Learning for Embodied Navigation

Apr 27, 2022