Picture for Ann Lee

Ann Lee

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation

Add code
Jun 04, 2024
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Add code
Aug 23, 2023
Figure 1 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 2 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 3 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 4 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Viaarxiv icon

Multilingual Speech-to-Speech Translation into Multiple Target Languages

Add code
Jul 17, 2023
Viaarxiv icon

Enhancing Speech-to-Speech Translation with Multiple TTS Targets

Add code
Apr 10, 2023
Figure 1 for Enhancing Speech-to-Speech Translation with Multiple TTS Targets
Figure 2 for Enhancing Speech-to-Speech Translation with Multiple TTS Targets
Figure 3 for Enhancing Speech-to-Speech Translation with Multiple TTS Targets
Figure 4 for Enhancing Speech-to-Speech Translation with Multiple TTS Targets
Viaarxiv icon

A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

Add code
Jan 25, 2023
Figure 1 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 2 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 3 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 4 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Viaarxiv icon

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

Add code
Dec 15, 2022
Viaarxiv icon

Speech-to-Speech Translation For A Real-world Unwritten Language

Add code
Nov 11, 2022
Viaarxiv icon

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations

Add code
Nov 08, 2022
Figure 1 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 2 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 3 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Figure 4 for SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Viaarxiv icon