Picture for Pierre Andrews

Pierre Andrews

NLLB Team

LCFO: Long Context and Long Form Output Dataset and Benchmarking

Add code
Dec 12, 2024
Viaarxiv icon

2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset

Add code
Dec 11, 2024
Viaarxiv icon

Large Concept Models: Language Modeling in a Sentence Representation Space

Add code
Dec 11, 2024
Viaarxiv icon

Towards Red Teaming in Multimodal and Multilingual Translation

Add code
Jan 29, 2024
Viaarxiv icon

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

Add code
Jan 10, 2024
Figure 1 for MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Figure 2 for MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Figure 3 for MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Figure 4 for MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

Gender-specific Machine Translation with Large Language Models

Add code
Sep 06, 2023
Figure 1 for Gender-specific Machine Translation with Large Language Models
Figure 2 for Gender-specific Machine Translation with Large Language Models
Figure 3 for Gender-specific Machine Translation with Large Language Models
Figure 4 for Gender-specific Machine Translation with Large Language Models
Viaarxiv icon

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages

Add code
Aug 31, 2023
Viaarxiv icon

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Add code
Aug 23, 2023
Figure 1 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 2 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 3 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Figure 4 for SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Viaarxiv icon

Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale

Add code
May 22, 2023
Viaarxiv icon