Pierre Andrews

NLLB Team

BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

Feb 06, 2025

LCFO: Long Context and Long Form Output Dataset and Benchmarking

Dec 12, 2024

Large Concept Models: Language Modeling in a Sentence Representation Space

Dec 11, 2024

2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset

Dec 11, 2024

Towards Red Teaming in Multimodal and Multilingual Translation

Jan 29, 2024

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

Jan 10, 2024

Seamless: Multilingual Expressive and Streaming Speech Translation

Dec 08, 2023

Gender-specific Machine Translation with Large Language Models

Sep 06, 2023

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages

Aug 31, 2023

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

Aug 23, 2023