Christophe Ropers

NLLB Team

LCFO: Long Context and Long Form Output Dataset and Benchmarking

Dec 12, 2024

Large Concept Models: Language Modeling in a Sentence Representation Space

Dec 11, 2024

Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation

Dec 11, 2024

2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset

Dec 11, 2024

On the Role of Speech Data in Reducing Toxicity Detection Bias

Nov 12, 2024

Linguini: A benchmark for language-agnostic linguistic reasoning

Sep 18, 2024

Towards Massive Multilingual Holistic Bias

Jun 29, 2024

Towards Red Teaming in Multimodal and Multilingual Translation

Jan 29, 2024

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

Jan 10, 2024

Seamless: Multilingual Expressive and Streaming Speech Translation

Dec 08, 2023