Picture for Jörg Tiedemann

Jörg Tiedemann

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

Add code
Sep 26, 2024
Viaarxiv icon

Two Stacks Are Better Than One: A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives

Add code
Jul 22, 2024
Viaarxiv icon

Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?

Add code
Mar 25, 2024
Viaarxiv icon

A New Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 20, 2024
Viaarxiv icon

SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

Add code
Mar 20, 2024
Viaarxiv icon

MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki

Add code
Mar 12, 2024
Figure 1 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 2 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 3 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Figure 4 for MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki
Viaarxiv icon

MaLA-500: Massive Language Adaptation of Large Language Models

Add code
Jan 24, 2024
Viaarxiv icon

Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health

Add code
Apr 20, 2023
Viaarxiv icon

Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging

Add code
Apr 10, 2023
Viaarxiv icon

Democratizing Machine Translation with OPUS-MT

Add code
Dec 04, 2022
Viaarxiv icon