Picture for Jonne Sälevä

Jonne Sälevä

Evaluating Morphological Compositional Generalization in Large Language Models

Add code
Oct 16, 2024
Viaarxiv icon

ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages using Wikidata

Add code
May 15, 2024
Viaarxiv icon

What changes when you randomly choose BPE merge operations? Not much

Add code
May 04, 2023
Viaarxiv icon

ParaNames: A Massively Multilingual Entity Name Corpus

Add code
Mar 31, 2022
Figure 1 for ParaNames: A Massively Multilingual Entity Name Corpus
Figure 2 for ParaNames: A Massively Multilingual Entity Name Corpus
Figure 3 for ParaNames: A Massively Multilingual Entity Name Corpus
Viaarxiv icon

Toward More Meaningful Resources for Lower-resourced Languages

Add code
Feb 24, 2022
Figure 1 for Toward More Meaningful Resources for Lower-resourced Languages
Figure 2 for Toward More Meaningful Resources for Lower-resourced Languages
Figure 3 for Toward More Meaningful Resources for Lower-resourced Languages
Figure 4 for Toward More Meaningful Resources for Lower-resourced Languages
Viaarxiv icon

Mining Wikidata for Name Resources for African Languages

Add code
Apr 01, 2021
Figure 1 for Mining Wikidata for Name Resources for African Languages
Figure 2 for Mining Wikidata for Name Resources for African Languages
Figure 3 for Mining Wikidata for Name Resources for African Languages
Figure 4 for Mining Wikidata for Name Resources for African Languages
Viaarxiv icon

The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation

Add code
Mar 20, 2021
Figure 1 for The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Figure 2 for The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Figure 3 for The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Figure 4 for The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Viaarxiv icon