Ariel Ekgren

SWEb: A Large Web Dataset for the Scandinavian Languages

Oct 06, 2024
Via arXiv

GPT-SW3: An Autoregressive Language Model for the Nordic Languages

May 23, 2023
Via arXiv

The Nordic Pile: A 1.2TB Nordic Dataset for Language Modeling

Mar 30, 2023
Via arXiv

Cross-lingual Transfer of Monolingual Models

Sep 15, 2021
Via arXiv

R-grams: Unsupervised Learning of Semantic Units in Natural Language

Aug 14, 2018
Via arXiv