Javier de la Rosa

The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

Dec 12, 2024
Via arXiv

Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges

Feb 02, 2024
Via arXiv

Boosting Norwegian Automatic Speech Recognition

Jul 04, 2023
Via arXiv

ALBERTI, a Multilingual Domain Specific Language Model for Poetry Analysis

Jul 03, 2023
Via arXiv

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Mar 07, 2023
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling

Jul 14, 2022
Via arXiv

Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0

Apr 11, 2022
Via arXiv

The futility of STILTs for the classification of lexical borrowings in Spanish

Sep 17, 2021
Via arXiv

Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model

Apr 19, 2021
Via arXiv