Picture for David Samuel

David Samuel

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 13, 2025
Viaarxiv icon

Multi-label Scandinavian Language Identification (SLIDE)

Add code
Feb 10, 2025
Figure 1 for Multi-label Scandinavian Language Identification (SLIDE)
Figure 2 for Multi-label Scandinavian Language Identification (SLIDE)
Figure 3 for Multi-label Scandinavian Language Identification (SLIDE)
Figure 4 for Multi-label Scandinavian Language Identification (SLIDE)
Viaarxiv icon

The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

Add code
Dec 12, 2024
Figure 1 for The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Figure 2 for The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Figure 3 for The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Figure 4 for The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Viaarxiv icon

Small Languages, Big Models: A Study of Continual Training on Languages of Norway

Add code
Dec 09, 2024
Figure 1 for Small Languages, Big Models: A Study of Continual Training on Languages of Norway
Figure 2 for Small Languages, Big Models: A Study of Continual Training on Languages of Norway
Figure 3 for Small Languages, Big Models: A Study of Continual Training on Languages of Norway
Figure 4 for Small Languages, Big Models: A Study of Continual Training on Languages of Norway
Viaarxiv icon

GPT or BERT: why not both?

Add code
Oct 31, 2024
Figure 1 for GPT or BERT: why not both?
Figure 2 for GPT or BERT: why not both?
Figure 3 for GPT or BERT: why not both?
Figure 4 for GPT or BERT: why not both?
Viaarxiv icon

BERTs are Generative In-Context Learners

Add code
Jun 07, 2024
Viaarxiv icon

It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments

Add code
Apr 29, 2024
Viaarxiv icon

More Room for Language: Investigating the Effect of Retrieval on Language Models

Add code
Apr 16, 2024
Viaarxiv icon

Not all layers are equally as important: Every Layer Counts BERT

Add code
Nov 07, 2023
Viaarxiv icon

Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings

Add code
Oct 30, 2023
Viaarxiv icon