Lintang Sutawika

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Oct 21, 2024

Lessons from the Trenches on Reproducible Evaluation of Language Models

May 23, 2024

Utilizing Weak Supervision To Generate Indonesian Conservation Dataset

Oct 24, 2023

Emergent and Predictable Memorization in Large Language Models

Apr 21, 2023

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Apr 03, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages

Mar 30, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Dec 19, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022

What Language Model to Train if You Have One Million GPU Hours?

Nov 08, 2022

Crosslingual Generalization through Multitask Finetuning

Nov 03, 2022