Thomas Wang

Pixtral 12B

Oct 09, 2024

Mixtral of Experts

Jan 08, 2024

FinGPT: Large Generative Models for a Small Language

Nov 03, 2023

Mistral 7B

Oct 10, 2023

OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Jun 21, 2023

StarCoder: may the source be with you!

May 09, 2023

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Mar 07, 2023

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022

What Language Model to Train if You Have One Million GPU Hours?

Nov 08, 2022

Crosslingual Generalization through Multitask Finetuning

Nov 03, 2022