Picture for Hugo Laurençon

Hugo Laurençon

Building and better understanding vision-language models: insights and future directions

Add code
Aug 22, 2024
Viaarxiv icon

What matters when building vision-language models?

Add code
May 03, 2024
Viaarxiv icon

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Add code
Mar 14, 2024
Viaarxiv icon

CALM : A Multi-task Benchmark for Comprehensive Assessment of Language Model Bias

Add code
Aug 24, 2023
Viaarxiv icon

OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Add code
Jun 21, 2023
Figure 1 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 2 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 3 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Figure 4 for OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Viaarxiv icon

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Add code
Mar 07, 2023
Figure 1 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 2 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 3 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 4 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Viaarxiv icon

The ROOTS Search Tool: Data Transparency for LLMs

Add code
Feb 27, 2023
Figure 1 for The ROOTS Search Tool: Data Transparency for LLMs
Figure 2 for The ROOTS Search Tool: Data Transparency for LLMs
Figure 3 for The ROOTS Search Tool: Data Transparency for LLMs
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon

Add code
Jun 22, 2022
Figure 1 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 2 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 3 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 4 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Viaarxiv icon

Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents

Add code
Sep 14, 2021
Figure 1 for Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Figure 2 for Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Figure 3 for Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Figure 4 for Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents
Viaarxiv icon