Picture for Ben Bogin

Ben Bogin

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

Add code
Sep 11, 2024
Viaarxiv icon

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Add code
Jul 22, 2024
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Figure 1 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 2 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 3 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 4 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Jan 31, 2024
Figure 1 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 2 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 3 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 4 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Viaarxiv icon

Leveraging Code to Improve In-context Learning for Semantic Parsing

Add code
Nov 16, 2023
Viaarxiv icon

Answering Questions by Meta-Reasoning over Multiple Chains of Thought

Add code
Apr 25, 2023
Viaarxiv icon

Diverse Demonstrations Improve In-context Compositional Generalization

Add code
Dec 20, 2022
Viaarxiv icon

Training Vision-Language Models with Less Bimodal Supervision

Add code
Nov 01, 2022
Viaarxiv icon

Unobserved Local Structures Make Compositional Generalization Hard

Add code
Jan 15, 2022
Figure 1 for Unobserved Local Structures Make Compositional Generalization Hard
Figure 2 for Unobserved Local Structures Make Compositional Generalization Hard
Figure 3 for Unobserved Local Structures Make Compositional Generalization Hard
Figure 4 for Unobserved Local Structures Make Compositional Generalization Hard
Viaarxiv icon

COVR: A test-bed for Visually Grounded Compositional Generalization with real images

Add code
Sep 22, 2021
Figure 1 for COVR: A test-bed for Visually Grounded Compositional Generalization with real images
Figure 2 for COVR: A test-bed for Visually Grounded Compositional Generalization with real images
Figure 3 for COVR: A test-bed for Visually Grounded Compositional Generalization with real images
Figure 4 for COVR: A test-bed for Visually Grounded Compositional Generalization with real images
Viaarxiv icon