Roberta Raileanu

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Dec 11, 2024
Via arXiv

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

Sep 12, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games

Jul 05, 2024
Via arXiv

DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft

Apr 23, 2024
Via arXiv

Teaching Large Language Models to Reason with Reinforcement Learning

Mar 07, 2024
Via arXiv

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024
Via arXiv

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Feb 21, 2024
Via arXiv

The Generalization Gap in Offline Reinforcement Learning

Dec 10, 2023
Via arXiv

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Dec 06, 2023
Via arXiv