Andrei Lupu

Jack

CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants

Oct 28, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

Behaviour Distillation

Jun 21, 2024
Via arXiv

Discovering Minimal Reinforcement Learning Environments

Jun 18, 2024
Via arXiv

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024
Via arXiv

JaxMARL: Multi-Agent RL Environments in JAX

Nov 20, 2023
Via arXiv

Grounding Aleatoric Uncertainty in Unsupervised Environment Design

Jul 11, 2022
Via arXiv

Option-critic in cooperative multi-agent systems

Jan 06, 2020
Via arXiv