Picture for Natasha Jaques

Natasha Jaques

Adaptive Accompaniment with ReaLchords

Add code
Jun 17, 2025
Viaarxiv icon

Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Improving Human-AI Coordination through Adversarial Training and Generative Models

Add code
Apr 21, 2025
Viaarxiv icon

Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination

Add code
Apr 20, 2025
Viaarxiv icon

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

Add code
Apr 04, 2025
Viaarxiv icon

ReaLJam: Real-Time Human-AI Music Jamming with Reinforcement Learning-Tuned Transformers

Add code
Feb 28, 2025
Viaarxiv icon

Learning to Cooperate with Humans using Generative Agents

Add code
Nov 21, 2024
Viaarxiv icon

InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma

Add code
Nov 15, 2024
Figure 1 for InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Figure 2 for InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Figure 3 for InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Figure 4 for InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Viaarxiv icon

Infer Human's Intentions Before Following Natural Language Instructions

Add code
Sep 26, 2024
Figure 1 for Infer Human's Intentions Before Following Natural Language Instructions
Figure 2 for Infer Human's Intentions Before Following Natural Language Instructions
Figure 3 for Infer Human's Intentions Before Following Natural Language Instructions
Figure 4 for Infer Human's Intentions Before Following Natural Language Instructions
Viaarxiv icon

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Add code
Aug 19, 2024
Viaarxiv icon