Picture for Adam Lerer

Adam Lerer

Tony

OpenAI o1 System Card

Add code
Dec 21, 2024
Viaarxiv icon

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Attention Sorting Combats Recency Bias In Long Context Language Models

Add code
Sep 28, 2023
Figure 1 for Attention Sorting Combats Recency Bias In Long Context Language Models
Figure 2 for Attention Sorting Combats Recency Bias In Long Context Language Models
Figure 3 for Attention Sorting Combats Recency Bias In Long Context Language Models
Figure 4 for Attention Sorting Combats Recency Bias In Long Context Language Models
Viaarxiv icon

Human-AI Coordination via Human-Regularized Search and Learning

Add code
Oct 11, 2022
Figure 1 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 2 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 3 for Human-AI Coordination via Human-Regularized Search and Learning
Viaarxiv icon

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Add code
Oct 11, 2022
Figure 1 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 2 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 3 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Figure 4 for Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
Viaarxiv icon

Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes

Add code
Jun 10, 2022
Figure 1 for Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes
Figure 2 for Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes
Figure 3 for Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes
Figure 4 for Efficient Heterogeneous Treatment Effect Estimation With Multiple Experiments and Multiple Outcomes
Viaarxiv icon

Modeling Strong and Human-Like Gameplay with KL-Regularized Search

Add code
Dec 14, 2021
Figure 1 for Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Figure 2 for Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Figure 3 for Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Figure 4 for Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Viaarxiv icon

No-Press Diplomacy from Scratch

Add code
Oct 06, 2021
Figure 1 for No-Press Diplomacy from Scratch
Figure 2 for No-Press Diplomacy from Scratch
Figure 3 for No-Press Diplomacy from Scratch
Figure 4 for No-Press Diplomacy from Scratch
Viaarxiv icon

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Add code
Jun 16, 2021
Figure 1 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 2 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 3 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 4 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Viaarxiv icon

Off-Belief Learning

Add code
Mar 06, 2021
Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon