Anton Bakhtin

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Jun 28, 2023 (via arXiv)

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Oct 11, 2022 (via arXiv)

Modeling Strong and Human-Like Gameplay with KL-Regularized Search

Dec 14, 2021 (via arXiv)

No-Press Diplomacy from Scratch

Oct 06, 2021 (via arXiv)

Physical Reasoning Using Dynamics-Aware Models

Feb 20, 2021 (via arXiv)

Human-Level Performance in No-Press Diplomacy via Equilibrium Search

Oct 06, 2020 (via arXiv)

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

Jul 27, 2020 (via arXiv)

Residual Energy-Based Models for Text Generation

Apr 22, 2020 (via arXiv)

Energy-Based Models for Text

Apr 06, 2020 (via arXiv)

Language Models as Knowledge Bases?

Sep 04, 2019 (via arXiv)