Filip Wolski

Long-Term Planning and Situational Awareness in OpenAI Five

Dec 13, 2019
Via arXiv

Dota 2 with Large Scale Deep Reinforcement Learning

Dec 13, 2019
Via arXiv

Evolved Policy Gradients

Apr 29, 2018
Via arXiv

Hindsight Experience Replay

Feb 23, 2018
Via arXiv

Proximal Policy Optimization Algorithms

Aug 28, 2017
Via arXiv