Picture for Dhruva TB

Dhruva TB

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Add code
Jun 15, 2021
Figure 1 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 2 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 3 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 4 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Viaarxiv icon

Distributed Distributional Deterministic Policy Gradients

Add code
Apr 23, 2018
Figure 1 for Distributed Distributional Deterministic Policy Gradients
Figure 2 for Distributed Distributional Deterministic Policy Gradients
Figure 3 for Distributed Distributional Deterministic Policy Gradients
Figure 4 for Distributed Distributional Deterministic Policy Gradients
Viaarxiv icon

Probing Physics Knowledge Using Tools from Developmental Psychology

Add code
Apr 03, 2018
Figure 1 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 2 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 3 for Probing Physics Knowledge Using Tools from Developmental Psychology
Figure 4 for Probing Physics Knowledge Using Tools from Developmental Psychology
Viaarxiv icon

Emergence of Locomotion Behaviours in Rich Environments

Add code
Jul 10, 2017
Figure 1 for Emergence of Locomotion Behaviours in Rich Environments
Figure 2 for Emergence of Locomotion Behaviours in Rich Environments
Figure 3 for Emergence of Locomotion Behaviours in Rich Environments
Figure 4 for Emergence of Locomotion Behaviours in Rich Environments
Viaarxiv icon

Learning human behaviors from motion capture by adversarial imitation

Add code
Jul 10, 2017
Figure 1 for Learning human behaviors from motion capture by adversarial imitation
Figure 2 for Learning human behaviors from motion capture by adversarial imitation
Figure 3 for Learning human behaviors from motion capture by adversarial imitation
Figure 4 for Learning human behaviors from motion capture by adversarial imitation
Viaarxiv icon