Picture for Supratik Paul

Supratik Paul

Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula

Add code
Dec 02, 2022
Viaarxiv icon

Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving

Add code
Oct 18, 2022
Figure 1 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 2 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 3 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 4 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Viaarxiv icon

Fast Efficient Hyperparameter Tuning for Policy Gradients

Add code
Feb 18, 2019
Figure 1 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 2 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 3 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Figure 4 for Fast Efficient Hyperparameter Tuning for Policy Gradients
Viaarxiv icon

Learning from Demonstration in the Wild

Add code
Nov 08, 2018
Figure 1 for Learning from Demonstration in the Wild
Figure 2 for Learning from Demonstration in the Wild
Figure 3 for Learning from Demonstration in the Wild
Figure 4 for Learning from Demonstration in the Wild
Viaarxiv icon

Fingerprint Policy Optimisation for Robust Reinforcement Learning

Add code
Sep 15, 2018
Figure 1 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 2 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 3 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Figure 4 for Fingerprint Policy Optimisation for Robust Reinforcement Learning
Viaarxiv icon

Alternating Optimisation and Quadrature for Robust Control

Add code
Dec 18, 2017
Figure 1 for Alternating Optimisation and Quadrature for Robust Control
Figure 2 for Alternating Optimisation and Quadrature for Robust Control
Figure 3 for Alternating Optimisation and Quadrature for Robust Control
Figure 4 for Alternating Optimisation and Quadrature for Robust Control
Viaarxiv icon