Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

René Traoré

DisCoRL: Continual Reinforcement Learning via Policy Distillation

Jul 11, 2019

René Traoré, Hugo Caselles-Dupré, Timothée Lesort, Te Sun, Guanghang Cai, Natalia Díaz-Rodríguez, David Filliat

Figure 1 for DisCoRL: Continual Reinforcement Learning via Policy Distillation

Figure 2 for DisCoRL: Continual Reinforcement Learning via Policy Distillation

Figure 3 for DisCoRL: Continual Reinforcement Learning via Policy Distillation

Figure 4 for DisCoRL: Continual Reinforcement Learning via Policy Distillation

Abstract:In multi-task reinforcement learning there are two main challenges: at training time, the ability to learn different policies with a single model; at test time, inferring which of those policies applying without an external signal. In the case of continual reinforcement learning a third challenge arises: learning tasks sequentially without forgetting the previous ones. In this paper, we tackle these challenges by proposing DisCoRL, an approach combining state representation learning and policy distillation. We experiment on a sequence of three simulated 2D navigation tasks with a 3 wheel omni-directional robot. Moreover, we tested our approach's robustness by transferring the final policy into a real life setting. The policy can solve all tasks and automatically infer which one to run.

* arXiv admin note: text overlap with arXiv:1906.04452

Via

Access Paper or Ask Questions

Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

Jun 11, 2019

René Traoré, Hugo Caselles-Dupré, Timothée Lesort, Te Sun, Natalia Díaz-Rodríguez, David Filliat

Figure 1 for Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

Figure 2 for Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

Figure 3 for Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

Figure 4 for Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

Abstract:We focus on the problem of teaching a robot to solve tasks presented sequentially, i.e., in a continual learning scenario. The robot should be able to solve all tasks it has encountered, without forgetting past tasks. We provide preliminary work on applying Reinforcement Learning to such setting, on 2D navigation tasks for a 3 wheel omni-directional robot. Our approach takes advantage of state representation learning and policy distillation. Policies are trained using learned features as input, rather than raw observations, allowing better sample efficiency. Policy distillation is used to combine multiple policies into a single one that solves all encountered tasks.

* accepted to the Workshop on Multi-Task and Lifelong Reinforcement Learning, ICML 2019

Via

Access Paper or Ask Questions

Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Feb 03, 2019

Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, David Filliat

Figure 1 for Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Figure 2 for Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Figure 3 for Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Abstract:Scaling end-to-end reinforcement learning to control real robots from vision presents a series of challenges, in particular in terms of sample efficiency. Against end-to-end learning, state representation learning can help learn a compact, efficient and relevant representation of states that speeds up policy learning, reducing the number of samples needed, and that is easier to interpret. We evaluate several state representation learning methods on goal based robotics tasks and propose a new unsupervised model that stacks representations and combines strengths of several of these approaches. This method encodes all the relevant features, performs on par or better than end-to-end learning, and is robust to hyper-parameters change.

* Github repo: https://github.com/araffin/srl-zoo Documentation: https://srl-zoo.readthedocs.io/en/latest/, As part of SRL-Toolbox: https://s-rl-toolbox.readthedocs.io/en/latest/

Via

Access Paper or Ask Questions

S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

Oct 10, 2018

Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, David Filliat

Figure 1 for S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

Figure 2 for S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

Figure 3 for S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

Figure 4 for S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

Abstract:State representation learning aims at learning compact representations from raw observations in robotics and control applications. Approaches used for this objective are auto-encoders, learning forward models, inverse dynamics or learning using generic priors on the state characteristics. However, the diversity in applications and methods makes the field lack standard evaluation datasets, metrics and tasks. This paper provides a set of environments, data generators, robotic control tasks, metrics and tools to facilitate iterative state representation learning and evaluation in reinforcement learning settings.

* Github repo: https://github.com/araffin/robotics-rl-srl Documentation: https://s-rl-toolbox.readthedocs.io/en/latest/

Via

Access Paper or Ask Questions