Sumedh Sontakke

Value Explicit Pretraining for Goal-Based Transfer Learning

Dec 19, 2023

Kiran Lekkala, Henghui Bao, Sumedh Sontakke, Laurent Itti

Abstract: We propose a method that allows for learning task-agnostic representations based on value function estimates from a sequence of observations where the last frame corresponds to a goal. These representations would learn to relate states across different tasks, based on the temporal distance to the goal state, irrespective of the appearance changes and dynamics. This method could be used to transfer learnt policies/skills to unseen related tasks.

* Accepted at CoRL 2023 Workshop on PRL
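
A minimal sketch of the idea of relating states by temporal distance to the goal, not the authors' implementation. It assumes a sparse reward of 1 on the final (goal) frame and a discount factor `gamma`; neither is given in the abstract, and the function names `discounted_goal_values` and `closest_by_value` are hypothetical. Under those assumptions, frames from different tasks that are equally far from their goals receive the same value estimate.

```python
def discounted_goal_values(num_frames: int, gamma: float = 0.9) -> list[float]:
    """Value of each frame, assuming only the last (goal) frame yields reward 1.

    Frame t in a sequence of length T gets gamma ** (T - 1 - t), so the value
    depends only on how many steps remain until the goal.
    """
    if num_frames < 1:
        raise ValueError("need at least one frame")
    if not 0.0 < gamma <= 1.0:
        raise ValueError("gamma must be in (0, 1]")
    last = num_frames - 1
    return [gamma ** (last - t) for t in range(num_frames)]


def closest_by_value(query_value: float, values: list[float]) -> int:
    """Index of the frame whose value estimate is nearest to query_value."""
    return min(range(len(values)), key=lambda i: abs(values[i] - query_value))


# Two hypothetical tasks with sequences of different lengths.
task_a = discounted_goal_values(5)   # frame 2 is two steps from the goal
task_b = discounted_goal_values(8)   # frame 5 is two steps from the goal

# Frame 2 of task A is matched to frame 5 of task B.
match = closest_by_value(task_a[2], task_b)
```

Here the match is made on value alone, which is why appearance and dynamics play no role in this toy version; a learned encoder would have to produce such estimates from image observations.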

Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions

Sep 18, 2023

Yevgen Chebotar, Quan Vuong, Alex Irpan, Karol Hausman, Fei Xia, Yao Lu, Aviral Kumar, Tianhe Yu, Alexander Herzog, Karl Pertsch(+15 more)

Abstract: In this work, we present a scalable reinforcement learning method for training multi-task policies from large offline datasets that can leverage both human demonstrations and autonomously collected data. Our method uses a Transformer to provide a scalable representation for Q-functions trained via offline temporal difference backups. We therefore refer to the method as Q-Transformer. By discretizing each action dimension and representing the Q-value of each action dimension as separate tokens, we can apply effective high-capacity sequence modeling techniques for Q-learning. We present several design decisions that enable good performance with offline RL training, and show that Q-Transformer outperforms prior offline RL algorithms and imitation learning techniques on a large diverse real-world robotic manipulation task suite. The project's website and videos can be found at https://q-transformer.github.io

* See website at https://q-transformer.github.io
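
The per-dimension discretization described in the abstract can be illustrated with a small sketch. This is not the Q-Transformer code: the uniform binning, the greedy dimension-by-dimension selection, and every name here (`discretize`, `bin_center`, `greedy_action_tokens`, `toy_q_fn`) are assumptions made for illustration, and `toy_q_fn` is a hand-written stand-in for a learned Transformer.

```python
from typing import Callable, Sequence


def discretize(value: float, low: float, high: float, num_bins: int) -> int:
    """Map a continuous value in [low, high] to a bin index in [0, num_bins - 1]."""
    clipped = min(max(value, low), high)
    index = int((clipped - low) / (high - low) * num_bins)
    return min(index, num_bins - 1)  # the upper edge falls into the last bin


def bin_center(index: int, low: float, high: float, num_bins: int) -> float:
    """Continuous value at the center of a bin, used to decode a token."""
    width = (high - low) / num_bins
    return low + (index + 0.5) * width


# A Q-function that, given the tokens chosen so far, returns one Q-value
# per bin of the next action dimension.
QFn = Callable[[Sequence[int]], Sequence[float]]


def greedy_action_tokens(q_fn: QFn, num_dims: int) -> list[int]:
    """Choose one bin per action dimension, each conditioned on earlier choices."""
    tokens: list[int] = []
    for _ in range(num_dims):
        q_values = q_fn(tokens)
        best = max(range(len(q_values)), key=lambda i: q_values[i])
        tokens.append(best)
    return tokens


def toy_q_fn(prefix: Sequence[int]) -> list[float]:
    """Hypothetical Q-values over 4 bins; prefers bin (len(prefix) + 1) % 4."""
    target = (len(prefix) + 1) % 4
    return [1.0 if b == target else 0.0 for b in range(4)]


tokens = greedy_action_tokens(toy_q_fn, num_dims=3)
action = [bin_center(t, -1.0, 1.0, 4) for t in tokens]
```

Treating each dimension as its own token keeps the number of Q-values per step equal to the number of bins, rather than growing exponentially with the number of action dimensions.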

RT-1: Robotics Transformer for Real-World Control at Scale

Dec 13, 2022

Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu(+41 more)

Abstract: By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets to a high level of performance. While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, where the generalization capabilities of the models are particularly critical due to the difficulty of collecting real-world robotic data. We argue that one of the keys to the success of such general robotic models lies with open-ended task-agnostic training, combined with high-capacity architectures that can absorb all of the diverse, robotic data. In this paper, we present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties. We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks. The project's website and videos can be found at robotics-transformer.github.io

* See website at robotics-transformer.github.io
