Jelle Luijkx

EAGERx: Graph-Based Framework for Sim2real Robot Learning

Jul 05, 2024

Bas van der Heijden, Jelle Luijkx, Laura Ferranti, Jens Kober, Robert Babuska

Abstract: Sim2real, that is, the transfer of learned control policies from simulation to the real world, is an area of growing interest in robotics due to its potential to efficiently handle complex tasks. The sim2real approach faces challenges due to mismatches between simulation and reality. These discrepancies arise from inaccuracies in modeling physical phenomena and asynchronous control, among other factors. To this end, we introduce EAGERx, a framework with a unified software pipeline for both real and simulated robot learning. It can support various simulators and aids in integrating state, action and time-scale abstractions to facilitate learning. EAGERx's integrated delay simulation, domain randomization features, and proposed synchronization algorithm contribute to narrowing the sim2real gap. We demonstrate (in the context of robot learning and beyond) the efficacy of EAGERx in accommodating diverse robotic systems and maintaining consistent simulation behavior. EAGERx is open source and its code is available at https://eagerx.readthedocs.io.

* For an introductory video, see http://www.youtube.com/watch?v=D0CQNnTT010. The documentation, tutorials, and our open-source code can be found at http://eagerx.readthedocs.io
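
The abstract names delay simulation and domain randomization as two of the ingredients for narrowing the sim2real gap. The following is a minimal, generic sketch of those two ideas in plain Python. It does not use or reproduce the EAGERx API: `DelayedActionBuffer`, `sample_episode_params`, and the parameter ranges are hypothetical names and values chosen purely for illustration.

```python
import random
from collections import deque


class DelayedActionBuffer:
    """Hypothetical helper: releases each action `delay` steps after it was issued."""

    def __init__(self, delay, default=0.0):
        # Pre-fill with a default action so the first `delay` steps have something to apply.
        self._queue = deque([default] * delay)

    def step(self, action):
        # Enqueue the newest action and apply the oldest pending one.
        self._queue.append(action)
        return self._queue.popleft()


def sample_episode_params(rng):
    """Hypothetical domain randomization: draw new parameters for each episode."""
    return {
        "delay": rng.randint(0, 3),     # actuation delay in control steps (assumed range)
        "mass": rng.uniform(0.8, 1.2),  # scale factor on a nominal mass (assumed range)
    }


rng = random.Random(0)
params = sample_episode_params(rng)
buffer = DelayedActionBuffer(params["delay"])
applied = [buffer.step(a) for a in [1.0, 2.0, 3.0, 4.0]]
```

Training a policy across many such randomized episodes is the usual motivation: the policy cannot rely on one exact delay or mass, which is the property one hopes will carry over to the real system.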

ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models

Mar 15, 2024

Runyu Ma, Jelle Luijkx, Zlatan Ajanovic, Jens Kober

Abstract: In image-based robot manipulation tasks with large observation and action spaces, reinforcement learning struggles with low sample efficiency, slow training speed, and uncertain convergence. As an alternative, large pre-trained foundation models have shown promise in robotic manipulation, particularly in zero-shot and few-shot applications. However, using these models directly is unreliable due to limited reasoning capabilities and challenges in understanding physical and spatial contexts. This paper introduces ExploRLLM, a novel approach that leverages the inductive bias of foundation models (e.g. Large Language Models) to guide exploration in reinforcement learning. We also exploit these foundation models to reformulate the action and observation spaces to enhance the training efficiency in reinforcement learning. Our experiments demonstrate that guided exploration enables much quicker convergence than training without it. Additionally, we validate that ExploRLLM outperforms vanilla foundation model baselines and that the policy trained in simulation can be applied in real-world settings without additional training.

* 8 pages, 8 figures, conference IROS 2024
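
One common way to let a prior guide exploration is to mix its suggestions into action selection with some probability. The sketch below illustrates that generic idea only. The abstract does not specify the mechanism, so the epsilon-style mixing rule is an assumption, and `select_action`, `policy_action`, and `suggest_action` are hypothetical names, with `suggest_action` standing in for whatever a foundation model would propose.

```python
import random


def select_action(policy_action, suggest_action, observation, epsilon, rng):
    """With probability `epsilon`, follow the prior's suggestion; otherwise follow the policy.

    Both callables map an observation to an action. The mixing rule is an
    assumption made for illustration, not the paper's stated algorithm.
    """
    if rng.random() < epsilon:
        return suggest_action(observation), "guided"
    return policy_action(observation), "policy"


# Toy stand-ins: the "policy" always picks 0, the "prior" always picks 1.
rng = random.Random(0)
choices = [
    select_action(lambda obs: 0, lambda obs: 1, None, epsilon=0.3, rng=rng)
    for _ in range(1000)
]
guided_fraction = sum(1 for _, source in choices if source == "guided") / len(choices)
```

In this toy setup `guided_fraction` should land near `epsilon`, so the learner still gathers most of its experience from its own policy while being nudged toward regions the prior considers promising.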

PARTNR: Pick and place Ambiguity Resolving by Trustworthy iNteractive leaRning

Nov 15, 2022

Jelle Luijkx, Zlatan Ajanovic, Laura Ferranti, Jens Kober

Abstract: Several recent works show impressive results in mapping language-based human commands and image scene observations to direct robot executable policies (e.g., pick and place poses). However, these approaches do not consider the uncertainty of the trained policy and simply always execute actions suggested by the current policy as the most probable ones. This makes them vulnerable to domain shift and inefficient in the number of required demonstrations. We extend previous works and present the PARTNR algorithm that can detect ambiguities in the trained policy by analyzing multiple modalities in the pick and place poses using topological analysis. PARTNR employs an adaptive, sensitivity-based gating function that decides if additional user demonstrations are required. User demonstrations are aggregated into the dataset and used for subsequent training. In this way, the policy can adapt promptly to domain shift and it can minimize the number of required demonstrations for a well-trained policy. The adaptive threshold makes it possible to reach a user-acceptable level of ambiguity before the policy is executed autonomously, which in turn increases the trustworthiness of our system. We demonstrate the performance of PARTNR in a table-top pick and place task.

* Accepted to NeurIPS 2022 Workshop on Robot Learning; 8 pages; 4 figures; partnr-learn.github.io
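
To make the gating idea concrete, here is a deliberately simplified sketch: treat the policy output as a one-dimensional list of positive scores over candidate poses, count the peaks that come close to the best score, and ask for a demonstration when more than one peak survives. This is not the PARTNR algorithm. Simple peak counting stands in for the paper's topological analysis, the threshold is fixed rather than adapted, and `count_modes`, `needs_demonstration`, and `sensitivity` are hypothetical names.

```python
def count_modes(scores, sensitivity):
    """Count local maxima whose score is at least `sensitivity` times the best score.

    Assumes `scores` is a non-empty list of positive numbers.
    """
    best = max(scores)
    modes = 0
    for i, score in enumerate(scores):
        left = scores[i - 1] if i > 0 else float("-inf")
        right = scores[i + 1] if i < len(scores) - 1 else float("-inf")
        # Strict on the left and non-strict on the right so a plateau counts once.
        if score > left and score >= right and score >= sensitivity * best:
            modes += 1
    return modes


def needs_demonstration(scores, sensitivity):
    """Gate: more than one competitive mode means the prediction is ambiguous."""
    return count_modes(scores, sensitivity) > 1


ambiguous = needs_demonstration([0.1, 0.9, 0.2, 0.8, 0.1], sensitivity=0.5)
confident = needs_demonstration([0.1, 0.9, 0.2, 0.3, 0.1], sensitivity=0.5)
```

Raising `sensitivity` makes the gate more permissive, since fewer secondary peaks qualify as competitive. An adaptive scheme would tune this value over time, which the sketch leaves out.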
