Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Robert Babuska

Beyond Behavior Cloning: Robustness through Interactive Imitation and Contrastive Learning

Feb 11, 2025

Zhaoting Li, Rodrigo Pérez-Dattari, Robert Babuska, Cosimo Della Santina, Jens Kober

Abstract:Behavior cloning (BC) traditionally relies on demonstration data, assuming the demonstrated actions are optimal. This can lead to overfitting under noisy data, particularly when expressive models are used (e.g., the energy-based model in Implicit BC). To address this, we extend behavior cloning into an iterative process of optimal action estimation within the Interactive Imitation Learning framework. Specifically, we introduce Contrastive policy Learning from Interactive Corrections (CLIC). CLIC leverages human corrections to estimate a set of desired actions and optimizes the policy to select actions from this set. We provide theoretical guarantees for the convergence of the desired action set to optimal actions in both single and multiple optimal action cases. Extensive simulation and real-robot experiments validate CLIC's advantages over existing state-of-the-art methods, including stable training of energy-based models, robustness to feedback noise, and adaptability to diverse feedback types beyond demonstrations. Our code will be publicly available soon.

Via

Access Paper or Ask Questions

EAGERx: Graph-Based Framework for Sim2real Robot Learning

Jul 05, 2024

Bas van der Heijden, Jelle Luijkx, Laura Ferranti, Jens Kober, Robert Babuska

Figure 1 for EAGERx: Graph-Based Framework for Sim2real Robot Learning

Figure 2 for EAGERx: Graph-Based Framework for Sim2real Robot Learning

Figure 3 for EAGERx: Graph-Based Framework for Sim2real Robot Learning

Figure 4 for EAGERx: Graph-Based Framework for Sim2real Robot Learning

Abstract:Sim2real, that is, the transfer of learned control policies from simulation to real world, is an area of growing interest in robotics due to its potential to efficiently handle complex tasks. The sim2real approach faces challenges due to mismatches between simulation and reality. These discrepancies arise from inaccuracies in modeling physical phenomena and asynchronous control, among other factors. To this end, we introduce EAGERx, a framework with a unified software pipeline for both real and simulated robot learning. It can support various simulators and aids in integrating state, action and time-scale abstractions to facilitate learning. EAGERx's integrated delay simulation, domain randomization features, and proposed synchronization algorithm contribute to narrowing the sim2real gap. We demonstrate (in the context of robot learning and beyond) the efficacy of EAGERx in accommodating diverse robotic systems and maintaining consistent simulation behavior. EAGERx is open source and its code is available at https://eagerx.readthedocs.io.

* For an introductory video, see http://www.youtube.com/watch?v=D0CQNnTT010 . The documentation, tutorials, and our open-source code can be found at http://eagerx.readthedocs.io

Via

Access Paper or Ask Questions

A Vision-Guided Robotic System for Grasping Harvested Tomato Trusses in Cluttered Environments

Sep 29, 2023

Luuk van den Bent, Tomás Coleman, Robert Babuska

Figure 1 for A Vision-Guided Robotic System for Grasping Harvested Tomato Trusses in Cluttered Environments

Figure 2 for A Vision-Guided Robotic System for Grasping Harvested Tomato Trusses in Cluttered Environments

Figure 3 for A Vision-Guided Robotic System for Grasping Harvested Tomato Trusses in Cluttered Environments

Figure 4 for A Vision-Guided Robotic System for Grasping Harvested Tomato Trusses in Cluttered Environments

Abstract:Currently, truss tomato weighing and packaging require significant manual work. The main obstacle to automation lies in the difficulty of developing a reliable robotic grasping system for already harvested trusses. We propose a method to grasp trusses that are stacked in a crate with considerable clutter, which is how they are commonly stored and transported after harvest. The method consists of a deep learning-based vision system to first identify the individual trusses in the crate and then determine a suitable grasping location on the stem. To this end, we have introduced a grasp pose ranking algorithm with online learning capabilities. After selecting the most promising grasp pose, the robot executes a pinch grasp without needing touch sensors or geometric models. Lab experiments with a robotic manipulator equipped with an eye-in-hand RGB-D camera showed a 100% clearance rate when tasked to pick all trusses from a pile. 93% of the trusses were successfully grasped on the first try, while the remaining 7% required more attempts.

* 7 pages, 7 figures

Via

Access Paper or Ask Questions

Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

Sep 19, 2022

Jiri Sedlar, Karla Stepanova, Matus Tuna, Radoslav Skoviera, Jan Kristof Behrens, Gabriela Sejnova, Josef Sivic, Robert Babuska

Figure 1 for Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

Figure 2 for Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

Figure 3 for Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

Figure 4 for Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

Abstract:This paper introduces a dataset for training and evaluating methods for 6D pose estimation of hand-held tools in task demonstrations captured by a standard RGB camera. Despite the significant progress of 6D pose estimation methods, their performance is usually limited for heavily occluded objects, which is a common case in imitation learning where the object is typically partially occluded by the manipulating hand. Currently, there is a lack of datasets that would enable the development of robust 6D pose estimation methods for these conditions. To overcome this problem, we collect a new dataset (Imitrob) aimed at 6D pose estimation in imitation learning and other applications where a human holds a tool and performs a task. The dataset contains image sequences of three different tools and six manipulation tasks with two camera viewpoints, four human subjects, and left/right hand. Each image is accompanied by an accurate ground truth measurement of the 6D object pose, obtained by the HTC Vive motion tracking device. The use of the dataset is demonstrated by training and evaluating a recent 6D object pose estimation method (DOPE) in various setups. The dataset and code are publicly available at http://imitrob.ciirc.cvut.cz/imitrobdataset.php.

Via

Access Paper or Ask Questions

Geometry-Based Grasping of Vine Tomatoes

Mar 01, 2021

Taeke de Haan, Padmaja Kulkarni, Robert Babuska

Figure 1 for Geometry-Based Grasping of Vine Tomatoes

Figure 2 for Geometry-Based Grasping of Vine Tomatoes

Figure 3 for Geometry-Based Grasping of Vine Tomatoes

Figure 4 for Geometry-Based Grasping of Vine Tomatoes

Abstract:We propose a geometry-based grasping method for vine tomatoes. It relies on a computer-vision pipeline to identify the required geometric features of the tomatoes and of the truss stem. The grasping method then uses a geometric model of the robotic hand and the truss to determine a suitable grasping location on the stem. This approach allows for grasping tomato trusses without requiring delicate contact sensors or complex mechanistic models and under minimal risk of damaging the tomatoes. Lab experiments were conducted to validate the proposed methods, using an RGB-D camera and a low-cost robotic manipulator. The success rate was 83% to 92%, depending on the type of truss.

* 8 pages, 12 figures. This work has been submitted to the IEEE for possible publication (IROS + RAL). Copyright may be transferred without notice, after which this version may no longer be accessible

Via

Access Paper or Ask Questions

GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector

Feb 24, 2021

Osama Mazhar, Jens Kober, Robert Babuska

Figure 1 for GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector

Figure 2 for GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector

Figure 3 for GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector

Figure 4 for GEM: Glare or Gloom, I Can Still See You -- End-to-End Multimodal Object Detector

Abstract:Deep neural networks designed for vision tasks are often prone to failure when they encounter environmental conditions not covered by the training data. Efficient fusion strategies for multi-sensor configurations can enhance the robustness of the detection algorithms by exploiting redundancy from different sensor streams. In this paper, we propose sensor-aware multi-modal fusion strategies for 2D object detection in harsh-lighting conditions. Our network learns to estimate the measurement reliability of each sensor modality in the form of scalar weights and masks, without prior knowledge of the sensor characteristics. The obtained weights are assigned to the extracted feature maps which are subsequently fused and passed to the transformer encoder-decoder network for object detection. This is critical in the case of asymmetric sensor failures and to prevent any tragic consequences. Through extensive experimentation, we show that the proposed strategies out-perform the existing state-of-the-art methods on the FLIR-Thermal dataset, improving the mAP up-to 25.2%. We also propose a new "r-blended" hybrid depth modality for RGB-D multi-modal detection tasks. Our proposed method also obtained promising results on the SUNRGB-D dataset.

* 7 pages, 5 figures

Via

Access Paper or Ask Questions

DeepKoCo: Efficient latent planning with an invariant Koopman representation

Nov 25, 2020

Bas van der Heijden, Laura Ferranti, Jens Kober, Robert Babuska

Figure 1 for DeepKoCo: Efficient latent planning with an invariant Koopman representation

Figure 2 for DeepKoCo: Efficient latent planning with an invariant Koopman representation

Figure 3 for DeepKoCo: Efficient latent planning with an invariant Koopman representation

Abstract:This paper presents DeepKoCo, a novel model-based agent that learns a latent Koopman representation from images. This representation allows DeepKoCo to plan efficiently using linear control methods, such as linear model predictive control. Compared to traditional agents, DeepKoCo is invariant to task-irrelevant dynamics, thanks to the use of a tailored lossy autoencoder network that allows DeepKoCo to learn latent dynamics that reconstruct and predict only observed costs, rather than all observed dynamics. As our results show, DeepKoCo achieves a similar final performance as traditional model-free methods on complex control tasks, while being considerably more robust to distractor dynamics, making the proposed agent more amenable for real-life applications.

Via

Access Paper or Ask Questions

Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

Jan 24, 2019

Radoslav Skoviera, Karla Stepanova, Michael Tesar, Gabriela Sejnova, Jiri Sedlar, Michal Vavrecka, Robert Babuska, Josef Sivic

Figure 1 for Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

Figure 2 for Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

Figure 3 for Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

Figure 4 for Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

Abstract:In this paper, we consider the problem of learning object manipulation tasks from human demonstration using RGB or RGB-D cameras. We highlight the key challenges in capturing sufficiently good data with no tracking devices - starting from sensor selection and accurate 6DoF pose estimation to natural language processing. In particular, we focus on two showcases: gluing task with a glue gun and simple block-stacking with variable blocks. Furthermore, we discuss how a linguistic description of the task could help to improve the accuracy of task description. We also present the whole architecture of our transfer of the imitated task to the simulated and real robot environment.

* The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018, Workshop on: Towards Intelligent Social Robots: From Naive Robots to Robot Sapiens http://intelligent-social-robots-ws.com/materials/

Via

Access Paper or Ask Questions

Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Nov 29, 2015

Frederik Ruelens, Bert Claessens, Salman Quaiyum, Bart De Schutter, Robert Babuska, Ronnie Belmans

Figure 1 for Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Figure 2 for Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Figure 3 for Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Figure 4 for Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice

Abstract:Electric water heaters have the ability to store energy in their water buffer without impacting the comfort of the end user. This feature makes them a prime candidate for residential demand response. However, the stochastic and nonlinear dynamics of electric water heaters, makes it challenging to harness their flexibility. Driven by this challenge, this paper formulates the underlying sequential decision-making problem as a Markov decision process and uses techniques from reinforcement learning. Specifically, we apply an auto-encoder network to find a compact feature representation of the sensor measurements, which helps to mitigate the curse of dimensionality. A wellknown batch reinforcement learning technique, fitted Q-iteration, is used to find a control policy, given this feature representation. In a simulation-based experiment using an electric water heater with 50 temperature sensors, the proposed method was able to achieve good policies much faster than when using the full state information. In a lab experiment, we apply fitted Q-iteration to an electric water heater with eight temperature sensors. Further reducing the state vector did not improve the results of fitted Q-iteration. The results of the lab experiment, spanning 40 days, indicate that compared to a thermostat controller, the presented approach was able to reduce the total cost of energy consumption of the electric water heater by 15%.

* Submitted to IEEE transaction on smart grid

Via

Access Paper or Ask Questions

Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

Sep 09, 2015

Sofie Haesaert, Robert Babuska, Alessandro Abate

Figure 1 for Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

Figure 2 for Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

Figure 3 for Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

Figure 4 for Sampling-based Approximations with Quantitative Performance for the Probabilistic Reach-Avoid Problem over General Markov Processes

Abstract:This article deals with stochastic processes endowed with the Markov (memoryless) property and evolving over general (uncountable) state spaces. The models further depend on a non-deterministic quantity in the form of a control input, which can be selected to affect the probabilistic dynamics. We address the computation of maximal reach-avoid specifications, together with the synthesis of the corresponding optimal controllers. The reach-avoid specification deals with assessing the likelihood that any finite-horizon trajectory of the model enters a given goal set, while avoiding a given set of undesired states. This article newly provides an approximate computational scheme for the reach-avoid specification based on the Fitted Value Iteration algorithm, which hinges on random sample extractions, and gives a-priori computable formal probabilistic bounds on the error made by the approximation algorithm: as such, the output of the numerical scheme is quantitatively assessed and thus meaningful for safety-critical applications. Furthermore, we provide tighter probabilistic error bounds that are sample-based. The overall computational scheme is put in relationship with alternative approximation algorithms in the literature, and finally its performance is practically assessed over a benchmark case study.

Via

Access Paper or Ask Questions