Abstract: Deep Reinforcement Learning (DRL) in simulation often results in brittle and unrealistic learning outcomes. To push the agent towards more desirable solutions, prior information can be injected into the learning process through, for instance, reward shaping, expert data, or motion primitives. We propose an additional inductive bias for robot learning: latent actions learned from expert demonstrations as priors in the action space. We show that these action priors can be learned from only a single open-loop gait cycle using a simple autoencoder. Combining these latent action priors with established style rewards for imitation in DRL achieves performance above the level of the expert demonstration and leads to more desirable gaits. Further, action priors substantially improve performance on transfer tasks, even leading to gait transitions at higher target speeds. Videos and code are available at https://sites.google.com/view/latent-action-priors.
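A minimal sketch of the core idea, assuming a plain PyTorch autoencoder over joint-space actions; the action and latent dimensions, network sizes, and synthetic gait data below are illustrative assumptions, not the paper's exact configuration:

```python
# Learn a latent action space from one expert gait cycle with a simple
# autoencoder, then use the frozen decoder as an action-space prior.
import torch
import torch.nn as nn

ACT_DIM, LATENT_DIM = 12, 4          # e.g. 12 joint targets, 4-D latent action

class ActionAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(ACT_DIM, 32), nn.Tanh(),
                                 nn.Linear(32, LATENT_DIM))
        self.dec = nn.Sequential(nn.Linear(LATENT_DIM, 32), nn.Tanh(),
                                 nn.Linear(32, ACT_DIM))

    def forward(self, a):
        return self.dec(self.enc(a))

# One open-loop gait cycle: T timesteps of expert joint actions (placeholder data).
T = 100
gait_cycle = torch.sin(torch.linspace(0, 2 * torch.pi, T).unsqueeze(1)
                       * torch.arange(1, ACT_DIM + 1))

ae = ActionAutoencoder()
opt = torch.optim.Adam(ae.parameters(), lr=1e-3)
for _ in range(2000):                 # reconstruction loss on the single cycle
    loss = nn.functional.mse_loss(ae(gait_cycle), gait_cycle)
    opt.zero_grad(); loss.backward(); opt.step()

# During DRL, the policy outputs a latent action z; the frozen decoder maps
# it to full joint commands, constraining exploration to expert-like actions.
z = torch.zeros(LATENT_DIM)           # stand-in for a policy output
joint_command = ae.dec(z).detach()
```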
Abstract: Handling orientations of robots and objects is a crucial aspect of many applications. Yet, all too often, there is a lack of mathematical correctness when dealing with orientations, especially in learning pipelines involving, for example, artificial neural networks. In this paper, we investigate reinforcement learning with orientations and propose a simple modification of the network's input and output that adheres to the Lie group structure of orientations. As a result, we obtain an easy and efficient implementation that is directly usable with existing learning libraries and achieves significantly better performance than other common orientation representations. We briefly introduce Lie theory specifically for orientations in robotics to motivate and outline our approach. Subsequently, a thorough empirical evaluation of different combinations of orientation representations for states and actions demonstrates the superior performance of our proposed approach in several scenarios: direct orientation control, end-effector orientation control, and pick-and-place tasks.
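One way such a Lie-group-respecting input/output modification can look in practice, sketched with SciPy's rotation utilities; the 9-D matrix input and 3-D tangent-space output are a common choice and an assumption here, not necessarily the paper's exact parametrization:

```python
# Feed orientations to the network as rotation matrices and interpret its
# 3-D output through the SO(3) exponential map, respecting the group structure.
import numpy as np
from scipy.spatial.transform import Rotation as R

def orientation_to_input(rot: R) -> np.ndarray:
    # Flattened rotation matrix: a smooth, unique 9-D state representation.
    return rot.as_matrix().reshape(-1)

def output_to_rotation(tangent: np.ndarray, current: R) -> R:
    # The 3-D network output lives in the tangent space (rotation vector);
    # the exponential map turns it into a group element applied to the pose.
    return R.from_rotvec(tangent) * current

rot = R.from_euler("xyz", [0.1, -0.3, 0.7])
x = orientation_to_input(rot)            # 9-D network input
delta = np.array([0.05, 0.0, -0.02])     # stand-in for a policy output
new_rot = output_to_rotation(delta, rot)
```

This avoids the discontinuities and ambiguities of Euler angles or raw quaternions at the network boundary, which is what typically breaks learning with naive orientation representations.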
Abstract: In this paper, we investigate the effect of the unpredictability of surrounding cars on an ego-car performing a driving maneuver. We use Maximum Entropy Inverse Reinforcement Learning to model reward functions for an ego-car conducting a lane change in a highway setting. We define a new feature based on the unpredictability of surrounding cars and use it in the reward function. We learn two reward functions from human data, a baseline and one that incorporates our unpredictability feature, and then compare their performance through a quantitative and qualitative evaluation. Our evaluation demonstrates that incorporating the unpredictability feature leads to a better fit of human-generated test data. These results encourage further investigation of the effect of unpredictability on driving behavior.
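A minimal sketch of the Maximum Entropy IRL weight update for a linear reward r(s) = w · φ(s), where one feature would encode surrounding-car unpredictability; the feature names, dimensions, and the stand-in feature expectations are placeholder assumptions (in the paper they come from human demonstrations and the learner's induced policy):

```python
import numpy as np

def maxent_irl_step(w, expert_feat_exp, policy_feat_exp, lr=0.1):
    # MaxEnt log-likelihood gradient: expert feature expectation minus the
    # expectation under the current reward's soft-optimal policy.
    return w + lr * (expert_feat_exp - policy_feat_exp)

# Features: e.g. [speed match, smoothness, gap size, unpredictability]
w = np.zeros(4)
expert_feat_exp = np.array([0.8, 0.6, 0.4, 0.2])   # placeholder for human data
for _ in range(100):
    policy_feat_exp = np.tanh(w)   # placeholder for a rollout-based estimate
    w = maxent_irl_step(w, expert_feat_exp, policy_feat_exp)
```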
Abstract: Tag-based visual-inertial localization is a lightweight method for enabling autonomous data collection missions of low-cost unmanned aerial vehicles (UAVs) in indoor construction environments. However, finding the optimal tag configuration (i.e., number, size, and location) on dynamic construction sites remains challenging. This paper proposes a perception-aware genetic-algorithm-based tag placement planner (PGA-TaPP) that determines the optimal tag configuration using 4D-BIM, considering project progress, safety requirements, and the UAV's localizability. The proposed method provides a 4D plan for tag placement by maximizing localizability in user-specified regions of interest (ROIs) while limiting installation costs. Localizability is quantified using the Fisher information matrix (FIM) and encapsulated in navigable grids. Experimental results show the effectiveness of our method in finding an optimal 4D tag placement plan for the robust localization of UAVs at indoor sites under construction.
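A minimal sketch of a FIM-based localizability score that a genetic algorithm could maximize over candidate tag placements; the range-only measurement model, noise level, and omission of visibility checks are simplifying assumptions, not the paper's exact formulation:

```python
import numpy as np

def localizability(pos, tag_positions, sigma=0.05):
    # Fisher information about the 3-D position from range measurements
    # to each tag: each measurement contributes a rank-1 term along the
    # unit bearing to that tag.
    fim = np.zeros((3, 3))
    for tag in tag_positions:
        d = tag - pos
        r = np.linalg.norm(d)
        if r < 1e-6:
            continue
        u = (d / r).reshape(3, 1)
        fim += (u @ u.T) / sigma**2
    # Smallest eigenvalue: information in the worst-constrained direction.
    return float(np.linalg.eigvalsh(fim)[0])

tags = np.array([[0.0, 0.0, 2.5], [4.0, 0.0, 2.5], [2.0, 3.0, 2.5]])
print(localizability(np.array([2.0, 1.0, 1.0]), tags))
```

Aggregating such scores over the navigable grid cells of an ROI, per construction phase, gives the fitness term the planner trades off against installation cost.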
Abstract: Robots and automated systems are increasingly being introduced to unknown and dynamic environments, where they are required to handle disturbances, unmodeled dynamics, and parametric uncertainties. Robust and adaptive control strategies are required to achieve high performance in these dynamic environments. In this paper, we propose a novel adaptive model predictive controller that combines model predictive control (MPC) with an underlying $\mathcal{L}_1$ adaptive controller to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a predefined way, as specified by a reference model. A higher-level model predictive controller then uses this reference model to calculate the optimal reference input based on a cost function, while taking into account input and state constraints. We focus on the experimental validation of the proposed approach and demonstrate its effectiveness in experiments on a quadrotor. We show that the proposed approach achieves lower trajectory tracking error than non-predictive, adaptive approaches and a predictive, non-adaptive approach, even when external wind disturbances are applied.
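A minimal sketch of the architecture on scalar dynamics: an $\mathcal{L}_1$-style inner loop (state predictor, adaptation law, low-pass filtered control) makes the plant behave like a simple reference model, which an outer predictive layer can then plan against. The dynamics, gains, and the constant reference input standing in for the MPC layer are all illustrative assumptions:

```python
import numpy as np

dt, am, b = 0.01, -2.0, 1.0    # reference model: x_dot = am*x + b*(u + sigma)

def l1_step(x, x_hat, u_filt, sigma_hat, r, gamma=100.0, k_filt=20.0):
    # State predictor based on the reference model and disturbance estimate.
    x_hat += dt * (am * x_hat + b * (u_filt + sigma_hat))
    # Adaptation law driven by the prediction error.
    sigma_hat += dt * (-gamma * (x_hat - x))
    # Low-pass filtered control: cancel sigma_hat, track the reference input r.
    u_cmd = r - sigma_hat
    u_filt += dt * k_filt * (u_cmd - u_filt)
    return x_hat, u_filt, sigma_hat

x = x_hat = u_filt = sigma_hat = 0.0
for t in range(1000):
    sigma = 0.5 * np.sin(0.01 * t)   # unknown, slowly varying disturbance
    r = 1.0                          # placeholder for the MPC-computed reference input
    x_hat, u_filt, sigma_hat = l1_step(x, x_hat, u_filt, sigma_hat, r)
    x += dt * (am * x + b * (u_filt + sigma))   # true plant with disturbance
```

Because the inner loop keeps the plant close to the known reference model regardless of the disturbance, the MPC layer can optimize over that model alone while still respecting input and state constraints.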