Abstract: When planning with an inaccurate dynamics model, a practical strategy is to restrict planning to regions of state-action space where the model is accurate, also known as a model precondition. Empirical real-world trajectory data is valuable for defining data-driven model preconditions regardless of the model form (analytical, simulator, learned, etc.). However, real-world data is often expensive and dangerous to collect. To achieve data efficiency, this paper presents an algorithm for actively selecting trajectories to learn a model precondition for an inaccurate pre-specified dynamics model. Our proposed techniques address challenges arising from the sequential nature of trajectories and the potential benefit of prioritizing task-relevant data. The experimental analysis shows how algorithmic properties affect performance in three planning scenarios: an icy gridworld, simulated plant watering, and real-world plant watering. Results demonstrate an improvement of approximately 80% after only four real-world trajectories when using our proposed techniques.
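The abstract does not spell out the acquisition criterion, so the following is only a minimal sketch of the core loop under stated assumptions: a Gaussian-process model-deviation estimator defines the precondition, and candidate trajectories are ranked by the estimator's summed predictive uncertainty. The feature encoding, the tolerance `tol`, and the uncertainty-sum acquisition score are illustrative assumptions, not the paper's method.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

# Stand-in deviation dataset: (state, action) features -> observed deviation
# between the pre-specified model's prediction and the real next state.
X = np.random.rand(20, 4)
y = np.random.rand(20) * 0.1

mde = GaussianProcessRegressor().fit(X, y)  # model-deviation estimator

def in_precondition(sa, tol=0.05):
    """(state, action) lies in the model precondition when the estimated
    deviation is below a task-specific tolerance (tol is an assumption)."""
    return mde.predict(sa.reshape(1, -1))[0] < tol

def select_trajectory(candidates):
    """Active selection: prefer the candidate trajectory whose transitions
    the deviation estimator is most uncertain about."""
    def score(traj):
        _, std = mde.predict(np.asarray(traj), return_std=True)
        return std.sum()
    return max(candidates, key=score)
```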
Abstract: In order to efficiently learn a dynamics model for a task in a new environment, one can adapt a model learned in a similar source environment. However, existing adaptation methods can fail when the target dataset contains transitions where the dynamics are very different from the source environment. For example, the source environment dynamics could be of a rope manipulated in free space, whereas the target dynamics could involve collisions and deformation on obstacles. Our key insight is to improve data efficiency by focusing model adaptation on only the regions where the source and target dynamics are similar. In the rope example, adapting the free-space dynamics requires significantly less data than adapting the free-space dynamics while also learning collision dynamics. We propose a new method for adaptation that is effective in adapting to regions of similar dynamics. Additionally, we combine this adaptation method with prior work on planning with unreliable dynamics to create FOCUS, a method for data-efficient online adaptation. We first demonstrate that the proposed adaptation method achieves statistically significantly lower prediction error in regions of similar dynamics on simulated rope manipulation and plant watering tasks. We then show on a bimanual rope manipulation task that FOCUS achieves data-efficient online learning, in simulation and in the real world.
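One way to realize "focusing adaptation on regions of similar dynamics" is a soft weighting that downweights high-error transitions during fine-tuning. The sketch below assumes a PyTorch dynamics model and hypothetical hyperparameters `gamma` (error threshold) and `k` (sharpness); the actual weighting used by FOCUS may differ.

```python
import torch

def focused_adaptation_loss(model, states, actions, next_states,
                            gamma=0.1, k=50.0):
    """Weight each transition by how consistent it is with the adapting
    model: transitions with large prediction error (likely novel dynamics,
    e.g. collisions) are softly excluded so adaptation stays focused on
    regions where source and target dynamics are similar."""
    pred = model(states, actions)
    err = torch.norm(pred - next_states, dim=-1)
    w = torch.sigmoid(-k * (err - gamma)).detach()  # ~1 below gamma, ~0 above
    return (w * err).sum() / w.sum().clamp(min=1e-6)
```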
Abstract: Different models can provide differing levels of fidelity when a robot is planning. Analytical models are often fast to evaluate but only work in limited ranges of conditions. Meanwhile, physics simulators are effective at modeling complex interactions between objects but are typically more computationally expensive. Learning when to switch between the various models can greatly improve planning speed and the reliability of task success. In this work, we learn model deviation estimators (MDEs) to predict the error between real-world states and the states predicted by transition models. MDEs can be used to define a model precondition that describes which transitions are accurately modeled. We then propose a planner that uses the learned model preconditions to switch between models, using each model only in conditions where it is accurate and prioritizing faster models when possible. We evaluate our method on two real-world tasks: placing a rod into a box and placing a rod into a closed drawer.
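A minimal sketch of the model-switching rule implied here: try models in order of increasing cost and use the first whose MDE-defined precondition holds. The sklearn-style `predict` interface and the tolerance are assumptions for illustration.

```python
def choose_model(state, action, models, mdes, tol=0.02):
    """models are ordered fastest-to-slowest; mdes[i] estimates the deviation
    of models[i] on a given transition. Return the cheapest model whose
    precondition (estimated deviation below tol) holds."""
    x = [list(state) + list(action)]
    for model, mde in zip(models, mdes):
        if mde.predict(x)[0] < tol:
            return model
    return models[-1]  # no precondition holds: fall back to highest fidelity
```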
Abstract: This paper describes an integrated solution to the problem of describing and interpreting goals for robots in open, uncertain domains. Given a formal specification of a desired situation, in which objects are described only by their properties, general-purpose planning and reasoning tools are used to derive appropriate actions for a robot. These goals are achieved through an online combination of hierarchical planning, state estimation, and execution that operates robustly in real robot domains with substantial occlusion and sensing error.
Abstract: Lifelong-learning robots need to be able to acquire new skills and plan for new tasks over time. Prior works on planning with skills often make assumptions about the structure of skills and tasks, such as subgoal skills, shared skill implementations, or task-specific plan skeletons, which limit their application to new and different skills and tasks. By contrast, we propose performing task planning by jointly searching in the space of skills and their parameters with skill effect models learned in simulation. Our approach is flexible about skill parameterizations and task specifications, and we use an iterative training procedure to efficiently generate relevant data to train such models. Experiments demonstrate the ability of our planner to integrate new skills in a lifelong manner, finding new task strategies with lower costs on both training and test tasks. We additionally show that our method can transfer to the real world without further fine-tuning.
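A joint search over skills and continuous parameters can be sketched as best-first search in which successors come from learned skill-effect models. The `sample_parameters` and `predict` interfaces and the goal-cost threshold below are hypothetical, chosen only to make the idea concrete.

```python
import heapq
import itertools

def plan(start, goal_cost, skills, effect_models,
         max_expansions=1000, samples=16):
    """Best-first search jointly over discrete skills and sampled continuous
    parameters; learned skill-effect models predict successor states."""
    tie = itertools.count()  # tie-breaker so states are never compared
    frontier = [(goal_cost(start), next(tie), start, [])]
    for _ in range(max_expansions):
        if not frontier:
            break
        cost, _, state, partial_plan = heapq.heappop(frontier)
        if cost < 1e-3:  # close enough to the goal (threshold is assumed)
            return partial_plan
        for skill in skills:
            for params in skill.sample_parameters(samples):
                nxt = effect_models[skill.name].predict(state, params)
                heapq.heappush(frontier, (goal_cost(nxt), next(tie), nxt,
                                          partial_plan + [(skill.name, params)]))
    return None
```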
Abstract: Humans leverage the dynamics of the environment and their own bodies to accomplish challenging tasks such as grasping an object while walking past it or pushing off a wall to turn a corner. Such tasks often involve switching dynamics as the robot makes and breaks contact. Learning these dynamics is a challenging problem and prone to model inaccuracies, especially near contact regions. In this work, we present a framework for learning composite dynamical behaviors from expert demonstrations. We learn a switching linear dynamical model, with contacts encoded in the switching conditions, as a close approximation of our system dynamics. We then use discrete-time LQR as a differentiable policy class for data-efficient learning of control, developing a control strategy that operates over multiple dynamical modes and takes discontinuities due to contact into account. In addition to predicting interactions with the environment, our policy effectively reacts to inaccurate predictions such as unanticipated contacts. Through simulation and real-world experiments, we demonstrate generalization of learned behaviors to different scenarios and robustness to model inaccuracies during execution.
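The pairing of a switching linear model with per-mode discrete-time LQR can be sketched as follows. The mode structure, one hand-specified `(A, B, condition)` triple per contact mode, is a simplifying assumption; the paper learns the model and switching conditions from demonstrations.

```python
import numpy as np
from scipy.linalg import solve_discrete_are

def lqr_gain(A, B, Q, R):
    """Discrete-time LQR: gain K such that u = -K x minimizes the
    infinite-horizon quadratic cost with weights Q, R."""
    P = solve_discrete_are(A, B, Q, R)
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

class SwitchingLQR:
    """One linear model (A, B) and LQR gain per contact mode; a switching
    condition on the state selects the active mode at run time."""
    def __init__(self, modes, Q, R):
        # modes: list of (A, B, condition) with condition(x) -> bool
        self.modes = [(cond, lqr_gain(A, B, Q, R)) for A, B, cond in modes]

    def control(self, x):
        for cond, K in self.modes:
            if cond(x):  # first matching switching condition wins
                return -K @ x
        raise ValueError("no dynamical mode matched the state")
```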
Abstract: Manipulation tasks can often be decomposed into multiple subtasks performed in parallel, e.g., sliding an object to a goal pose while maintaining contact with a table. Individual subtasks can be achieved by task-axis controllers defined relative to the objects being manipulated, and a set of object-centric controllers can be combined in a hierarchy. In prior work, such combinations are defined manually or learned from demonstrations. By contrast, we propose using reinforcement learning to dynamically compose hierarchical object-centric controllers for manipulation tasks. Experiments in both simulation and the real world show how the proposed approach leads to improved sample efficiency, zero-shot generalization to novel test environments, and simulation-to-reality transfer without fine-tuning.
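One simple way to compose prioritized task-axis controllers is null-space projection, with an RL policy then choosing which ordered set of controllers to apply at each step. This 3-D position-only composition is a simplification for illustration; real controllers also handle orientation, force, and non-orthogonal axes.

```python
import numpy as np

def compose_hierarchy(controllers, state):
    """Combine object-centric task-axis controllers by priority: each
    lower-priority command is projected into the null space of the axes
    already claimed by higher-priority controllers."""
    u = np.zeros(3)
    N = np.eye(3)                 # null space still available to lower priorities
    for ctrl in controllers:      # ordered from highest to lowest priority
        axis, cmd = ctrl(state)   # unit task axis and scalar command along it
        u += N @ (axis * cmd)
        N = N @ (np.eye(3) - np.outer(axis, axis))  # remove the claimed axis
    return u
```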
Abstract: Planners using accurate models can be effective for accomplishing manipulation tasks in the real world, but are typically highly specialized and require significant fine-tuning to be reliable. Meanwhile, learning is useful for adaptation, but can require a substantial amount of data collection. In this paper, we propose a method that improves the efficiency of sub-optimal planners with approximate but simple and fast models by switching to a model-free policy when unexpected transitions are observed. Unlike previous work, our method specifically addresses planner failures caused by transition-model error, patching with a local policy only where needed. First, we use a sub-optimal model-based planner to perform a task until model failure is detected. Next, we learn a local model-free policy from expert demonstrations to complete the task in regions where the model failed. To show the efficacy of our method, we perform experiments with a shape-insertion puzzle and compare our results to both pure planning and imitation learning approaches. We then apply our method to a door-opening task. Our experiments demonstrate that our patch-enhanced planner performs more reliably than pure planning and with lower overall sample complexity than pure imitation learning.
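The plan-until-failure-then-patch loop can be sketched as below; the `env`, `planner`, `model`, and `local_policy` interfaces and the deviation threshold are hypothetical stand-ins for whatever the system actually uses to detect model failure.

```python
import numpy as np

def plan_then_patch(env, planner, model, local_policy,
                    deviation_tol=0.1, horizon=200):
    """Follow the model-based planner until an observed transition deviates
    from the model's prediction, then hand control to the local model-free
    policy to patch the region where the model fails."""
    state = env.reset()
    patched = False
    for _ in range(horizon):
        if patched:
            action = local_policy(state)
        else:
            action = planner.next_action(state)
            predicted = model.predict(state, action)
        state, done = env.step(action)
        if done:
            return True
        if not patched and np.linalg.norm(state - predicted) > deviation_tol:
            patched = True  # model failure detected: switch to the patch policy
    return False
```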