Maja Mataric

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024

Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed (+40 more)

Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for only as long as it fits within the context size of the LLM, and can be forgotten over longer interactions. In this work, we investigate fine-tuning the robot code-writing LLMs to remember their in-context interactions and improve their teachability, i.e., how efficiently they adapt to human inputs (measured by the average number of corrections before the user considers the task successful). Our key observation is that when human-robot interactions are formulated as a partially observable Markov decision process (in which human language inputs are observations, and robot code outputs are actions), then training an LLM to complete previous interactions can be viewed as training a transition dynamics model -- that can be combined with classic robotics techniques such as model predictive control (MPC) to discover shorter paths to success. This gives rise to Language Model Predictive Control (LMPC), a framework that fine-tunes PaLM 2 to improve its teachability on 78 tasks across 5 robot embodiments -- improving non-expert teaching success rates of unseen tasks by 26.9% while reducing the average number of human corrections from 2.4 to 1.9. Experiments show that LMPC also produces strong meta-learners, improving the success rate of in-context learning new tasks on unseen robot embodiments and APIs by 31.5%. See videos, code, and demos at: https://robot-teaching.github.io/.
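The abstract's central idea, using a learned model of the interaction as a dynamics model inside an MPC loop, can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_rollout` is a hypothetical stand-in for sampling a predicted interaction from a fine-tuned LLM, and the names `mpc_next_action`, `num_samples`, and `horizon` are assumptions made for illustration. The sketch samples several predicted futures, keeps the shortest one predicted to succeed, and returns only its first robot-code action, as in receding-horizon control.

```python
import random
from typing import Callable, List, Optional, Tuple

# A rollout is (list of robot-code actions, predicted success flag).
Rollout = Tuple[List[str], bool]
Sampler = Callable[[str, random.Random, int], Rollout]


def toy_rollout(instruction: str, rng: random.Random, horizon: int) -> Rollout:
    """Hypothetical stand-in for a fine-tuned LLM used as a dynamics model.

    It "predicts" how many correction turns an interaction takes and
    whether it ends in success. A real system would sample text instead.
    """
    num_turns = rng.randint(1, horizon)
    actions = [f"# attempt {i + 1} for: {instruction}" for i in range(num_turns)]
    success = rng.random() < 0.7
    return actions, success


def mpc_next_action(
    instruction: str,
    sample_rollout: Sampler,
    num_samples: int = 16,
    horizon: int = 5,
    seed: int = 0,
) -> Optional[str]:
    """Pick the first action of the shortest rollout predicted to succeed.

    Returns None when no sampled rollout is predicted to succeed.
    """
    rng = random.Random(seed)
    best: Optional[List[str]] = None
    for _ in range(num_samples):
        actions, success = sample_rollout(instruction, rng, horizon)
        if success and actions and (best is None or len(actions) < len(best)):
            best = actions
    return best[0] if best is not None else None


if __name__ == "__main__":
    # Execute only the first action, then replan after new human feedback.
    print(mpc_next_action("wave the arm", toy_rollout))
```

Selecting by rollout length mirrors the teachability metric in the abstract (fewer corrections before success); any other cost could be substituted in the comparison.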

Using Design Metaphors to Understand User Expectations of Socially Interactive Robot Embodiments

Jan 25, 2022

Nathaniel Dennler, Changxiao Ruan, Jessica Hadiwijoyo, Brenna Chen, Stefanos Nikolaidis, Maja Mataric

Abstract: The physical design of a robot suggests expectations of that robot's functionality for human users and collaborators. When those expectations align with the true capabilities of the robot, interaction with the robot is enhanced. However, misalignment of those expectations can result in an unsatisfying interaction. This paper uses Mechanical Turk to evaluate user expectation through the use of design metaphors as applied to a wide range of robot embodiments. The first study (N=382) associates crowd-sourced design metaphors with different robot embodiments. The second study (N=803) assesses initial social expectations of robot embodiments. The final study (N=805) addresses the degree of abstraction of the design metaphors and the functional expectations projected on robot embodiments. Together, these results can guide robot designers toward aligning user expectations with true robot capabilities, facilitating positive human-robot interaction.

* 33 pages, 16 figures, 6 tables

Embodiment in Socially Interactive Robots

Dec 01, 2019

Eric Deng, Bilge Mutlu, Maja Mataric

Abstract: Physical embodiment is a required component for robots that are structurally coupled with their real-world environments. However, most socially interactive robots do not need to physically interact with their environments in order to perform their tasks. When and why should embodied robots be used instead of simpler and cheaper virtual agents? This paper reviews the existing work that explores the role of physical embodiment in socially interactive robots. This class consists of robots that are not only capable of engaging in social interaction with humans, but also primarily use their social capabilities to perform their desired functions. Socially interactive robots provide entertainment, information, and/or assistance; this last category is typically encompassed by socially assistive robotics. In all cases, such robots can achieve their primary functions without performing functional physical work. To comprehensively evaluate the existing body of work on embodiment, we first review work from established related fields including psychology, philosophy, and sociology. We then systematically review 65 studies evaluating aspects of embodiment published from 2003 to 2017 in major peer-reviewed robotics publication venues. We examine relevant aspects of the selected studies, focusing on the embodiments compared, tasks evaluated, social roles of robots, and measurements. We introduce three taxonomies for the types of robot embodiment, robot social roles, and human-robot tasks. These taxonomies are used to deconstruct the design and interaction spaces of socially interactive robots and facilitate analysis and discussion of the reviewed studies. We use this newly defined methodology to critically discuss existing works, revealing topics within embodiment research for social interaction, assistive robotics, and service robotics.

* Foundations and Trends in Robotics: Vol. 7: No. 4, pp 251-356 (2019)
* The official publication is available from now publishers via https://www.nowpublishers.com/article/Details/ROB-056

Predicting Infant Motor Development Status using Day Long Movement Data from Wearable Sensors

Oct 14, 2018

David Goodfellow, Ruoyu Zhi, Rebecca Funke, Jose Carlos Pulido, Maja Mataric, Beth A. Smith

Abstract: Infants with a variety of complications at or before birth are classified as being at risk for developmental delays (AR). As they grow older, they are followed by healthcare providers in an effort to discern whether they are on a typical or impaired developmental trajectory. Often, it is difficult to make an accurate determination early in infancy, as infants with typical development (TD) display high variability in their developmental trajectories, both in content and timing. Studies have shown that spontaneous movements have the potential to differentiate typical and atypical trajectories early in life using sensors and kinematic analysis systems. In this study, machine learning classification algorithms are used to take inertial movement data from wearable sensors placed on an infant for a day and predict whether the infant is AR or TD, thus further establishing the connection between early spontaneous movement and developmental trajectory.

* 4 pages, KDD Machine Learning and Healthcare Workshop August 2018. This work was funded in part by the American Physical Therapy Association Academy of Pediatric Physical Therapy Research Grant 1 and 2 Awards (PI: Smith) and in part by NSF award 1706964 (PI: Smith, Co-PI: Matarić)

Advances in Artificial Intelligence Require Progress Across all of Computer Science

Jul 13, 2017

Gregory D. Hager, Randal Bryant, Eric Horvitz, Maja Mataric, Vasant Honavar

Abstract: Advances in Artificial Intelligence require progress across all of computer science.

* 7 pages, Computing Community Consortium White Paper

Next Generation Robotics

Jun 29, 2016

Henrik I Christensen, Allison Okamura, Maja Mataric, Vijay Kumar, Greg Hager, Howie Choset

Abstract: The National Robotics Initiative (NRI) was launched in 2011 and is about to celebrate its five-year anniversary. In parallel with the NRI, the robotics community, with support from the Computing Community Consortium, engaged in a series of road-mapping exercises. The first version of the roadmap appeared in September 2009; a second updated version appeared in 2013. While not directly aligned with the NRI, these road-mapping documents have provided both a useful charting of the robotics research space and a metric by which to measure progress. This report sets forth a perspective of progress in robotics over the past five years, and provides a set of recommendations for the future. The NRI has in its formulation a strong emphasis on co-robots, i.e., robots that work directly with people. An obvious question is whether this should continue to be the focus going forward. To assess the main trends, what has happened over the last five years, and what may be promising directions for the future, a small CCC-sponsored study was launched with two workshops, one in Washington DC (March 5th, 2016) and another in San Francisco, CA (March 11th, 2016). In this report we briefly summarize some of the main discussions and observations from those workshops. We will present a variety of background information in Section 2, and outline various issues related to progress over the last five years in Section 3. In Section 4 we will outline a number of opportunities for moving forward. Finally, we will summarize the main points in Section 5.

* A Computing Community Consortium (CCC) white paper, 22 pages
