Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kush Desai

Prosody as a Teaching Signal for Agent Learning: Exploratory Studies and Algorithmic Implications

Oct 31, 2024

Matilda Knierim, Sahil Jain, Murat Han Aydoğan, Kenneth Mitra, Kush Desai, Akanksha Saran, Kim Baraka

Figure 1 for Prosody as a Teaching Signal for Agent Learning: Exploratory Studies and Algorithmic Implications

Figure 2 for Prosody as a Teaching Signal for Agent Learning: Exploratory Studies and Algorithmic Implications

Figure 3 for Prosody as a Teaching Signal for Agent Learning: Exploratory Studies and Algorithmic Implications

Figure 4 for Prosody as a Teaching Signal for Agent Learning: Exploratory Studies and Algorithmic Implications

Abstract:Agent learning from human interaction often relies on explicit signals, but implicit social cues, such as prosody in speech, could provide valuable information for more effective learning. This paper advocates for the integration of prosody as a teaching signal to enhance agent learning from human teachers. Through two exploratory studies--one examining voice feedback in an interactive reinforcement learning setup and the other analyzing restricted audio from human demonstrations in three Atari games--we demonstrate that prosody carries significant information about task dynamics. Our findings suggest that prosodic features, when coupled with explicit feedback, can enhance reinforcement learning outcomes. Moreover, we propose guidelines for prosody-sensitive algorithm design and discuss insights into teaching behavior. Our work underscores the potential of leveraging prosody as an implicit signal for more efficient agent learning, thus advancing human-agent interaction paradigms.

* Published at the 26th ACM International Conference on Multimodal Interaction (ICMI) 2024

Via

Access Paper or Ask Questions

Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Nov 01, 2022

Akanksha Saran, Kush Desai, Mai Lee Chang, Rudolf Lioutikov, Andrea Thomaz, Scott Niekum

Figure 1 for Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Figure 2 for Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Figure 3 for Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Figure 4 for Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

Abstract:Humans use audio signals in the form of spoken language or verbal reactions effectively when teaching new skills or tasks to other humans. While demonstrations allow humans to teach robots in a natural way, learning from trajectories alone does not leverage other available modalities including audio from human teachers. To effectively utilize audio cues accompanying human demonstrations, first it is important to understand what kind of information is present and conveyed by such cues. This work characterizes audio from human teachers demonstrating multi-step manipulation tasks to a situated Sawyer robot using three feature types: (1) duration of speech used, (2) expressiveness in speech or prosody, and (3) semantic content of speech. We analyze these features along four dimensions and find that teachers convey similar semantic concepts via spoken words for different conditions of (1) demonstration types, (2) audio usage instructions, (3) subtasks, and (4) errors during demonstrations. However, differentiating properties of speech in terms of duration and expressiveness are present along the four dimensions, highlighting that human audio carries rich information, potentially beneficial for technological advancement of robot learning from demonstration methods.

* IROS 2022

Via

Access Paper or Ask Questions