Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ian Steenstra

Scaffolding Empathy: Training Counselors with Simulated Patients and Utterance-level Performance Visualizations

Feb 25, 2025

Ian Steenstra, Farnaz Nouraei, Timothy W. Bickmore

Abstract:Learning therapeutic counseling involves significant role-play experience with mock patients, with current manual training methods providing only intermittent granular feedback. We seek to accelerate and optimize counselor training by providing frequent, detailed feedback to trainees as they interact with a simulated patient. Our first application domain involves training motivational interviewing skills for counselors. Motivational interviewing is a collaborative counseling style in which patients are guided to talk about changing their behavior, with empathetic counseling an essential ingredient. We developed and evaluated an LLM-powered training system that features a simulated patient and visualizations of turn-by-turn performance feedback tailored to the needs of counselors learning motivational interviewing. We conducted an evaluation study with professional and student counselors, demonstrating high usability and satisfaction with the system. We present design implications for the development of automated systems that train users in counseling skills and their generalizability to other types of social skills training.

* This is a preprint version of the paper conditionally accepted to CHI'25

Via

Access Paper or Ask Questions

Virtual Agents for Alcohol Use Counseling: Exploring LLM-Powered Motivational Interviewing

Jul 10, 2024

Ian Steenstra, Farnaz Nouraei, Mehdi Arjmand, Timothy W. Bickmore

Figure 1 for Virtual Agents for Alcohol Use Counseling: Exploring LLM-Powered Motivational Interviewing

Figure 2 for Virtual Agents for Alcohol Use Counseling: Exploring LLM-Powered Motivational Interviewing

Figure 3 for Virtual Agents for Alcohol Use Counseling: Exploring LLM-Powered Motivational Interviewing

Figure 4 for Virtual Agents for Alcohol Use Counseling: Exploring LLM-Powered Motivational Interviewing

Abstract:We introduce a novel application of large language models (LLMs) in developing a virtual counselor capable of conducting motivational interviewing (MI) for alcohol use counseling. Access to effective counseling remains limited, particularly for substance abuse, and virtual agents offer a promising solution by leveraging LLM capabilities to simulate nuanced communication techniques inherent in MI. Our approach combines prompt engineering and integration into a user-friendly virtual platform to facilitate realistic, empathetic interactions. We evaluate the effectiveness of our virtual agent through a series of studies focusing on replicating MI techniques and human counselor dialog. Initial findings suggest that our LLM-powered virtual agent matches human counselors' empathetic and adaptive conversational skills, presenting a significant step forward in virtual health counseling and providing insights into the design and implementation of LLM-based therapeutic interactions.

Via

Access Paper or Ask Questions

Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents

Jul 01, 2024

Mehdi Arjmand, Farnaz Nouraei, Ian Steenstra, Timothy Bickmore

Abstract:We introduce the concept of "empathic grounding" in conversational agents as an extension of Clark's conceptualization of grounding in conversation in which the grounding criterion includes listener empathy for the speaker's affective state. Empathic grounding is generally required whenever the speaker's emotions are foregrounded and can make the grounding process more efficient and reliable by communicating both propositional and affective understanding. Both speaker expressions of affect and listener empathic grounding can be multimodal, including facial expressions and other nonverbal displays. Thus, models of empathic grounding for embodied agents should be multimodal to facilitate natural and efficient communication. We describe a multimodal model that takes as input user speech and facial expression to generate multimodal grounding moves for a listening agent using a large language model. We also describe a testbed to evaluate approaches to empathic grounding, in which a humanoid robot interviews a user about a past episode of pain and then has the user rate their perception of the robot's empathy. We compare our proposed model to one that only generates non-affective grounding cues in a between-subjects experiment. Findings demonstrate that empathic grounding increases user perceptions of empathy, understanding, emotional intelligence, and trust. Our work highlights the role of emotion awareness and multimodality in generating appropriate grounding moves for conversational agents.

Via

Access Paper or Ask Questions

Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Jul 20, 2020

Xiangyang Mou, Brandyn Sigouin, Ian Steenstra, Hui Su

Figure 1 for Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Figure 2 for Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Figure 3 for Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Figure 4 for Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Abstract:Recently, a more challenging state tracking task, Audio-Video Scene-Aware Dialogue (AVSD), is catching an increasing amount of attention among researchers. Different from purely text-based dialogue state tracking, the dialogue in AVSD contains a sequence of question-answer pairs about a video and the final answer to the given question requires additional understanding of the video. This paper interprets the AVSD task from an open-domain Question Answering (QA) point of view and proposes a multimodal open-domain QA system to deal with the problem. The proposed QA system uses common encoder-decoder framework with multimodal fusion and attention. Teacher forcing is applied to train a natural language generator. We also propose a new data augmentation approach specifically under QA assumption. Our experiments show that our model and techniques bring significant improvements over the baseline model on the DSTC7-AVSD dataset and demonstrate the potentials of our data augmentation techniques.

* AAAI DSTC8 Workshop

Via

Access Paper or Ask Questions