Picture for Gabriel Skantze

Gabriel Skantze

Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection

Add code
Oct 21, 2024
Figure 1 for Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
Figure 2 for Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
Figure 3 for Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
Figure 4 for Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
Viaarxiv icon

Perception of Emotions in Human and Robot Faces: Is the Eye Region Enough?

Add code
Oct 18, 2024
Viaarxiv icon

Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding

Add code
Sep 09, 2024
Viaarxiv icon

Joint Learning of Context and Feedback Embeddings in Spoken Dialogue

Add code
Jun 11, 2024
Viaarxiv icon

Human-Robot Interaction Conversational User Enjoyment Scale (HRI CUES)

Add code
May 02, 2024
Viaarxiv icon

Multilingual Turn-taking Prediction Using Voice Activity Projection

Add code
Mar 14, 2024
Viaarxiv icon

An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems

Add code
Jan 23, 2024
Viaarxiv icon

Real-time and Continuous Turn-taking Prediction Using Voice Activity Projection

Add code
Jan 10, 2024
Viaarxiv icon

Resolving References in Visually-Grounded Dialogue via Text Generation

Add code
Sep 23, 2023
Viaarxiv icon

Collecting Visually-Grounded Dialogue with A Game Of Sorts

Add code
Sep 10, 2023
Viaarxiv icon