Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonah Lubin

Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Jul 15, 2022

Sujeong Kim, Abhinav Garlapati, Jonah Lubin, Amir Tamrakar, Ajay Divakaran

Figure 1 for Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Figure 2 for Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Figure 3 for Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Figure 4 for Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

Abstract:We present a series of two studies conducted to understand user's affective states during voice-based human-machine interactions. Emphasis is placed on the cases of communication errors or failures. In particular, we are interested in understanding "confusion" in relation with other affective states. The studies consist of two types of tasks: (1) related to communication with a voice-based virtual agent: speaking to the machine and understanding what the machine says, (2) non-communication related, problem-solving tasks where the participants solve puzzles and riddles but are asked to verbally explain the answers to the machine. We collected audio-visual data and self-reports of affective states of the participants. We report results of two studies and analysis of the collected data. The first study was analyzed based on the annotator's observation, and the second study was analyzed based on the self-report.

* 2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)

Via

Access Paper or Ask Questions

Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

Apr 19, 2019

Julia Kruk, Jonah Lubin, Karan Sikka, Xiao Lin, Dan Jurafsky, Ajay Divakaran

Figure 1 for Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

Figure 2 for Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

Figure 3 for Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

Figure 4 for Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

Abstract:Computing author intent from multimodal data like Instagram posts requires modeling a complex relationship between text and image. For example a caption might reflect ironically on the image, so neither the caption nor the image is a mere transcript of the other. Instead they combine -- via what has been called meaning multiplication -- to create a new meaning that has a more complex relation to the literal meanings of text and image. Here we introduce a multimodal dataset of 1299 Instagram post labeled for three orthogonal taxonomies: the authorial intent behind the image-caption pair, the contextual relationship between the literal meanings of the image and caption, and the semiotic relationship between the signified meanings of the image and caption. We build a baseline deep multimodal classifier to validate the taxonomy, showing that employing both text and image improves intent detection by 8% compared to using only image modality, demonstrating the commonality of non-intersective meaning multiplication. Our dataset offers an important resource for the study of the rich meanings that results from pairing text and image.

Via

Access Paper or Ask Questions