Md Iftekhar Tanveer

Predicting TED Talk Ratings from Language and Prosody

May 21, 2019

Md Iftekhar Tanveer, Md Kamrul Hasan, Daniel Gildea, M. Ehsan Hoque

Abstract: We use TED Talks, the largest open repository of public speaking, to predict the ratings given by online viewers. Our dataset contains over 2200 TED Talk transcripts (over 200 thousand sentences), audio features, and the associated meta information, including about 5.5 million ratings from spontaneous visitors to the website. We propose three neural network architectures and compare them with statistical machine learning methods. Our experiments show that all 14 ratings can be predicted with an average AUC of 0.83 using only the transcript and prosody features. The dataset and the complete source code are available for further analysis.
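
As an illustration of the "average AUC" figure quoted in the abstract, the sketch below computes a per-rating AUC and then averages across ratings. It is not the authors' evaluation code. It assumes each rating is treated as a separate binary label, which the abstract does not spell out, and the rating names, labels, scores, and function names are all hypothetical.

```python
from typing import Dict, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_auc(
    labels_by_rating: Dict[str, Sequence[int]],
    scores_by_rating: Dict[str, Sequence[float]],
) -> float:
    """Unweighted mean of the per-rating AUC values."""
    aucs = [
        binary_auc(labels_by_rating[name], scores_by_rating[name])
        for name in labels_by_rating
    ]
    return sum(aucs) / len(aucs)


# Hypothetical toy data: two rating categories, four talks each.
toy_labels = {"funny": [1, 0, 1, 0], "inspiring": [1, 1, 0, 0]}
toy_scores = {"funny": [0.9, 0.2, 0.4, 0.6], "inspiring": [0.8, 0.7, 0.3, 0.1]}
print(average_auc(toy_labels, toy_scores))
```

The pairwise loop is quadratic in the number of talks, which is fine for a few thousand examples; a rank-based formulation would be the usual choice for larger sets.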

* arXiv admin note: substantial text overlap with arXiv:1905.08392

A Causality-Guided Prediction of the TED Talk Ratings from the Speech-Transcripts using Neural Networks

May 21, 2019

Md Iftekhar Tanveer, Md Kamrul Hasan, Daniel Gildea, M. Ehsan Hoque

Abstract: Automated prediction of public speaking performance enables novel systems for tutoring public speaking skills. We use TED Talks, the largest open repository of public speaking, to predict the ratings provided by online viewers. The dataset contains over 2200 talk transcripts and the associated meta information, including over 5.5 million ratings from spontaneous visitors to the website. We carefully remove the bias present in the dataset (e.g., the speakers' reputations, popularity gained by publicity, etc.) by modeling the data-generating process with a causal diagram. We use a word-sequence-based recurrent architecture and a dependency-tree-based recursive architecture as the neural networks for predicting the TED talk ratings. Our neural network models predict the ratings with an average F-score of 0.77, substantially outperforming the competitive baseline method.
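
For readers unfamiliar with the metric, the sketch below shows one common way an "average F-score" can be computed: an F1 score per rating from binary predictions, followed by an unweighted mean. This is an assumption about the averaging scheme, not something the abstract states, and the rating names, data, and function names are hypothetical.

```python
from typing import Dict, Sequence


def f1_score(labels: Sequence[int], preds: Sequence[int]) -> float:
    """F1 = harmonic mean of precision and recall for the positive class."""
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined),
        # so report 0.0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def average_f1(
    labels_by_rating: Dict[str, Sequence[int]],
    preds_by_rating: Dict[str, Sequence[int]],
) -> float:
    """Unweighted (macro) mean of the per-rating F1 scores."""
    scores = [
        f1_score(labels_by_rating[name], preds_by_rating[name])
        for name in labels_by_rating
    ]
    return sum(scores) / len(scores)


# Hypothetical toy data: two rating categories, four talks each.
f1_labels = {"persuasive": [1, 1, 0, 0], "informative": [1, 0, 1, 0]}
f1_preds = {"persuasive": [1, 0, 1, 0], "informative": [1, 0, 1, 0]}
print(average_f1(f1_labels, f1_preds))
```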

UR-FUNNY: A Multimodal Language Dataset for Understanding Humor

Apr 14, 2019

Md Kamrul Hasan, Wasifur Rahman, Amir Zadeh, Jianyuan Zhong, Md Iftekhar Tanveer, Louis-Philippe Morency, Mohammed (Ehsan) Hoque

Abstract: Humor is a unique and creative communicative behavior displayed during social interactions. It is produced in a multimodal manner, through the use of words (text), gestures (vision), and prosodic cues (acoustic). Understanding humor from these three modalities falls within the scope of multimodal language, a recent research trend in natural language processing that models natural language as it happens in face-to-face communication. Although humor detection is an established research area in NLP, it remains understudied in a multimodal context. This paper presents a diverse multimodal dataset, called UR-FUNNY, to open the door to understanding the multimodal language used in expressing humor. The dataset and accompanying studies present a framework for multimodal humor detection for the natural language processing community. UR-FUNNY is publicly available for research.
