Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Prahal Arora

VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

May 20, 2021

Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer

Figure 1 for VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

Figure 2 for VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

Figure 3 for VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

Figure 4 for VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

Abstract:We present a simplified, task-agnostic multi-modal pre-training approach that can accept either video or text input, or both for a variety of end tasks. Existing pre-training are task-specific by adopting either a single cross-modal encoder that requires both modalities, limiting their use for retrieval-style end tasks or more complex multitask learning with two unimodal encoders, limiting early cross-modal fusion. We instead introduce new pretraining masking schemes that better mix across modalities (e.g. by forcing masks for text to predict the closest video embeddings) while also maintaining separability (e.g. unimodal predictions are sometimes required, without using all the input). Experimental results show strong performance across a wider range of tasks than any previous methods, often outperforming task-specific pre-training.

* 9 pages, ACL Findings 2021

Via

Access Paper or Ask Questions

Sarcasm Detection using Hybrid Neural Network

Aug 20, 2019

Rishabh Misra, Prahal Arora

Figure 1 for Sarcasm Detection using Hybrid Neural Network

Figure 2 for Sarcasm Detection using Hybrid Neural Network

Figure 3 for Sarcasm Detection using Hybrid Neural Network

Figure 4 for Sarcasm Detection using Hybrid Neural Network

Abstract:Sarcasm Detection has enjoyed great interest from the research community, however the task of predicting sarcasm in a text remains an elusive problem for machines. Past studies mostly make use of twitter datasets collected using hashtag based supervision but such datasets are noisy in terms of labels and language. To overcome these shortcoming, we introduce a new dataset which contains news headlines from a sarcastic news website and a real news website. Next, we propose a hybrid Neural Network architecture with attention mechanism which provides insights about what actually makes sentences sarcastic. Through experiments, we show that the proposed model improves upon the baseline by ~ 5% in terms of classification accuracy.

Via

Access Paper or Ask Questions