Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vineet John

Generating lyrics with variational autoencoder and multi-modal artist embeddings

Dec 20, 2018

Olga Vechtomova, Hareesh Bahuleyan, Amirpasha Ghabussi, Vineet John

Figure 1 for Generating lyrics with variational autoencoder and multi-modal artist embeddings

Figure 2 for Generating lyrics with variational autoencoder and multi-modal artist embeddings

Figure 3 for Generating lyrics with variational autoencoder and multi-modal artist embeddings

Figure 4 for Generating lyrics with variational autoencoder and multi-modal artist embeddings

Abstract:We present a system for generating song lyrics lines conditioned on the style of a specified artist. The system uses a variational autoencoder with artist embeddings. We propose the pre-training of artist embeddings with the representations learned by a CNN classifier, which is trained to predict artists based on MEL spectrograms of their song clips. This work is the first step towards combining audio and text modalities of songs for generating lyrics conditioned on the artist's style. Our preliminary results suggest that there is a benefit in initializing artists' embeddings with the representations learned by a spectrogram classifier.

* 5 pages, 5 tables, 1 figure

Via

Access Paper or Ask Questions

Disentangled Representation Learning for Non-Parallel Text Style Transfer

Sep 11, 2018

Vineet John, Lili Mou, Hareesh Bahuleyan, Olga Vechtomova

Figure 1 for Disentangled Representation Learning for Non-Parallel Text Style Transfer

Figure 2 for Disentangled Representation Learning for Non-Parallel Text Style Transfer

Figure 3 for Disentangled Representation Learning for Non-Parallel Text Style Transfer

Figure 4 for Disentangled Representation Learning for Non-Parallel Text Style Transfer

Abstract:This paper tackles the problem of disentangling the latent variables of style and content in language models. We propose a simple yet effective approach, which incorporates auxiliary multi-task and adversarial objectives, for label prediction and bag-of-words prediction, respectively. We show, both qualitatively and quantitatively, that the style and content are indeed disentangled in the latent space. This disentangled latent representation learning method is applied to style transfer on non-parallel corpora. We achieve substantially better results in terms of transfer accuracy, content preservation and language fluency, in comparison to previous state-of-the-art approaches.

* 11 pages, 7 figures, 6 tables; Preliminary work rejected by EMNLP-18

Via

Access Paper or Ask Questions

Sentiment Analysis on Financial News Headlines using Training Dataset Augmentation

Jul 29, 2017

Vineet John, Olga Vechtomova

Figure 1 for Sentiment Analysis on Financial News Headlines using Training Dataset Augmentation

Figure 2 for Sentiment Analysis on Financial News Headlines using Training Dataset Augmentation

Abstract:This paper discusses the approach taken by the UWaterloo team to arrive at a solution for the Fine-Grained Sentiment Analysis problem posed by Task 5 of SemEval 2017. The paper describes the document vectorization and sentiment score prediction techniques used, as well as the design and implementation decisions taken while building the system for this task. The system uses text vectorization models, such as N-gram, TF-IDF and paragraph embeddings, coupled with regression model variants to predict the sentiment scores. Amongst the methods examined, unigrams and bigrams coupled with simple linear regression obtained the best baseline accuracy. The paper also explores data augmentation methods to supplement the training dataset. This system was designed for Subtask 2 (News Statements and Headlines).

* Association for Computational Linguistics, Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 869-873
* 5 pages

Via

Access Paper or Ask Questions

A Survey of Neural Network Techniques for Feature Extraction from Text

Apr 27, 2017

Vineet John

Figure 1 for A Survey of Neural Network Techniques for Feature Extraction from Text

Figure 2 for A Survey of Neural Network Techniques for Feature Extraction from Text

Figure 3 for A Survey of Neural Network Techniques for Feature Extraction from Text

Abstract:This paper aims to catalyze the discussions about text feature extraction techniques using neural network architectures. The research questions discussed in the paper focus on the state-of-the-art neural network techniques that have proven to be useful tools for language processing, language generation, text classification and other computational linguistics tasks.

* 9 pages, 2 figures

Via

Access Paper or Ask Questions

Rapid-Rate: A Framework for Semi-supervised Real-time Sentiment Trend Detection in Unstructured Big Data

Mar 25, 2017

Vineet John

Figure 1 for Rapid-Rate: A Framework for Semi-supervised Real-time Sentiment Trend Detection in Unstructured Big Data

Figure 2 for Rapid-Rate: A Framework for Semi-supervised Real-time Sentiment Trend Detection in Unstructured Big Data

Figure 3 for Rapid-Rate: A Framework for Semi-supervised Real-time Sentiment Trend Detection in Unstructured Big Data

Figure 4 for Rapid-Rate: A Framework for Semi-supervised Real-time Sentiment Trend Detection in Unstructured Big Data

Abstract:Commercial establishments like restaurants, service centres and retailers have several sources of customer feedback about products and services, most of which need not be as structured as rated reviews provided by services like Yelp, or Amazon, in terms of sentiment conveyed. For instance, Amazon provides a fine-grained score on a numeric scale for product reviews. Some sources, however, like social media (Twitter, Facebook), mailing lists (Google Groups) and forums (Quora) contain text data that is much more voluminous, but unstructured and unlabelled. It might be in the best interests of a business establishment to assess the general sentiment towards their brand on these platforms as well. This text could be pipelined into a system with a built-in prediction model, with the objective of generating real-time graphs on opinion and sentiment trends. Although such tasks like the one described about have been explored with respect to document classification problems in the past, the implementation described in this paper, by virtue of learning a continuous function rather than a discrete one, offers a lot more depth of insight as compared to document classification approaches. This study aims to explore the validity of such a continuous function predicting model to quantify sentiment about an entity, without the additional overhead of manual labelling, and computational preprocessing & feature extraction. This research project also aims to design and implement a re-usable document regression pipeline as a framework, Rapid-Rate, that can be used to predict document scores in real-time.

* 11 pages, 6 figures

Via

Access Paper or Ask Questions