Abstract: The ability to change arbitrary aspects of a text while leaving the core message intact could have a strong impact in fields like marketing and politics by enabling, e.g., automatic optimization of message impact and personalized language adapted to the receiver's profile. In this paper we take a first step towards such a system by presenting an algorithm that can manipulate the sentiment of a text while preserving its semantics, using disentangled representations. Validation is performed by examining trajectories in embedding space and by analyzing transformed sentences for semantic preservation and expression of the desired sentiment shift.
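To make the disentanglement idea concrete, below is a minimal sketch in PyTorch of the kind of setup the abstract describes: an encoder splits a sentence representation into a content code and a sentiment code, and a decoder reconstructs text from the pair. All names, dimensions, and the GRU architecture are illustrative assumptions rather than the paper's actual model; shifting the sentiment code while freezing the content code is what traces trajectories in embedding space of the kind examined above.

import torch
import torch.nn as nn

class DisentangledAutoencoder(nn.Module):
    # Hypothetical architecture: the encoder's final hidden state is split
    # into a content part and a sentiment part.
    def __init__(self, vocab_size, emb_dim=128, content_dim=96, sent_dim=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.GRU(emb_dim, content_dim + sent_dim, batch_first=True)
        self.decoder = nn.GRU(emb_dim, content_dim + sent_dim, batch_first=True)
        self.out = nn.Linear(content_dim + sent_dim, vocab_size)
        self.content_dim = content_dim

    def encode(self, tokens):
        _, h = self.encoder(self.embed(tokens))  # h: (1, batch, content+sent)
        h = h.squeeze(0)
        return h[:, :self.content_dim], h[:, self.content_dim:]

    def decode(self, content, sentiment, tokens):
        h0 = torch.cat([content, sentiment], dim=-1).unsqueeze(0)
        out, _ = self.decoder(self.embed(tokens), h0)
        return self.out(out)  # per-position vocabulary logits

def shift_sentiment(model, tokens, direction, alpha=1.0):
    # Move the sentiment code along a sentiment axis (e.g. estimated from
    # labeled data) while keeping the content code, hence the semantics, fixed.
    content, sentiment = model.encode(tokens)
    return model.decode(content, sentiment + alpha * direction, tokens)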
Abstract: In this paper, we present a model for generating summaries of text documents with respect to a query, a task known as query-based summarization. We adapt an existing dataset of news article summaries for the task and use it to train a pointer-generator model. The generated summaries are evaluated by measuring their similarity to reference summaries. Our results show that a neural summarization model, similar to existing neural models for abstractive summarization, can be constructed to make use of queries and produce targeted summaries.
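For reference, the defining step of a pointer-generator network is the mixture of a generation distribution over the fixed vocabulary with a copy distribution given by the attention over source tokens. The sketch below shows only that mixing step; shapes and names are illustrative assumptions, and in the query-based setting the query would enter through the encoder or the attention computation, which is omitted here.

import torch

def final_distribution(p_gen, vocab_dist, attention, source_ids, extended_vocab):
    # p_gen:      (batch, 1) probability of generating vs. copying
    # vocab_dist: (batch, vocab) softmax over the fixed vocabulary
    # attention:  (batch, src_len) attention weights over source tokens
    # source_ids: (batch, src_len) source token ids in the extended vocabulary
    batch, vocab = vocab_dist.shape
    dist = torch.zeros(batch, extended_vocab)
    dist[:, :vocab] = p_gen * vocab_dist
    # Scatter the copy mass onto the source token ids, so out-of-vocabulary
    # source words can still appear in the summary.
    dist.scatter_add_(1, source_ids, (1.0 - p_gen) * attention)
    return dist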
Abstract: Acquiring your first language is an incredible feat and not easily duplicated. Learning to communicate using nothing but a few pictureless books, i.e. a corpus, would likely be impossible even for humans. Nevertheless, this is the dominant approach in most natural language processing today. As an alternative, we propose the use of situated interactions between agents as a driving force for communication, and the framework of Deep Recurrent Q-Networks for evolving a shared language grounded in the provided environment. We task the agents with interactive image search in the form of the game Guess Who?. The images from the game provide a non-trivial environment for the agents to discuss and a natural grounding for the concepts they decide to encode in their communication. Our experiments show not only that the agents learn to encode physical concepts in their words, i.e. grounding, but also that they learn to hold a multi-step dialogue, remembering the state of the dialogue from step to step.
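As a concrete illustration of this setup, below is a minimal sketch of a Deep Recurrent Q-Network agent: an LSTM cell carries the dialogue state from turn to turn, and a Q-head scores discrete words as actions. The observation encoder, vocabulary size, and all dimensions are illustrative assumptions rather than the experiments' actual configuration.

import torch
import torch.nn as nn

class DRQNAgent(nn.Module):
    def __init__(self, obs_dim, vocab_size, hidden=256):
        super().__init__()
        self.obs_enc = nn.Linear(obs_dim, hidden)        # image features, etc.
        self.msg_emb = nn.Embedding(vocab_size, hidden)  # last word received
        self.rnn = nn.LSTMCell(2 * hidden, hidden)       # dialogue memory
        self.q_head = nn.Linear(hidden, vocab_size)      # one Q-value per word

    def step(self, obs, last_msg, state):
        x = torch.cat([torch.relu(self.obs_enc(obs)), self.msg_emb(last_msg)], dim=-1)
        h, c = self.rnn(x, state)
        return self.q_head(h), (h, c)

# One dialogue turn; the recurrent state is what lets the agent remember
# earlier steps of the conversation.
agent = DRQNAgent(obs_dim=64, vocab_size=32)
state = (torch.zeros(1, 256), torch.zeros(1, 256))
q_values, state = agent.step(torch.randn(1, 64), torch.tensor([0]), state)
word = q_values.argmax(dim=-1)  # greedy choice of the next word to utter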
Abstract: In this paper we present a clean, yet effective, model for word sense disambiguation. Our approach leverages a bidirectional long short-term memory network which is shared between all words. This enables the model to share statistical strength and to scale well with vocabulary size. The model is trained end-to-end, directly from the raw text to sense labels, and makes effective use of word order. We evaluate our approach on two standard datasets, using identical hyperparameter settings, which are in turn tuned on a third, held-out dataset. We employ no external resources (e.g. knowledge graphs or part-of-speech tagging), language-specific features, or hand-crafted rules, yet still achieve results statistically equivalent to the best state-of-the-art systems, which are not subject to such constraints.
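A minimal sketch of this setup follows, with illustrative dimensions and a flat sense inventory (the actual model's output layer and hyperparameters may differ): one bidirectional LSTM is shared across all words, and the contextual state at the position of the ambiguous word is classified into sense labels.

import torch
import torch.nn as nn

class BiLSTMSenseTagger(nn.Module):
    def __init__(self, vocab_size, n_senses, emb_dim=100, hidden=74):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # A single BiLSTM shared between all words lets the model pool
        # statistical strength across the whole vocabulary.
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.classify = nn.Linear(2 * hidden, n_senses)

    def forward(self, tokens, target_pos):
        out, _ = self.bilstm(self.embed(tokens))  # (batch, seq, 2*hidden)
        # Only the contextual state at the ambiguous word's position is used,
        # so word order on both sides informs the prediction.
        target = out[torch.arange(tokens.size(0)), target_pos]
        return self.classify(target)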