Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tim Van-De-Cruys

Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

Apr 04, 2019

Damien Sileo, Tim Van-De-Cruys, Camille Pradel, Philippe Muller

Figure 1 for Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

Figure 2 for Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

Figure 3 for Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

Figure 4 for Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

Abstract:Various NLP problems -- such as the prediction of sentence similarity, entailment, and discourse relations -- are all instances of the same general task: the modeling of semantic relations between a pair of textual elements. A popular model for such problems is to embed sentences into fixed size vectors, and use composition functions (e.g. concatenation or sum) of those vectors as features for the prediction. At the same time, composition of embeddings has been a main focus within the field of Statistical Relational Learning (SRL) whose goal is to predict relations between entities (typically from knowledge base triples). In this article, we show that previous work on relation prediction between texts implicitly uses compositions from baseline SRL models. We show that such compositions are not expressive enough for several tasks (e.g. natural language inference). We build on recent SRL models to address textual relational problems, showing that they are more expressive, and can alleviate issues from simpler compositions. The resulting models significantly improve the state of the art in both transferable sentence representation learning and relation prediction.

* Camera-ready for *SEM 2019

Via

Access Paper or Ask Questions

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Mar 28, 2019

Damien Sileo, Tim Van-De-Cruys, Camille Pradel, Philippe Muller

Figure 1 for Mining Discourse Markers for Unsupervised Sentence Representation Learning

Figure 2 for Mining Discourse Markers for Unsupervised Sentence Representation Learning

Figure 3 for Mining Discourse Markers for Unsupervised Sentence Representation Learning

Figure 4 for Mining Discourse Markers for Unsupervised Sentence Representation Learning

Abstract:Current state of the art systems in NLP heavily rely on manually annotated datasets, which are expensive to construct. Very little work adequately exploits unannotated data -- such as discourse markers between sentences -- mainly because of data sparseness and ineffective extraction methods. In the present work, we propose a method to automatically discover sentence pairs with relevant discourse markers, and apply it to massive amounts of data. Our resulting dataset contains 174 discourse markers with at least 10k examples each, even for rare markers such as coincidentally or amazingly We use the resulting data as supervision for learning transferable sentence embeddings. In addition, we show that even though sentence representation learning through prediction of discourse markers yields state of the art results across different transfer tasks, it is not clear that our models made use of the semantic relation between sentences, thus leaving room for further improvements. Our datasets are publicly available (https://github.com/synapse-developpement/Discovery)

* Camera-ready for NAACL HLT 2019

Via

Access Paper or Ask Questions