Abstract: Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited for use with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges. Its knowledge is collected from many sources, including expert-created resources, crowd-sourcing, and games with a purpose. It is designed to represent the general knowledge involved in understanding language, improving natural language applications by allowing the application to better understand the meanings behind the words people use. When ConceptNet is combined with word embeddings acquired from distributional semantics (such as word2vec), it provides applications with understanding that they would not acquire from distributional semantics alone, nor from narrower resources such as WordNet or DBpedia. We demonstrate this with state-of-the-art results on intrinsic evaluations of word relatedness that translate into improvements on applications of word vectors, including solving SAT-style analogies.
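To make the labeled-edge structure concrete, here is a minimal sketch of a knowledge graph of that shape. The `KnowledgeGraph` class and its methods are illustrative assumptions, not ConceptNet's actual API; only the URI conventions such as `/c/en/dog` and `/r/IsA` follow ConceptNet's scheme.

```python
from collections import defaultdict

class KnowledgeGraph:
    """Toy stand-in for a ConceptNet-style graph: terms linked by labeled, weighted edges."""

    def __init__(self):
        # adjacency: start term -> list of (relation label, end term, weight)
        self.edges = defaultdict(list)

    def add_edge(self, start, relation, end, weight=1.0):
        self.edges[start].append((relation, end, weight))

    def neighbors(self, term):
        return self.edges.get(term, [])

graph = KnowledgeGraph()
graph.add_edge("/c/en/dog", "/r/IsA", "/c/en/animal", weight=2.0)
graph.add_edge("/c/en/dog", "/r/CapableOf", "/c/en/bark", weight=1.0)

for relation, end, weight in graph.neighbors("/c/en/dog"):
    print(relation, end, weight)
```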
Abstract: A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging their information into a common representation with a large, multilingual vocabulary. The embeddings it produces achieve state-of-the-art performance on many word-similarity evaluations. Its score of $\rho = .596$ on an evaluation of rare words (Luong et al., 2013) is 16% higher than the previous best-known system.
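The merging this abstract describes can be pictured with a retrofitting-style update in the spirit of Faruqui et al. (2015): distributional vectors are nudged toward the vectors of their neighbors in a semantic network such as ConceptNet or PPDB. The `retrofit` function and the toy two-word vocabulary below are illustrative assumptions, not the paper's exact ensemble procedure.

```python
import numpy as np

def retrofit(vectors, neighbors, iterations=10, alpha=1.0, beta=1.0):
    """Nudge each word vector toward its graph neighbors.

    vectors:   dict mapping word -> np.ndarray (the original distributional embedding)
    neighbors: dict mapping word -> list of neighboring words in the semantic network
    """
    new_vectors = {word: vec.copy() for word, vec in vectors.items()}
    for _ in range(iterations):
        for word, links in neighbors.items():
            links = [n for n in links if n in new_vectors]
            if word not in vectors or not links:
                continue
            # weighted average of the original vector and the current neighbor vectors
            neighbor_sum = sum(new_vectors[n] for n in links)
            new_vectors[word] = (alpha * vectors[word] + beta * neighbor_sum) / (
                alpha + beta * len(links)
            )
    return new_vectors

# Toy usage: two distributional vectors pulled together by a single graph edge.
vectors = {"cat": np.array([1.0, 0.0]), "feline": np.array([0.0, 1.0])}
neighbors = {"cat": ["feline"], "feline": ["cat"]}
print(retrofit(vectors, neighbors)["cat"])
```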