Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Wikipedia-based Semantic Interpretation for Natural Language Processing

Jan 15, 2014

Evgeniy Gabrilovich, Shaul Markovitch

Figure 1 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 2 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 3 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 4 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Share this with someone who'll enjoy it:

Abstract:Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, called Explicit Semantic Analysis (ESA), for fine-grained semantic interpretation of unrestricted natural language texts. Our method represents meaning in a high-dimensional space of concepts derived from Wikipedia, the largest encyclopedia in existence. We explicitly represent the meaning of any text in terms of Wikipedia-based concepts. We evaluate the effectiveness of our method on text categorization and on computing the degree of semantic relatedness between fragments of natural language text. Using ESA results in significant improvements over the previous state of the art in both tasks. Importantly, due to the use of natural concepts, the ESA model is easy to explain to human users.

* Journal Of Artificial Intelligence Research, Volume 34, pages 443-498, 2009

View paper on

Share this with someone who'll enjoy it:

Title:Wikipedia-based Semantic Interpretation for Natural Language Processing

Paper and Code