Dimuthu Lakmal

Semi-Supervised Instance Population of an Ontology using Word Vector Embeddings

Sep 09, 2017

Vindula Jayawardana, Dimuthu Lakmal, Nisansa de Silva, Amal Shehan Perera, Keet Sugathadasa, Buddhi Ayesha, Madhavi Perera

Abstract: In many modern systems, such as information extraction and knowledge management agents, ontologies play a vital role in maintaining the concept hierarchies of the selected domain. However, ontology population remains a problematic process because it depends heavily on manual human intervention. Word embeddings have become a popular topic in natural language processing because of their ability to cope with semantic sensitivity. Hence, in this study, we propose a novel method of semi-supervised ontology population based on word embeddings. We built several models, including traditional benchmark models and new types of models based on word embeddings. Finally, we combine them into an ensemble to obtain a synergistic model with better accuracy. We demonstrate that our ensemble model can outperform the individual models.

Synergistic Union of Word2Vec and Lexicon for Domain Specific Semantic Similarity

Jun 09, 2017

Keet Sugathadasa, Buddhi Ayesha, Nisansa de Silva, Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, Madhavi Perera

Abstract: Semantic similarity measures are an important part of Natural Language Processing tasks. However, semantic similarity measures built for general use do not perform well within specific domains. Therefore, in this study, we introduce a domain-specific semantic similarity measure created by the synergistic union of word2vec, a word embedding method used for semantic similarity calculation, and lexicon-based (lexical) semantic similarity methods. We prove that the proposed methodology outperforms both word embedding methods trained on a generic corpus and methods trained on a domain-specific corpus that do not use lexical semantic similarity methods to augment the results. Further, we prove that text lemmatization can improve the performance of word embedding methods.

* 6 Pages, 3 figures
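
The abstract does not say how the word2vec score and the lexical score are combined. As a minimal sketch only, the snippet below computes cosine similarity between two embedding vectors and blends it with a lexicon-based score through a weighted average. The weighted average, the `weight` parameter, and the function names are assumptions made for illustration, not the authors' method.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def combined_similarity(embedding_sim, lexical_sim, weight=0.5):
    """Hypothetical blend: weighted average of an embedding score and a lexical score."""
    return weight * embedding_sim + (1 - weight) * lexical_sim


# Toy vectors standing in for word2vec embeddings of two terms.
embedding_sim = cosine_similarity([1.0, 0.0], [1.0, 0.0])
# 0.4 is a made-up lexicon-based score for the same pair of terms.
print(combined_similarity(embedding_sim, 0.4, weight=0.5))
```

In practice the embedding vectors would come from a trained word2vec model and the lexical score from a lexicon-based measure; both are replaced by toy values here.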

Deriving a Representative Vector for Ontology Classes with Instance Word Vector Embeddings

Jun 08, 2017

Vindula Jayawardana, Dimuthu Lakmal, Nisansa de Silva, Amal Shehan Perera, Keet Sugathadasa, Buddhi Ayesha

Abstract: Selecting a representative vector for a set of vectors is a very common requirement in many algorithmic tasks. Traditionally, the mean or median vector is selected. Ontology classes are sets of homogeneous instance objects that can be converted to a vector space by word vector embeddings. This study proposes a methodology to derive a representative vector for ontology classes whose instances have been converted to the vector space. We start by deriving five candidate vectors, which are then used to train a machine learning model that calculates a representative vector for the class. We show that our methodology outperforms the traditional mean and median vector representations.
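
The two traditional baselines named in the abstract can be sketched in a few lines. The snippet below computes a component-wise mean and a component-wise median over a small set of instance vectors. Reading "median vector" as the component-wise median is an assumption, and the toy vectors and function names are illustrative; the five candidate vectors and the trained model from the paper are not reproduced here.

```python
from statistics import mean, median


def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [mean(component) for component in zip(*vectors)]


def median_vector(vectors):
    """Component-wise median of a list of equal-length vectors (assumed interpretation)."""
    return [median(component) for component in zip(*vectors)]


# Toy 2-D vectors standing in for the embeddings of one class's instances.
instances = [[1.0, 2.0], [3.0, 4.0], [8.0, 0.0]]
print(mean_vector(instances))    # [4.0, 2.0]
print(median_vector(instances))  # [3.0, 2.0]
```

The outlying instance `[8.0, 0.0]` pulls the mean further than the median, which is one reason a single fixed choice between the two may not represent a class well.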
