Kamen Brestnichki

Multilingual Factor Analysis

May 14, 2019

Francisco Vargas, Kamen Brestnichki, Alex Papadopoulos-Korfiatis, Nils Hammerla

Abstract: In this work, we approach the task of learning multilingual word representations in an offline manner by fitting a generative latent variable model to a multilingual dictionary. We model equivalent words in different languages as different views of the same word, generated by a common latent variable representing their latent lexical meaning. We explore the task of alignment by querying the fitted model for multilingual embeddings, achieving competitive results across a variety of tasks. The proposed model is robust to noise in the embedding space, making it a suitable method for distributed representations learned from noisy corpora.

* Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
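
The "different views of a common latent variable" idea in the abstract above can be illustrated with a minimal sketch. It assumes a standard linear-Gaussian factor analysis model, x = W z + mu + eps with z ~ N(0, I) and diagonal noise, and it assumes the parameters are already known. The names `posterior_mean`, `W`, `mu` and `psi` are hypothetical, and this is not the paper's fitting procedure, only the querying step under those assumptions.

```python
import numpy as np

def posterior_mean(x, W, mu, psi):
    """E[z | x] for x = W z + mu + eps, z ~ N(0, I), eps ~ N(0, diag(psi)).

    x:   (n, d) observed embeddings in one language (one "view")
    W:   (d, k) loading matrix for that language (assumed already fitted)
    mu:  (d,)   mean of that language's embeddings
    psi: (d,)   diagonal noise variances
    """
    cov_x = W @ W.T + np.diag(psi)            # marginal covariance of x
    # Row-wise W^T cov_x^{-1} (x - mu), computed with a linear solve
    return np.linalg.solve(cov_x, (x - mu).T).T @ W

# Toy data: two "languages" generated from the same latent meanings z.
rng = np.random.default_rng(0)
n, d, k = 100, 20, 3
z = rng.normal(size=(n, k))
W_x, W_y = rng.normal(size=(d, k)), rng.normal(size=(d, k))
mu_x, mu_y = rng.normal(size=d), rng.normal(size=d)
psi = np.full(d, 0.05 ** 2)
x = z @ W_x.T + mu_x + 0.05 * rng.normal(size=(n, d))
y = z @ W_y.T + mu_y + 0.05 * rng.normal(size=(n, d))

# Querying either view maps translation pairs to nearby latent points.
z_from_x = posterior_mean(x, W_x, mu_x, psi)
z_from_y = posterior_mean(y, W_y, mu_y, psi)
```

In this toy setting, both views recover approximately the same latent vector for each word pair, which is the sense in which the latent space acts as a shared multilingual embedding.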

Model Comparison for Semantic Grouping

May 01, 2019

Francisco Vargas, Kamen Brestnichki, Nils Hammerla

Abstract: We introduce a probabilistic framework for quantifying the semantic similarity between two groups of embeddings. We formulate the task of semantic similarity as a model comparison task in which we contrast a generative model that jointly models two sentences with one that does not. We illustrate how this framework can be used for the Semantic Textual Similarity tasks using clear assumptions about how the embeddings of words are generated. We apply model comparison that utilises information criteria to address some of the shortcomings of Bayesian model comparison, whilst still penalising model complexity. We achieve competitive results by applying the proposed framework with an appropriate choice of likelihood on the STS datasets.

* Proceedings of the 36th International Conference on Machine Learning
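
The model-comparison formulation in the abstract above can be sketched as follows. This is a minimal illustration under loud assumptions: it uses a diagonal Gaussian likelihood and the Bayesian information criterion (BIC), which may differ from the likelihood and criterion chosen in the paper. The function names `diag_gauss_loglik` and `similarity_score` are hypothetical.

```python
import numpy as np

def diag_gauss_loglik(x, eps=1e-6):
    """Log-likelihood of the rows of x under a diagonal Gaussian fitted to x."""
    mu = x.mean(axis=0)
    var = x.var(axis=0) + eps                 # small floor for numerical stability
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)

def bic(loglik, n_params, n_obs):
    """Bayesian information criterion: lower is better."""
    return n_params * np.log(n_obs) - 2.0 * loglik

def similarity_score(s1, s2):
    """BIC(separate models) - BIC(joint model); larger means more similar.

    s1, s2: (n1, d) and (n2, d) arrays of word embeddings for two sentences.
    """
    n, d = s1.shape[0] + s2.shape[0], s1.shape[1]
    # Joint model: one Gaussian (mean + variance per dimension) for all words.
    bic_joint = bic(diag_gauss_loglik(np.vstack([s1, s2])), 2 * d, n)
    # Separate model: one Gaussian per sentence, so twice the parameters.
    bic_sep = bic(diag_gauss_loglik(s1) + diag_gauss_loglik(s2), 4 * d, n)
    return bic_sep - bic_joint

# Toy check: embeddings from one distribution vs. from two distant ones.
rng = np.random.default_rng(0)
a = rng.normal(size=(30, 5))
b_same = rng.normal(size=(30, 5))
b_far = rng.normal(loc=5.0, size=(30, 5))
score_same = similarity_score(a, b_same)
score_far = similarity_score(a, b_far)
```

The separate model always fits at least as well as the joint one, so the complexity penalty is what lets the joint model win when the two groups genuinely share a distribution.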
