Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Artem Popov

Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

Nov 11, 2017

Anna Potapenko, Artem Popov, Konstantin Vorontsov

Figure 1 for Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

Figure 2 for Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

Figure 3 for Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

Figure 4 for Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

Abstract:We consider probabilistic topic models and more recent word embedding techniques from a perspective of learning hidden semantic representations. Inspired by a striking similarity of the two approaches, we merge them and learn probabilistic embeddings with online EM-algorithm on word co-occurrence data. The resulting embeddings perform on par with Skip-Gram Negative Sampling (SGNS) on word similarity tasks and benefit in the interpretability of the components. Next, we learn probabilistic document embeddings that outperform paragraph2vec on a document similarity task and require less memory and time for training. Finally, we employ multimodal Additive Regularization of Topic Models (ARTM) to obtain a high sparsity and learn embeddings for other modalities, such as timestamps and categories. We observe further improvement of word similarity performance and meaningful inter-modality similarities.

* Appeared in AINL-2017

Via

Access Paper or Ask Questions