Victor Prokhorov

Autoencoding Conditional Neural Processes for Representation Learning

May 29, 2023

Victor Prokhorov, Ivan Titov, N. Siddharth

Abstract: Conditional neural processes (CNPs) are a flexible and efficient family of models that learn to learn a stochastic process from observations. In the visual domain, they have seen particular application in contextual image completion - observing pixel values at some locations to predict a distribution over values at other unobserved locations. However, the choice of pixels in learning such a CNP is typically either random or derived from a simple statistical measure (e.g. pixel variance). Here, we turn the problem on its head and ask: which pixels would a CNP like to observe? That is, which pixels allow fitting a CNP, and do such pixels tell us something about the underlying image? Viewing the context provided to the CNP as fixed-size latent representations, we construct an amortised variational framework, Partial Pixel Space Variational Autoencoder (PPS-VAE), for predicting this context simultaneously with learning a CNP. We evaluate PPS-VAE on a set of vision datasets, and find that not only is it possible to learn context points while also fitting CNPs, but that their spatial arrangement and values provide a strong signal for the information contained in the image - evaluated through the lens of classification. We believe the PPS-VAE provides a promising avenue to explore learning interpretable and effective visual representations.

StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure

May 09, 2023

Mattia Opper, Victor Prokhorov, N. Siddharth

Abstract: This work explores the utility of explicit structure for representation learning in NLP by developing StrAE -- an autoencoding framework that faithfully leverages sentence structure to learn multi-level node embeddings in an unsupervised fashion. We use StrAE to train models across different types of sentential structure and objectives, including a novel contrastive loss over structure, and evaluate the learnt embeddings on a series of both intrinsic and extrinsic tasks. Our experiments indicate that leveraging explicit structure through StrAE leads to improved embeddings over prior work, and that our novel contrastive objective over structure outperforms the standard cross-entropy objective. Moreover, in contrast to findings from prior work that weakly leverages structure, we find that being completely faithful to structure does enable disambiguation between types of structure based on the corresponding model's performance. As further evidence of StrAE's utility, we develop a simple proof-of-concept approach to simultaneously induce structure while learning embeddings, rather than being given structure, and find that performance is comparable to that of the best-performing models where structure is given. Finally, we contextualise these results by comparing StrAE against standard unstructured baselines learnt in similar settings, and show that faithfully leveraging explicit structure can be beneficial in lexical and sentence-level semantics.

* An earlier non-archival version of this paper was presented at UM-IOS 2022

Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets

Jun 07, 2021

Lan Zhang, Victor Prokhorov, Ehsan Shareghi

Abstract: To highlight the challenges of achieving representation disentanglement for the text domain in an unsupervised setting, in this paper we select a representative set of successfully applied models from the image domain. We evaluate these models on 6 disentanglement metrics, as well as on downstream classification tasks and homotopy. To facilitate the evaluation, we propose two synthetic datasets with known generative factors. Our experiments highlight the existing gap in the text domain and illustrate that certain elements, such as representation sparsity (as an inductive bias) or representation coupling with the decoder, could impact disentanglement. To the best of our knowledge, our work is the first attempt at the intersection of unsupervised representation disentanglement and text, and provides the experimental framework and datasets for examining future developments in this direction.

* Accepted to RepL4NLP 2021

Hierarchical Sparse Variational Autoencoder for Text Encoding

Sep 25, 2020

Victor Prokhorov, Yingzhen Li, Ehsan Shareghi, Nigel Collier

Abstract: In this paper we focus on unsupervised representation learning and propose a novel framework, Hierarchical Sparse Variational Autoencoder (HSVAE), that imposes sparsity on sentence representations via direct optimisation of the Evidence Lower Bound (ELBO). Our experimental results illustrate that HSVAE is flexible and adapts nicely to the underlying characteristics of the corpus, which is reflected by the level of sparsity and its distributional patterns.

On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Sep 30, 2019

Victor Prokhorov, Ehsan Shareghi, Yingzhen Li, Mohammad Taher Pilehvar, Nigel Collier

Abstract: Variational Autoencoders (VAEs) are known to suffer from learning uninformative latent representations of the input due to issues such as approximated posterior collapse or entanglement of the latent space. We impose an explicit constraint on the Kullback-Leibler (KL) divergence term inside the VAE objective function. While the explicit constraint naturally avoids posterior collapse, we use it to further understand the significance of the KL term in controlling the information transmitted through the VAE channel. Within this framework, we explore different properties of the estimated posterior distribution, and highlight the trade-off between the amount of information encoded in a latent code during training, and the generative capacity of the model.
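
The abstract does not give the exact form of the constraint. As a hedged illustration only, one common way to pin the KL term to a target value is to penalise its distance from a constant; the symbols $\beta$ (penalty weight) and $C$ (target KL value) below are assumptions introduced for this sketch and may not match the paper's notation:

```latex
\mathcal{L}(\theta, \phi; x)
  = \mathbb{E}_{q_\phi(z \mid x)}\bigl[\log p_\theta(x \mid z)\bigr]
  - \beta \,\bigl|\, D_{\mathrm{KL}}\bigl(q_\phi(z \mid x) \,\|\, p(z)\bigr) - C \,\bigr|
```

With $C > 0$, the penalty is minimised when the KL term equals $C$ rather than zero, which is why a constraint of this kind discourages posterior collapse.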

* 10 pages; Accepted in 3rd Workshop on Neural Generation and Translation (WNGT 2019)

Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models

Apr 05, 2019

Victor Prokhorov, Mohammad Taher Pilehvar, Nigel Collier

Abstract: We present a novel method for mapping unrestricted text to knowledge graph entities by framing the task as a sequence-to-sequence problem. Specifically, given the encoded state of an input text, our decoder directly predicts paths in the knowledge graph, starting from the root and ending at the target node following hypernym-hyponym relationships. In this way, and in contrast to other text-to-entity mapping systems, our model outputs hierarchically structured predictions that are fully interpretable in the context of the underlying ontology, in an end-to-end manner. We present a proof-of-concept experiment with encouraging results, comparable to those of state-of-the-art systems.

* Accepted at NAACL 2019

Unseen Word Representation by Aligning Heterogeneous Lexical Semantic Spaces

Nov 12, 2018

Victor Prokhorov, Mohammad Taher Pilehvar, Dimitri Kartsaklis, Pietro Lio, Nigel Collier

Abstract: Word embedding techniques heavily rely on the abundance of training data for individual words. Given the Zipfian distribution of words in natural language texts, a large number of words do not usually appear frequently or at all in the training data. In this paper we put forward a technique that exploits the knowledge encoded in lexical resources, such as WordNet, to induce embeddings for unseen words. Our approach adapts graph embedding and cross-lingual vector space transformation techniques in order to merge lexical knowledge encoded in ontologies with that derived from corpus statistics. We show that the approach can provide consistent performance improvements across multiple evaluation benchmarks: in-vitro, on multiple rare word similarity datasets, and in-vivo, in two downstream text classification tasks.

* Accepted for presentation at AAAI 2019

Card-660: Cambridge Rare Word Dataset - a Reliable Benchmark for Infrequent Word Representation Models

Aug 28, 2018

Mohammad Taher Pilehvar, Dimitri Kartsaklis, Victor Prokhorov, Nigel Collier

Abstract: Rare word representation has recently enjoyed a surge of interest, owing to the crucial role that effective handling of infrequent words can play in accurate semantic understanding. However, there is a paucity of reliable benchmarks for evaluation and comparison of these techniques. We show in this paper that the only existing benchmark (the Stanford Rare Word dataset) suffers from low-confidence annotations and limited vocabulary; hence, it does not constitute a solid comparison framework. In order to fill this evaluation gap, we propose CAmbridge Rare word Dataset (Card-660), an expert-annotated word similarity dataset which provides a highly reliable, yet challenging, benchmark for rare word representation techniques. Through a set of experiments we show that even the best mainstream word embeddings, with millions of words in their vocabularies, are unable to achieve performances higher than 0.43 (Pearson correlation) on the dataset, compared to a human-level upper bound of 0.90. We release the dataset and the annotation materials at https://pilehvar.github.io/card-660/.

* EMNLP 2018

Learning Rare Word Representations using Semantic Bridging

Jul 24, 2017

Victor Prokhorov, Mohammad Taher Pilehvar, Dimitri Kartsaklis, Pietro Lió, Nigel Collier

Abstract: We propose a methodology that adapts graph embedding techniques (DeepWalk (Perozzi et al., 2014) and node2vec (Grover and Leskovec, 2016)) as well as cross-lingual vector space mapping approaches (Least Squares and Canonical Correlation Analysis) in order to merge the corpus and ontological sources of lexical knowledge. We also perform a comparative analysis of the algorithms used in order to identify the best combination for the proposed system. We then apply this to the task of enhancing the coverage of an existing word embedding's vocabulary with rare and unseen words. We show that our technique can provide considerable extra coverage (over 99%), leading to consistent performance gain (around 10% absolute gain is achieved with w2v-gn-500K, cf. §3.3) on the Rare Word Similarity dataset.
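
The Least Squares mapping mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy data, the dimensions, and the names `fit_least_squares_map`, `graph_shared`, `corpus_shared` and `graph_rare` are all hypothetical. The idea is to learn a linear map from graph-embedding space to corpus-embedding space using words present in both, then project a word that only has a graph embedding.

```python
import numpy as np


def fit_least_squares_map(source, target):
    """Return the matrix W that minimises ||source @ W - target||_F."""
    W, *_ = np.linalg.lstsq(source, target, rcond=None)
    return W


rng = np.random.default_rng(0)

# Toy stand-ins (assumed data): 50 words that appear in both spaces,
# with 8-d graph embeddings and 5-d corpus embeddings.
graph_shared = rng.normal(size=(50, 8))
true_map = rng.normal(size=(8, 5))
corpus_shared = graph_shared @ true_map

# Learn the mapping from the shared vocabulary.
W = fit_least_squares_map(graph_shared, corpus_shared)

# A rare word known only to the ontology: project its graph embedding
# into the corpus space to induce a corpus-style vector for it.
graph_rare = rng.normal(size=(1, 8))
induced = graph_rare @ W
```

In this noiseless toy setting the learned `W` recovers `true_map` up to floating-point error; with real embeddings the fit would only be approximate.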
