Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jannik Wolff

Mixture-of-experts VAEs can disregard variation in surjective multimodal data

Apr 11, 2022

Jannik Wolff, Tassilo Klein, Moin Nabi, Rahul G. Krishnan, Shinichi Nakajima

Figure 1 for Mixture-of-experts VAEs can disregard variation in surjective multimodal data

Figure 2 for Mixture-of-experts VAEs can disregard variation in surjective multimodal data

Abstract:Machine learning systems are often deployed in domains that entail data from multiple modalities, for example, phenotypic and genotypic characteristics describe patients in healthcare. Previous works have developed multimodal variational autoencoders (VAEs) that generate several modalities. We consider subjective data, where single datapoints from one modality (such as class labels) describe multiple datapoints from another modality (such as images). We theoretically and empirically demonstrate that multimodal VAEs with a mixture of experts posterior can struggle to capture variability in such surjective data.

* Accepted at the NeurIPS 2021 workshop on Bayesian Deep Learning

Via

Access Paper or Ask Questions

Learning Graph-Based Priors for Generalized Zero-Shot Learning

Oct 22, 2020

Colin Samplawski, Jannik Wolff, Tassilo Klein, Moin Nabi

Figure 1 for Learning Graph-Based Priors for Generalized Zero-Shot Learning

Figure 2 for Learning Graph-Based Priors for Generalized Zero-Shot Learning

Figure 3 for Learning Graph-Based Priors for Generalized Zero-Shot Learning

Figure 4 for Learning Graph-Based Priors for Generalized Zero-Shot Learning

Abstract:The task of zero-shot learning (ZSL) requires correctly predicting the label of samples from classes which were unseen at training time. This is achieved by leveraging side information about class labels, such as label attributes or word embeddings. Recently, attention has shifted to the more realistic task of generalized ZSL (GZSL) where test sets consist of seen and unseen samples. Recent approaches to GZSL have shown the value of generative models, which are used to generate samples from unseen classes. In this work, we incorporate an additional source of side information in the form of a relation graph over labels. We leverage this graph in order to learn a set of prior distributions, which encourage an aligned variational autoencoder (VAE) model to learn embeddings which respect the graph structure. Using this approach we are able to achieve improved performance on the CUB and SUN benchmarks over a strong baseline.

* Presented at AAAI 2020 Workshop on Deep Learning on Graphs: Methodologies and Applications (DLGMA'20)

Via

Access Paper or Ask Questions

Low-Shot Learning from Imaginary 3D Model

Jan 04, 2019

Frederik Pahde, Mihai Puscas, Jannik Wolff, Tassilo Klein, Nicu Sebe, Moin Nabi

Figure 1 for Low-Shot Learning from Imaginary 3D Model

Figure 2 for Low-Shot Learning from Imaginary 3D Model

Figure 3 for Low-Shot Learning from Imaginary 3D Model

Figure 4 for Low-Shot Learning from Imaginary 3D Model

Abstract:Since the advent of deep learning, neural networks have demonstrated remarkable results in many visual recognition tasks, constantly pushing the limits. However, the state-of-the-art approaches are largely unsuitable in scarce data regimes. To address this shortcoming, this paper proposes employing a 3D model, which is derived from training images. Such a model can then be used to hallucinate novel viewpoints and poses for the scarce samples of the few-shot learning scenario. A self-paced learning approach allows for the selection of a diverse set of high-quality images, which facilitates the training of a classifier. The performance of the proposed approach is showcased on the fine-grained CUB-200-2011 dataset in a few-shot setting and significantly improves our baseline accuracy.

* To appear at WACV 2019. arXiv admin note: text overlap with arXiv:1811.09192

Via

Access Paper or Ask Questions