Abstract: We introduce a framework for uncertainty estimation that both describes and extends many existing methods. We treat typical hyperparameters of classical training as random variables and marginalise them out to capture various sources of uncertainty in the parameter space. We investigate which forms and combinations of marginalisation are most useful in practice on standard benchmark data sets. Moreover, we discuss how some marginalisations can produce reliable uncertainty estimates without extensive hyperparameter tuning or large-scale ensembling.
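As a concrete illustration of marginalising training hyperparameters, the sketch below treats the random seed, learning rate, and weight decay as random variables, samples them from simple priors, trains one model per sample, and averages the resulting predictive distributions. The priors, the tiny MLP, and the toy data are illustrative assumptions, not the paper's actual setup.

```python
import numpy as np
import torch
import torch.nn as nn

def train_member(X, y, seed, lr, weight_decay, epochs=200):
    torch.manual_seed(seed)  # marginalises over initialisation
    model = nn.Sequential(nn.Linear(2, 32), nn.ReLU(), nn.Linear(32, 2))
    opt = torch.optim.Adam(model.parameters(), lr=lr, weight_decay=weight_decay)
    for _ in range(epochs):
        opt.zero_grad()
        nn.functional.cross_entropy(model(X), y).backward()
        opt.step()
    return model

# toy two-class data
X = torch.randn(200, 2)
y = (X[:, 0] + X[:, 1] > 0).long()

rng = np.random.default_rng(0)
members = [
    train_member(
        X, y,
        seed=int(rng.integers(1_000_000)),
        lr=float(10 ** rng.uniform(-3.5, -2.5)),           # log-uniform prior
        weight_decay=float(10 ** rng.uniform(-5.0, -3.0)))  # log-uniform prior
    for _ in range(5)
]

# predictive distribution = average of member softmaxes; its spread
# reflects uncertainty contributed by the marginalised hyperparameters
with torch.no_grad():
    probs = torch.stack([m(X).softmax(-1) for m in members]).mean(0)
```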
Abstract: Neural network interpretability is a vital component of applications across a wide variety of domains. In such settings it is often useful to analyze a network that has already been trained for its specific purpose. In this work, we develop a method to produce explanation masks for pre-trained networks. The mask localizes the aspects of each input that are most important to the original network's prediction. Masks are created by a secondary network whose goal is to produce as small an explanation as possible while still preserving the predictive accuracy of the original network. We demonstrate the applicability of our method for image classification with CNNs, sentiment analysis with RNNs, and chemical property prediction with mixed CNN/RNN architectures.
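A minimal sketch of the masking objective described above: a secondary network produces a per-pixel mask, trained so that the frozen original classifier's prediction is preserved on the masked input while an L1 penalty keeps the mask small. The architectures and the trade-off weight below are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# frozen "pretrained" classifier (random weights stand in for a real one)
classifier = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 10))
for p in classifier.parameters():
    p.requires_grad_(False)

# secondary network: outputs a per-pixel mask in [0, 1]
masker = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 1, 3, padding=1), nn.Sigmoid())

x = torch.randn(8, 3, 32, 32)
with torch.no_grad():
    target = classifier(x).argmax(-1)  # original network's predictions

mask = masker(x)                       # [8, 1, 32, 32], broadcast over channels
masked_logits = classifier(x * mask)

lam = 0.05  # illustrative sparsity/fidelity trade-off weight
loss = F.cross_entropy(masked_logits, target) + lam * mask.abs().mean()
loss.backward()  # updates flow to the masker only; classifier stays frozen
```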
Abstract: In the few-shot scenario, a learner must generalize effectively to unseen classes given only a small support set of labeled examples. While a relatively large amount of research has gone into few-shot learning for image classification, little work has been done on few-shot video classification. In this work, we address the task of few-shot video action recognition with a set of two-stream models. We evaluate the performance of a set of convolutional and recurrent neural network video encoder architectures used in conjunction with three popular metric-based few-shot algorithms. We train and evaluate on a few-shot split of the Kinetics 600 dataset. Our experiments confirm the importance of the two-stream setup and find that prototypical networks and pooled long short-term memory network embeddings give the best performance as the few-shot method and video encoder, respectively. For a 5-shot 5-way task, this setup obtains 84.2% accuracy on the test set and 59.4% on a special "challenge" test set composed of highly confusable classes.
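For reference, the prototypical-network step the abstract identifies as the best-performing few-shot method reduces to the sketch below: average the support embeddings per class to form prototypes, then score queries by negative squared Euclidean distance. The video encoder is abstracted away as precomputed embeddings; all shapes are illustrative.

```python
import torch

def proto_logits(support, support_labels, queries, n_way):
    # support: [N, D] embeddings; queries: [Q, D]
    prototypes = torch.stack([
        support[support_labels == c].mean(0) for c in range(n_way)])  # [n_way, D]
    # negative squared Euclidean distance as class scores
    return -torch.cdist(queries, prototypes).pow(2)  # [Q, n_way]

# toy 5-way 5-shot episode with 64-d clip embeddings
support = torch.randn(25, 64)
labels = torch.arange(5).repeat_interleave(5)  # 5 shots per class
queries = torch.randn(10, 64)
pred = proto_logits(support, labels, queries, n_way=5).argmax(-1)
```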
Abstract: Architectures for sparse hierarchical representation learning have recently been proposed for graph-structured data, but so far they assume the absence of edge features in the graph. We close this gap and propose a method to pool graphs with edge features, inspired by the hierarchical nature of chemistry. In particular, we introduce two types of pooling layers compatible with an edge-feature graph-convolutional architecture and investigate their performance for molecules relevant to drug discovery on two classification and two regression benchmark datasets from MoleculeNet. We find that our models significantly outperform previous benchmarks on three of the datasets and reach state-of-the-art results on the fourth, with pooling improving performance on three of the four tasks, keeping it stable on the fourth, and generally speeding up training.
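The pooling layers themselves are paper-specific, but the kind of edge-feature graph convolution they would sit on top of can be sketched as below: each message is gated by a learned function of its edge's features before being aggregated at the destination node. The gating MLP and all shapes are illustrative assumptions, not the paper's layers.

```python
import torch
import torch.nn as nn

class EdgeConv(nn.Module):
    def __init__(self, node_dim, edge_dim):
        super().__init__()
        self.edge_gate = nn.Sequential(nn.Linear(edge_dim, node_dim), nn.Sigmoid())
        self.lin = nn.Linear(node_dim, node_dim)

    def forward(self, h, edge_index, edge_attr):
        # h: [N, node_dim]; edge_index: [2, E]; edge_attr: [E, edge_dim]
        src, dst = edge_index
        msgs = self.edge_gate(edge_attr) * self.lin(h[src])  # edge-gated messages
        out = torch.zeros_like(h)
        out.index_add_(0, dst, msgs)  # sum messages at destination nodes
        return torch.relu(out + h)    # residual update

# toy molecule-like graph: 4 atoms, 3 bonds stored in both directions
h = torch.randn(4, 8)
edge_index = torch.tensor([[0, 1, 1, 2, 2, 3],
                           [1, 0, 2, 1, 3, 2]])
edge_attr = torch.randn(6, 4)  # e.g., bond-type features
h = EdgeConv(8, 4)(h, edge_index, edge_attr)
```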
Abstract: Learning high-quality class representations from few examples is a key problem in metric-learning approaches to few-shot learning. To accomplish this, we introduce a novel architecture in which class representations are conditioned on a target image for each few-shot trial. We also deviate from traditional metric-learning approaches by training a network to perform comparisons between classes rather than relying on a static metric. This allows the network to decide which aspects of each class matter for the comparison at hand. We find that this flexible architecture works well in practice, achieving state-of-the-art performance on the Caltech-UCSD Birds fine-grained classification task.
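A minimal sketch of the two ideas, under assumed components: class representations are built by attending over support embeddings conditioned on the query ("target") image, and a small MLP scores each (query, class) pair in place of a fixed metric. All modules and dimensions are illustrative, not the paper's architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

D = 64
relation = nn.Sequential(nn.Linear(2 * D, 64), nn.ReLU(), nn.Linear(64, 1))

def scores(support, labels, query, n_way):
    # support: [N, D] embeddings; query: [D]
    out = []
    for c in range(n_way):
        members = support[labels == c]               # [k, D] class-c supports
        attn = F.softmax(members @ query, dim=0)     # condition on the query
        class_rep = (attn[:, None] * members).sum(0)  # query-specific class rep
        out.append(relation(torch.cat([query, class_rep])))  # learned comparison
    return torch.cat(out)  # [n_way]

support = torch.randn(25, D)
labels = torch.arange(5).repeat_interleave(5)
query = torch.randn(D)
pred = scores(support, labels, query, n_way=5).argmax()
```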
Abstract: Cognitive scientists have shown increasing interest in applying tools from deep learning. One such use is in language acquisition, where it is useful to know whether a linguistic phenomenon can be learned through domain-general means. To assess whether unsupervised deep learning is appropriate, we first pose a smaller question: can unsupervised neural networks apply linguistic rules productively, using them in novel situations? We draw from the literature on determiner/noun productivity by training an unsupervised autoencoder network and measuring its ability to combine nouns with determiners. Our simple autoencoder creates combinations it has not previously encountered and produces a degree of overlap matching that of adults. While this preliminary work does not provide conclusive evidence for productivity, it warrants further investigation with more complex models. Furthermore, this work helps lay the foundation for future collaboration between the deep learning and cognitive science communities.
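The overlap statistic mentioned above comes from the determiner/noun productivity literature; one common form is the fraction of nouns that occur with both "a" and "the", out of nouns that occur with either. The sketch below computes it for a toy list of (determiner, noun) pairs standing in for the autoencoder's outputs; the data are purely illustrative, and the exact statistic used in the paper may differ.

```python
from collections import defaultdict

def overlap(pairs):
    # pairs: iterable of (determiner, noun) productions
    dets_by_noun = defaultdict(set)
    for det, noun in pairs:
        if det in ("a", "the"):
            dets_by_noun[noun].add(det)
    both = sum(1 for dets in dets_by_noun.values() if len(dets) == 2)
    return both / len(dets_by_noun) if dets_by_noun else 0.0

produced = [("a", "dog"), ("the", "dog"), ("the", "ball"), ("a", "cat")]
print(overlap(produced))  # 1 of 3 nouns occurs with both determiners -> 0.33
```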