Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kristin Branson

The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Jul 21, 2022

Jennifer J. Sun, Andrew Ulmer, Dipam Chakraborty, Brian Geuther, Edward Hayes, Heng Jia, Vivek Kumar, Zachary Partridge, Alice Robie, Catherine E. Schretter(+7 more)

Figure 1 for The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Figure 2 for The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Figure 3 for The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Figure 4 for The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

Abstract:Real-world behavior is often shaped by complex interactions between multiple agents. To scalably study multi-agent behavior, advances in unsupervised and self-supervised learning have enabled a variety of different behavioral representations to be learned from trajectory data. To date, there does not exist a unified set of benchmarks that can enable comparing methods quantitatively and systematically across a broad set of behavior analysis settings. We aim to address this by introducing a large-scale, multi-agent trajectory dataset from real-world behavioral neuroscience experiments that covers a range of behavior analysis tasks. Our dataset consists of trajectory data from common model organisms, with 9.6 million frames of mouse data and 4.4 million frames of fly data, in a variety of experimental settings, such as different strains, lengths of interaction, and optogenetic stimulation. A subset of the frames also consist of expert-annotated behavior labels. Improvements on our dataset corresponds to behavioral representations that work across multiple organisms and is able to capture differences for common behavior analysis tasks.

* Project website: https://sites.google.com/view/computational-behavior/our-datasets/mabe2022-dataset

Via

Access Paper or Ask Questions

Evaluation metrics for behaviour modeling

Jul 23, 2020

Daniel Jiwoong Im, Iljung Kwak, Kristin Branson

Figure 1 for Evaluation metrics for behaviour modeling

Figure 2 for Evaluation metrics for behaviour modeling

Figure 3 for Evaluation metrics for behaviour modeling

Figure 4 for Evaluation metrics for behaviour modeling

Abstract:A primary difficulty with unsupervised discovery of structure in large data sets is a lack of quantitative evaluation criteria. In this work, we propose and investigate several metrics for evaluating and comparing generative models of behavior learned using imitation learning. Compared to the commonly-used model log-likelihood, these criteria look at longer temporal relationships in behavior, are relevant if behavior has some properties that are inherently unpredictable, and highlight biases in the overall distribution of behaviors produced by the model. Pointwise metrics compare real to model-predicted trajectories given true past information. Distribution metrics compare statistics of the model simulating behavior in open loop, and are inspired by how experimental biologists evaluate the effects of manipulations on animal behavior. We show that the proposed metrics correspond with biologists' intuitions about behavior, and allow us to evaluate models, understand their biases, and enable us to propose new research directions.

* 17 pages

Via

Access Paper or Ask Questions

Are skip connections necessary for biologically plausible learning rules?

Dec 04, 2019

Daniel Jiwoong Im, Rutuja Patil, Kristin Branson

Figure 1 for Are skip connections necessary for biologically plausible learning rules?

Figure 2 for Are skip connections necessary for biologically plausible learning rules?

Abstract:Backpropagation is the workhorse of deep learning, however, several other biologically-motivated learning rules have been introduced, such as random feedback alignment and difference target propagation. None of these methods have produced a competitive performance against backpropagation. In this paper, we show that biologically-motivated learning rules with skip connections between intermediate layers can perform as well as backpropagation on the MNIST dataset and are robust to various sets of hyper-parameters.

Via

Access Paper or Ask Questions

Detecting the Starting Frame of Actions in Video

Jun 07, 2019

Iljung S. Kwak, David Kriegman, Kristin Branson

Figure 1 for Detecting the Starting Frame of Actions in Video

Figure 2 for Detecting the Starting Frame of Actions in Video

Figure 3 for Detecting the Starting Frame of Actions in Video

Figure 4 for Detecting the Starting Frame of Actions in Video

Abstract:To understand causal relationships between events in the world, it is useful to pinpoint when actions occur in videos and to examine the state of the world at and around that time point. For example, one must accurately detect the start of an audience response -- laughter in a movie, cheering at a sporting event -- to understand the cause of the reaction. In this work, we focus on the problem of accurately detecting action starts rather than isolated events or action ends. We introduce a novel structured loss function based on matching predictions to true action starts that is tailored to this problem; it more heavily penalizes extra and missed action start detections over small misalignments. Recurrent neural networks are used to minimize a differentiable approximation of this loss. To evaluate these methods, we introduce the Mouse Reach Dataset, a large, annotated video dataset of mice performing a sequence of actions. The dataset was labeled by experts for the purpose of neuroscience research on causally relating neural activity to behavior. On this dataset, we demonstrate that the structured loss leads to significantly higher accuracy than a baseline of mean-squared error loss.

Via

Access Paper or Ask Questions

Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

Jun 07, 2019

Daniel Jiwoong Im, Sridhama Prakhya, Jinyao Yan, Srinivas Turaga, Kristin Branson

Figure 1 for Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

Figure 2 for Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

Figure 3 for Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

Figure 4 for Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

Abstract:The Importance Weighted Auto Encoder (IWAE) objective has been shown to improve the training of generative models over the standard Variational Auto Encoder (VAE) objective. Here, we derive importance weighted extensions to AVB and AAE. These latent variable models use implicitly defined inference networks whose approximate posterior density q_\phi(z|x) cannot be directly evaluated, an essential ingredient for importance weighting. We show improved training and inference in latent variable models with our adversarially trained importance weighting method, and derive new theoretical connections between adversarial generative model training criteria and marginal likelihood based methods. We apply these methods to the important problem of inferring spiking neural activity from calcium imaging data, a challenging posterior inference problem in neuroscience, and show that posterior samples from the adversarial methods outperform factorized posteriors used in VAEs.

Via

Access Paper or Ask Questions

Stochastic Neighbor Embedding under f-divergences

Nov 03, 2018

Daniel Jiwoong Im, Nakul Verma, Kristin Branson

Figure 1 for Stochastic Neighbor Embedding under f-divergences

Figure 2 for Stochastic Neighbor Embedding under f-divergences

Figure 3 for Stochastic Neighbor Embedding under f-divergences

Figure 4 for Stochastic Neighbor Embedding under f-divergences

Abstract:The t-distributed Stochastic Neighbor Embedding (t-SNE) is a powerful and popular method for visualizing high-dimensional data. It minimizes the Kullback-Leibler (KL) divergence between the original and embedded data distributions. In this work, we propose extending this method to other f-divergences. We analytically and empirically evaluate the types of latent structure-manifold, cluster, and hierarchical-that are well-captured using both the original KL-divergence as well as the proposed f-divergence generalization, and find that different divergences perform better for different types of structure. A common concern with $t$-SNE criterion is that it is optimized using gradient descent, and can become stuck in poor local minima. We propose optimizing the f-divergence based loss criteria by minimizing a variational bound. This typically performs better than optimizing the primal form, and our experiments show that it can improve upon the embedding results obtained from the original $t$-SNE criterion as well.

Via

Access Paper or Ask Questions

Quantitatively Evaluating GANs With Divergences Proposed for Training

Apr 28, 2018

Daniel Jiwoong Im, He Ma, Graham Taylor, Kristin Branson

Figure 1 for Quantitatively Evaluating GANs With Divergences Proposed for Training

Figure 2 for Quantitatively Evaluating GANs With Divergences Proposed for Training

Figure 3 for Quantitatively Evaluating GANs With Divergences Proposed for Training

Figure 4 for Quantitatively Evaluating GANs With Divergences Proposed for Training

Abstract:Generative adversarial networks (GANs) have been extremely effective in approximating complex distributions of high-dimensional, input data samples, and substantial progress has been made in understanding and improving GAN performance in terms of both theory and application. However, we currently lack quantitative methods for model assessment. Because of this, while many GAN variants are being proposed, we have relatively little understanding of their relative abilities. In this paper, we evaluate the performance of various types of GANs using divergence and distance functions typically used only for training. We observe consistency across the various proposed metrics and, interestingly, the test-time metrics do not favour networks that use the same training-time criterion. We also compare the proposed metrics to human perceptual scores.

* ICLR 2018

Via

Access Paper or Ask Questions

An empirical analysis of the optimization of deep network loss surfaces

Dec 07, 2017

Daniel Jiwoong Im, Michael Tao, Kristin Branson

Figure 1 for An empirical analysis of the optimization of deep network loss surfaces

Figure 2 for An empirical analysis of the optimization of deep network loss surfaces

Figure 3 for An empirical analysis of the optimization of deep network loss surfaces

Figure 4 for An empirical analysis of the optimization of deep network loss surfaces

Abstract:The success of deep neural networks hinges on our ability to accurately and efficiently optimize high-dimensional, non-convex functions. In this paper, we empirically investigate the loss functions of state-of-the-art networks, and how commonly-used stochastic gradient descent variants optimize these loss functions. To do this, we visualize the loss function by projecting them down to low-dimensional spaces chosen based on the convergence points of different optimization algorithms. Our observations suggest that optimization algorithms encounter and choose different descent directions at many saddle points to find different final weights. Based on consistency we observe across re-runs of the same stochastic optimization algorithm, we hypothesize that each optimization algorithm makes characteristic choices at these saddle points.

Via

Access Paper or Ask Questions

Network-size independent covering number bounds for deep networks

Nov 08, 2017

Mayank Kabra, Kristin Branson

Abstract:We give a covering number bound for deep learning networks that is independent of the size of the network. The key for the simple analysis is that for linear classifiers, rotating the data doesn't affect the covering number. Thus, we can ignore the rotation part of each layer's linear transformation, and get the covering number bound by concentrating on the scaling part.

* We found a possible error in our analysis. We are re-evaluating, and may resubmit

Via

Access Paper or Ask Questions

Learning recurrent representations for hierarchical behavior modeling

Nov 15, 2016

Eyrun Eyjolfsdottir, Kristin Branson, Yisong Yue, Pietro Perona

Figure 1 for Learning recurrent representations for hierarchical behavior modeling

Figure 2 for Learning recurrent representations for hierarchical behavior modeling

Figure 3 for Learning recurrent representations for hierarchical behavior modeling

Figure 4 for Learning recurrent representations for hierarchical behavior modeling

Abstract:We propose a framework for detecting action patterns from motion sequences and modeling the sensory-motor relationship of animals, using a generative recurrent neural network. The network has a discriminative part (classifying actions) and a generative part (predicting motion), whose recurrent cells are laterally connected, allowing higher levels of the network to represent high level phenomena. We test our framework on two types of data, fruit fly behavior and online handwriting. Our results show that 1) taking advantage of unlabeled sequences, by predicting future motion, significantly improves action detection performance when training labels are scarce, 2) the network learns to represent high level phenomena such as writer identity and fly gender, without supervision, and 3) simulated motion trajectories, generated by treating motion prediction as input to the network, look realistic and may be used to qualitatively evaluate whether the model has learnt generative control rules.

Via

Access Paper or Ask Questions