Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ta-Chu Kao

Minimum Description Length Control

Jul 24, 2022

Ted Moskovitz, Ta-Chu Kao, Maneesh Sahani, Matthew M. Botvinick

Figure 1 for Minimum Description Length Control

Figure 2 for Minimum Description Length Control

Figure 3 for Minimum Description Length Control

Figure 4 for Minimum Description Length Control

Abstract:We propose a novel framework for multitask reinforcement learning based on the minimum description length (MDL) principle. In this approach, which we term MDL-control (MDL-C), the agent learns the common structure among the tasks with which it is faced and then distills it into a simpler representation which facilitates faster convergence and generalization to new tasks. In doing so, MDL-C naturally balances adaptation to each task with epistemic uncertainty about the task distribution. We motivate MDL-C via formal connections between the MDL principle and Bayesian inference, derive theoretical performance guarantees, and demonstrate MDL-C's empirical effectiveness on both discrete and high-dimensional continuous control tasks.

Via

Access Paper or Ask Questions

Natural continual learning: success is a journey, not (just) a destination

Jun 15, 2021

Ta-Chu Kao, Kristopher T. Jensen, Alberto Bernacchia, Guillaume Hennequin

Figure 1 for Natural continual learning: success is a journey, not (just) a destination

Figure 2 for Natural continual learning: success is a journey, not (just) a destination

Figure 3 for Natural continual learning: success is a journey, not (just) a destination

Figure 4 for Natural continual learning: success is a journey, not (just) a destination

Abstract:Biological agents are known to learn many different tasks over the course of their lives, and to be able to revisit previous tasks and behaviors with little to no loss in performance. In contrast, artificial agents are prone to 'catastrophic forgetting' whereby performance on previous tasks deteriorates rapidly as new ones are acquired. This shortcoming has recently been addressed using methods that encourage parameters to stay close to those used for previous tasks. This can be done by (i) using specific parameter regularizers that map out suitable destinations in parameter space, or (ii) guiding the optimization journey by projecting gradients into subspaces that do not interfere with previous tasks. However, parameter regularization has been shown to be relatively ineffective in recurrent neural networks (RNNs), a setting relevant to the study of neural dynamics supporting biological continual learning. Similarly, projection based methods can reach capacity and fail to learn any further as the number of tasks increases. To address these limitations, we propose Natural Continual Learning (NCL), a new method that unifies weight regularization and projected gradient descent. NCL uses Bayesian weight regularization to encourage good performance on all tasks at convergence and combines this with gradient projections designed to prevent catastrophic forgetting during optimization. NCL formalizes gradient projection as a trust region algorithm based on the Fisher information metric, and achieves scalability via a novel Kronecker-factored approximation strategy. Our method outperforms both standard weight regularization techniques and projection based approaches when applied to continual learning problems in RNNs. The trained networks evolve task-specific dynamics that are strongly preserved as new tasks are learned, similar to experimental findings in biological circuits.

Via

Access Paper or Ask Questions

Automatic differentiation of Sylvester, Lyapunov, and algebraic Riccati equations

Nov 24, 2020

Ta-Chu Kao, Guillaume Hennequin

Figure 1 for Automatic differentiation of Sylvester, Lyapunov, and algebraic Riccati equations

Figure 2 for Automatic differentiation of Sylvester, Lyapunov, and algebraic Riccati equations

Figure 3 for Automatic differentiation of Sylvester, Lyapunov, and algebraic Riccati equations

Abstract:Sylvester, Lyapunov, and algebraic Riccati equations are the bread and butter of control theorists. They are used to compute infinite-horizon Gramians, solve optimal control problems in continuous or discrete time, and design observers. While popular numerical computing frameworks (e.g., scipy) provide efficient solvers for these equations, these solvers are still largely missing from most automatic differentiation libraries. Here, we derive the forward and reverse-mode derivatives of the solutions to all three types of equations, and showcase their application on an inverse control problem.

Via

Access Paper or Ask Questions

Manifold GPLVMs for discovering non-Euclidean latent structure in neural data

Jun 12, 2020

Kristopher T. Jensen, Ta-Chu Kao, Marco Tripodi, Guillaume Hennequin

Figure 1 for Manifold GPLVMs for discovering non-Euclidean latent structure in neural data

Figure 2 for Manifold GPLVMs for discovering non-Euclidean latent structure in neural data

Figure 3 for Manifold GPLVMs for discovering non-Euclidean latent structure in neural data

Figure 4 for Manifold GPLVMs for discovering non-Euclidean latent structure in neural data

Abstract:A common problem in neuroscience is to elucidate the collective neural representations of behaviorally important variables such as head direction, spatial location, upcoming movements, or mental spatial transformations. Often, these latent variables are internal constructs not directly accessible to the experimenter. Here, we propose a new probabilistic latent variable model to simultaneously identify the latent state and the way each neuron contributes to its representation in an unsupervised way. In contrast to previous models which assume Euclidean latent spaces, we embrace the fact that latent states often belong to symmetric manifolds such as spheres, tori, or rotation groups of various dimensions. We therefore propose the manifold Gaussian process latent variable model (mGPLVM), where neural responses arise from (i) a shared latent variable living on a specific manifold, and (ii) a set of non-parametric tuning curves determining how each neuron contributes to the representation. Cross-validated comparisons of models with different topologies can be used to distinguish between candidate manifolds, and variational inference enables quantification of uncertainty. We demonstrate the validity of the approach on several synthetic datasets and on calcium recordings from the ellipsoid body of Drosophila melanogaster. This circuit is known to encode head direction, and mGPLVM correctly recovers the ring topology expected from a neural population representing a single angular variable.

Via

Access Paper or Ask Questions