Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tom Veniat

Efficient Continual Learning with Modular Networks and Task-Driven Priors

Dec 23, 2020

Tom Veniat, Ludovic Denoyer, Marc'Aurelio Ranzato

Figure 1 for Efficient Continual Learning with Modular Networks and Task-Driven Priors

Figure 2 for Efficient Continual Learning with Modular Networks and Task-Driven Priors

Figure 3 for Efficient Continual Learning with Modular Networks and Task-Driven Priors

Figure 4 for Efficient Continual Learning with Modular Networks and Task-Driven Priors

Abstract:Existing literature in Continual Learning (CL) has focused on overcoming catastrophic forgetting, the inability of the learner to recall how to perform tasks observed in the past. There are however other desirable properties of a CL system, such as the ability to transfer knowledge from previous tasks and to scale memory and compute sub-linearly with the number of tasks. Since most current benchmarks focus only on forgetting using short streams of tasks, we first propose a new suite of benchmarks to probe CL algorithms across these new axes. Finally, we introduce a new modular architecture, whose modules represent atomic skills that can be composed to perform a certain task. Learning a task reduces to figuring out which past modules to re-use, and which new modules to instantiate to solve the current task. Our learning algorithm leverages a task-driven prior over the exponential search space of all possible ways to combine modules, enabling efficient learning on long streams of tasks. Our experiments show that this modular architecture and learning algorithm perform competitively on widely used CL benchmarks while yielding superior performance on the more challenging benchmarks we introduce in this work.

Via

Access Paper or Ask Questions

Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

May 22, 2018

Tom Veniat, Ludovic Denoyer

Figure 1 for Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

Figure 2 for Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

Figure 3 for Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

Figure 4 for Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks

Abstract:We propose to focus on the problem of discovering neural network architectures efficient in terms of both prediction quality and cost. For instance, our approach is able to solve the following tasks: learn a neural network able to predict well in less than 100 milliseconds or learn an efficient model that fits in a 50 Mb memory. Our contribution is a novel family of models called Budgeted Super Networks (BSN). They are learned using gradient descent techniques applied on a budgeted learning objective function which integrates a maximum authorized cost, while making no assumption on the nature of this cost. We present a set of experiments on computer vision problems and analyze the ability of our technique to deal with three different costs: the computation cost, the memory consumption cost and a distributed computation cost. We particularly show that our model can discover neural network architectures that have a better accuracy than the ResNet and Convolutional Neural Fabrics architectures on CIFAR-10 and CIFAR-100, at a lower cost.

* 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Via

Access Paper or Ask Questions