Andrei A. Rusu

DiPaCo: Distributed Path Composition

Mar 15, 2024

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Nov 15, 2022

Continual Unsupervised Representation Learning

Oct 31, 2019

Meta-Learning with Warped Gradient Descent

Aug 30, 2019

Task Agnostic Continual Learning via Meta Learning

Jun 12, 2019

Meta-Learning with Latent Embedding Optimization

Sep 28, 2018

Meta-Learning by the Baldwin Effect

Jun 22, 2018

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Jun 06, 2018