Picture for Andrey Zhmoginov

Andrey Zhmoginov

Learning and Unlearning of Fabricated Knowledge in Language Models

Add code
Oct 29, 2024
Viaarxiv icon

MELODI: Exploring Memory Compression for Long Contexts

Add code
Oct 04, 2024
Viaarxiv icon

Narrowing the Focus: Learned Optimizers for Pretrained Models

Add code
Aug 21, 2024
Viaarxiv icon

Continual Few-Shot Learning Using HyperTransformers

Add code
Jan 12, 2023
Viaarxiv icon

Training trajectories, mini-batch losses and the curious role of the learning rate

Add code
Jan 05, 2023
Viaarxiv icon

Transformers learn in-context by gradient descent

Add code
Dec 15, 2022
Viaarxiv icon

Decentralized Learning with Multi-Headed Distillation

Add code
Nov 28, 2022
Viaarxiv icon

Fine-tuning Image Transformers using Learnable Memory

Add code
Mar 30, 2022
Figure 1 for Fine-tuning Image Transformers using Learnable Memory
Figure 2 for Fine-tuning Image Transformers using Learnable Memory
Figure 3 for Fine-tuning Image Transformers using Learnable Memory
Figure 4 for Fine-tuning Image Transformers using Learnable Memory
Viaarxiv icon

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

Add code
Jan 15, 2022
Figure 1 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 2 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 3 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 4 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Viaarxiv icon

Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks

Add code
Jul 23, 2021
Figure 1 for Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks
Figure 2 for Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks
Figure 3 for Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks
Figure 4 for Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks
Viaarxiv icon