Picture for Max Vladymyrov

Max Vladymyrov

UC Merced

Learning and Unlearning of Fabricated Knowledge in Language Models

Add code
Oct 29, 2024
Viaarxiv icon

Narrowing the Focus: Learned Optimizers for Pretrained Models

Add code
Aug 21, 2024
Viaarxiv icon

Linear Transformers are Versatile In-Context Learners

Add code
Feb 21, 2024
Viaarxiv icon

Uncovering mesa-optimization algorithms in Transformers

Add code
Sep 11, 2023
Figure 1 for Uncovering mesa-optimization algorithms in Transformers
Figure 2 for Uncovering mesa-optimization algorithms in Transformers
Figure 3 for Uncovering mesa-optimization algorithms in Transformers
Figure 4 for Uncovering mesa-optimization algorithms in Transformers
Viaarxiv icon

Continual Few-Shot Learning Using HyperTransformers

Add code
Jan 12, 2023
Viaarxiv icon

Training trajectories, mini-batch losses and the curious role of the learning rate

Add code
Jan 05, 2023
Viaarxiv icon

Transformers learn in-context by gradient descent

Add code
Dec 15, 2022
Viaarxiv icon

Decentralized Learning with Multi-Headed Distillation

Add code
Nov 28, 2022
Viaarxiv icon

Fine-tuning Image Transformers using Learnable Memory

Add code
Mar 30, 2022
Figure 1 for Fine-tuning Image Transformers using Learnable Memory
Figure 2 for Fine-tuning Image Transformers using Learnable Memory
Figure 3 for Fine-tuning Image Transformers using Learnable Memory
Figure 4 for Fine-tuning Image Transformers using Learnable Memory
Viaarxiv icon

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

Add code
Jan 15, 2022
Figure 1 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 2 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 3 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Figure 4 for HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
Viaarxiv icon