Picture for Michael Matena

Michael Matena

NPEFF: Non-Negative Per-Example Fisher Factorization

Add code
Oct 07, 2023
Viaarxiv icon

A Combinatorial Perspective on the Optimization of Shallow ReLU Networks

Add code
Oct 01, 2022
Figure 1 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 2 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 3 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Figure 4 for A Combinatorial Perspective on the Optimization of Shallow ReLU Networks
Viaarxiv icon

Merging Models with Fisher-Weighted Averaging

Add code
Nov 18, 2021
Figure 1 for Merging Models with Fisher-Weighted Averaging
Figure 2 for Merging Models with Fisher-Weighted Averaging
Figure 3 for Merging Models with Fisher-Weighted Averaging
Figure 4 for Merging Models with Fisher-Weighted Averaging
Viaarxiv icon

Do Transformer Modifications Transfer Across Implementations and Applications?

Add code
Feb 23, 2021
Figure 1 for Do Transformer Modifications Transfer Across Implementations and Applications?
Figure 2 for Do Transformer Modifications Transfer Across Implementations and Applications?
Figure 3 for Do Transformer Modifications Transfer Across Implementations and Applications?
Viaarxiv icon

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Add code
Oct 24, 2019
Figure 1 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 2 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 3 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Figure 4 for Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Viaarxiv icon