Jared Davis

Debiasing a First-order Heuristic for Approximate Bi-level Optimization

Jun 08, 2021
Via arXiv

Sub-Linear Memory: How to Make Performers SLiM

Dec 21, 2020
Via arXiv

Rethinking Attention with Performers

Sep 30, 2020
Via arXiv

UFO-BLO: Unbiased First-Order Bilevel Optimization

Jun 05, 2020
Via arXiv

Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers

Jun 05, 2020
Via arXiv

CWY Parametrization for Scalable Learning of Orthogonal and Stiefel Matrices

Apr 18, 2020
Via arXiv

Stochastic Flows and Geometric Optimization on the Orthogonal Group

Mar 30, 2020
Via arXiv