Picture for Utku Evci

Utku Evci

Towards Optimal Adapter Placement for Efficient Transfer Learning

Add code
Oct 21, 2024
Viaarxiv icon

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

Add code
Feb 07, 2024
Figure 1 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 2 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 3 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 4 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Viaarxiv icon

Scaling Laws for Sparsely-Connected Foundation Models

Add code
Sep 15, 2023
Viaarxiv icon

Dynamic Sparse Training with Structured Sparsity

Add code
May 03, 2023
Figure 1 for Dynamic Sparse Training with Structured Sparsity
Figure 2 for Dynamic Sparse Training with Structured Sparsity
Figure 3 for Dynamic Sparse Training with Structured Sparsity
Figure 4 for Dynamic Sparse Training with Structured Sparsity
Viaarxiv icon

JaxPruner: A concise library for sparsity research

Add code
May 02, 2023
Viaarxiv icon

The Dormant Neuron Phenomenon in Deep Reinforcement Learning

Add code
Feb 24, 2023
Viaarxiv icon

Scaling Vision Transformers to 22 Billion Parameters

Add code
Feb 10, 2023
Viaarxiv icon

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

Add code
Sep 15, 2022
Figure 1 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 2 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 3 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 4 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Viaarxiv icon

The State of Sparse Training in Deep Reinforcement Learning

Add code
Jun 17, 2022
Figure 1 for The State of Sparse Training in Deep Reinforcement Learning
Figure 2 for The State of Sparse Training in Deep Reinforcement Learning
Figure 3 for The State of Sparse Training in Deep Reinforcement Learning
Figure 4 for The State of Sparse Training in Deep Reinforcement Learning
Viaarxiv icon

GradMax: Growing Neural Networks using Gradient Information

Add code
Jan 13, 2022
Figure 1 for GradMax: Growing Neural Networks using Gradient Information
Figure 2 for GradMax: Growing Neural Networks using Gradient Information
Figure 3 for GradMax: Growing Neural Networks using Gradient Information
Figure 4 for GradMax: Growing Neural Networks using Gradient Information
Viaarxiv icon