Picture for Greg Heinrich

Greg Heinrich

PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation

Add code
Oct 02, 2024
Viaarxiv icon

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Add code
Sep 26, 2024
Viaarxiv icon

A deeper look at depth pruning of LLMs

Add code
Jul 23, 2024
Figure 1 for A deeper look at depth pruning of LLMs
Figure 2 for A deeper look at depth pruning of LLMs
Figure 3 for A deeper look at depth pruning of LLMs
Figure 4 for A deeper look at depth pruning of LLMs
Viaarxiv icon

Flextron: Many-in-One Flexible Large Language Model

Add code
Jun 11, 2024
Figure 1 for Flextron: Many-in-One Flexible Large Language Model
Figure 2 for Flextron: Many-in-One Flexible Large Language Model
Figure 3 for Flextron: Many-in-One Flexible Large Language Model
Figure 4 for Flextron: Many-in-One Flexible Large Language Model
Viaarxiv icon

AM-RADIO: Agglomerative Model -- Reduce All Domains Into One

Add code
Dec 21, 2023
Viaarxiv icon

FasterViT: Fast Vision Transformers with Hierarchical Attention

Add code
Jun 09, 2023
Viaarxiv icon

Metaoptimization on a Distributed System for Deep Reinforcement Learning

Add code
Feb 07, 2019
Figure 1 for Metaoptimization on a Distributed System for Deep Reinforcement Learning
Figure 2 for Metaoptimization on a Distributed System for Deep Reinforcement Learning
Figure 3 for Metaoptimization on a Distributed System for Deep Reinforcement Learning
Figure 4 for Metaoptimization on a Distributed System for Deep Reinforcement Learning
Viaarxiv icon