Picture for Wonpyo Park

Wonpyo Park

Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization

Add code
Jun 21, 2024
Viaarxiv icon

Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization

Add code
Jun 17, 2024
Viaarxiv icon

JaxPruner: A concise library for sparsity research

Add code
May 02, 2023
Figure 1 for JaxPruner: A concise library for sparsity research
Figure 2 for JaxPruner: A concise library for sparsity research
Figure 3 for JaxPruner: A concise library for sparsity research
Viaarxiv icon

Graph Self-Attention for learning graph representation with Transformer

Add code
Jan 30, 2022
Figure 1 for Graph Self-Attention for learning graph representation with Transformer
Figure 2 for Graph Self-Attention for learning graph representation with Transformer
Figure 3 for Graph Self-Attention for learning graph representation with Transformer
Figure 4 for Graph Self-Attention for learning graph representation with Transformer
Viaarxiv icon

Multi-level Distance Regularization for Deep Metric Learning

Add code
Feb 08, 2021
Figure 1 for Multi-level Distance Regularization for Deep Metric Learning
Figure 2 for Multi-level Distance Regularization for Deep Metric Learning
Figure 3 for Multi-level Distance Regularization for Deep Metric Learning
Figure 4 for Multi-level Distance Regularization for Deep Metric Learning
Viaarxiv icon

Diversified Mutual Learning for Deep Metric Learning

Add code
Sep 09, 2020
Figure 1 for Diversified Mutual Learning for Deep Metric Learning
Figure 2 for Diversified Mutual Learning for Deep Metric Learning
Figure 3 for Diversified Mutual Learning for Deep Metric Learning
Figure 4 for Diversified Mutual Learning for Deep Metric Learning
Viaarxiv icon

BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition

Add code
Aug 15, 2020
Figure 1 for BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Figure 2 for BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Figure 3 for BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Figure 4 for BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Viaarxiv icon

GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition

Add code
May 25, 2020
Figure 1 for GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition
Figure 2 for GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition
Figure 3 for GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition
Figure 4 for GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition
Viaarxiv icon

Regularizing Neural Networks via Stochastic Branch Layers

Add code
Oct 03, 2019
Figure 1 for Regularizing Neural Networks via Stochastic Branch Layers
Figure 2 for Regularizing Neural Networks via Stochastic Branch Layers
Figure 3 for Regularizing Neural Networks via Stochastic Branch Layers
Figure 4 for Regularizing Neural Networks via Stochastic Branch Layers
Viaarxiv icon

Relational Knowledge Distillation

Add code
May 01, 2019
Figure 1 for Relational Knowledge Distillation
Figure 2 for Relational Knowledge Distillation
Figure 3 for Relational Knowledge Distillation
Figure 4 for Relational Knowledge Distillation
Viaarxiv icon