Picture for Hanlin Tang

Hanlin Tang

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Add code
Jul 22, 2024
Viaarxiv icon

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Add code
Mar 05, 2024
Figure 1 for EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Figure 2 for EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Figure 3 for EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Figure 4 for EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Viaarxiv icon

MKQ-BERT: Quantized BERT with 4-bits Weights and Activations

Add code
Mar 25, 2022
Figure 1 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Figure 2 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Figure 3 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Viaarxiv icon

PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic

Add code
Aug 20, 2021
Figure 1 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 2 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 3 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 4 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Viaarxiv icon

On the geometry of generalization and memorization in deep neural networks

Add code
May 30, 2021
Figure 1 for On the geometry of generalization and memorization in deep neural networks
Figure 2 for On the geometry of generalization and memorization in deep neural networks
Figure 3 for On the geometry of generalization and memorization in deep neural networks
Figure 4 for On the geometry of generalization and memorization in deep neural networks
Viaarxiv icon

Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models

Add code
Apr 15, 2021
Figure 1 for Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models
Figure 2 for Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models
Figure 3 for Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models
Figure 4 for Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models
Viaarxiv icon

1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed

Add code
Apr 13, 2021
Figure 1 for 1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed
Figure 2 for 1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed
Figure 3 for 1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed
Figure 4 for 1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed
Viaarxiv icon

1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed

Add code
Feb 04, 2021
Figure 1 for 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
Figure 2 for 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
Figure 3 for 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
Figure 4 for 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
Viaarxiv icon

APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm

Add code
Aug 28, 2020
Figure 1 for APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Figure 2 for APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Figure 3 for APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Figure 4 for APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Viaarxiv icon

Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning

Add code
Jul 14, 2020
Figure 1 for Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Figure 2 for Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Figure 3 for Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Figure 4 for Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Viaarxiv icon