Ella Charlaix

KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation

Sep 13, 2021

Block Pruning For Faster Transformers

Sep 10, 2021

Fully Quantized Transformer for Improved Translation

Nov 23, 2019