Anastasia Dietrich

Towards Structured Dynamic Sparse Pre-Training of BERT

Aug 13, 2021
Via arXiv

GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures

Jun 10, 2021
Via arXiv