Picture for Alexandros Koliousis

Alexandros Koliousis

GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures

Add code
Jun 10, 2021
Figure 1 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 2 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 3 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Figure 4 for GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Viaarxiv icon

CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers

Add code
Jan 08, 2019
Figure 1 for CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
Figure 2 for CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
Figure 3 for CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
Figure 4 for CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
Viaarxiv icon