Picture for Joanna Yoo

Joanna Yoo

Scalable Training of Language Models using JAX pjit and TPUv4

Add code
Apr 13, 2022
Figure 1 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 2 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 3 for Scalable Training of Language Models using JAX pjit and TPUv4
Figure 4 for Scalable Training of Language Models using JAX pjit and TPUv4
Viaarxiv icon

SliceOut: Training Transformers and CNNs faster while using less memory

Add code
Jul 21, 2020
Figure 1 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 2 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 3 for SliceOut: Training Transformers and CNNs faster while using less memory
Figure 4 for SliceOut: Training Transformers and CNNs faster while using less memory
Viaarxiv icon