Siddhartha Rao Kamalakara

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Nov 19, 2024
Via arXiv

Exploring Low Rank Training of Deep Neural Networks

Sep 27, 2022
Via arXiv

Scalable Training of Language Models using JAX pjit and TPUv4

Apr 13, 2022
Via arXiv