Picture for Abhinav Bhatele

Abhinav Bhatele

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers

Add code
Feb 12, 2025
Viaarxiv icon

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Add code
Feb 07, 2025
Viaarxiv icon

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Add code
Feb 07, 2025
Viaarxiv icon

HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages

Add code
Dec 19, 2024
Viaarxiv icon

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Add code
Jun 14, 2024
Figure 1 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 2 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 3 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 4 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Viaarxiv icon

From Pixels to Prose: A Large Dataset of Dense Image Captions

Add code
Jun 14, 2024
Figure 1 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 2 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 3 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 4 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Viaarxiv icon

Loki: Low-Rank Keys for Efficient Sparse Attention

Add code
Jun 04, 2024
Figure 1 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 2 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 3 for Loki: Low-Rank Keys for Efficient Sparse Attention
Figure 4 for Loki: Low-Rank Keys for Efficient Sparse Attention
Viaarxiv icon

Transformers Can Do Arithmetic with the Right Embeddings

Add code
May 27, 2024
Figure 1 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 2 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 3 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 4 for Transformers Can Do Arithmetic with the Right Embeddings
Viaarxiv icon

Performance-Aligned LLMs for Generating Fast Code

Add code
Apr 29, 2024
Viaarxiv icon

Can Large Language Models Write Parallel Code?

Add code
Jan 23, 2024
Viaarxiv icon