Picture for John Kirchenbauer

John Kirchenbauer

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers

Add code
Feb 12, 2025
Viaarxiv icon

Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs

Add code
Feb 10, 2025
Figure 1 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 2 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 3 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 4 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Viaarxiv icon

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Add code
Feb 07, 2025
Viaarxiv icon

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Add code
Feb 07, 2025
Viaarxiv icon

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Add code
Jun 14, 2024
Figure 1 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 2 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 3 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 4 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Viaarxiv icon

GenQA: Generating Millions of Instructions from a Handful of Prompts

Add code
Jun 14, 2024
Figure 1 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 2 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 3 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 4 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Viaarxiv icon

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

Transformers Can Do Arithmetic with the Right Embeddings

Add code
May 27, 2024
Figure 1 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 2 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 3 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 4 for Transformers Can Do Arithmetic with the Right Embeddings
Viaarxiv icon

LMD3: Language Model Data Density Dependence

Add code
May 10, 2024
Figure 1 for LMD3: Language Model Data Density Dependence
Figure 2 for LMD3: Language Model Data Density Dependence
Figure 3 for LMD3: Language Model Data Density Dependence
Figure 4 for LMD3: Language Model Data Density Dependence
Viaarxiv icon

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Add code
Oct 10, 2023
Viaarxiv icon