Picture for Michael Carbin

Michael Carbin

Retrieval Capabilities of Large Language Models Scale with Pretraining FLOPs

Add code
Aug 24, 2025
Viaarxiv icon

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Add code
Apr 17, 2025
Viaarxiv icon

Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding

Add code
Feb 17, 2025
Viaarxiv icon

Drowning in Documents: Consequences of Scaling Reranker Inference

Add code
Nov 18, 2024
Figure 1 for Drowning in Documents: Consequences of Scaling Reranker Inference
Figure 2 for Drowning in Documents: Consequences of Scaling Reranker Inference
Figure 3 for Drowning in Documents: Consequences of Scaling Reranker Inference
Figure 4 for Drowning in Documents: Consequences of Scaling Reranker Inference
Viaarxiv icon

Long Context RAG Performance of Large Language Models

Add code
Nov 05, 2024
Figure 1 for Long Context RAG Performance of Large Language Models
Figure 2 for Long Context RAG Performance of Large Language Models
Figure 3 for Long Context RAG Performance of Large Language Models
Viaarxiv icon

Inference Plans for Hybrid Particle Filtering

Add code
Aug 21, 2024
Figure 1 for Inference Plans for Hybrid Particle Filtering
Figure 2 for Inference Plans for Hybrid Particle Filtering
Figure 3 for Inference Plans for Hybrid Particle Filtering
Figure 4 for Inference Plans for Hybrid Particle Filtering
Viaarxiv icon

Learning to Compile Programs to Neural Networks

Add code
Jul 21, 2024
Figure 1 for Learning to Compile Programs to Neural Networks
Figure 2 for Learning to Compile Programs to Neural Networks
Figure 3 for Learning to Compile Programs to Neural Networks
Figure 4 for Learning to Compile Programs to Neural Networks
Viaarxiv icon

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Add code
Mar 27, 2024
Viaarxiv icon

The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning

Add code
Oct 07, 2023
Viaarxiv icon

Turaco: Complexity-Guided Data Sampling for Training Neural Surrogates of Programs

Add code
Sep 21, 2023
Viaarxiv icon