Picture for Ryan Ehrlich

Ryan Ehrlich

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Add code
Jul 31, 2024
Viaarxiv icon

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Add code
Feb 07, 2024
Viaarxiv icon