Ryan Ehrlich

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Jul 31, 2024
Via arXiv

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Feb 07, 2024
Via arXiv