Charlie Snell

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Aug 06, 2024

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Nov 30, 2023

The False Promise of Imitating Proprietary LLMs

May 25, 2023

Learning by Distilling Context

Sep 30, 2022

Active Programming by Example with a Natural Language Prior

May 25, 2022

Context-Aware Language Modeling for Goal-Oriented Dialogue Systems

Apr 22, 2022

Summarizing Differences between Text Distributions with Natural Language

Jan 28, 2022

Approximating How Single Head Attention Learns

Mar 13, 2021