Picture for Charlie Snell

Charlie Snell

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Add code
Jan 08, 2025
Viaarxiv icon

Predicting Emergent Capabilities by Finetuning

Add code
Nov 25, 2024
Figure 1 for Predicting Emergent Capabilities by Finetuning
Figure 2 for Predicting Emergent Capabilities by Finetuning
Figure 3 for Predicting Emergent Capabilities by Finetuning
Figure 4 for Predicting Emergent Capabilities by Finetuning
Viaarxiv icon

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Add code
Aug 06, 2024
Figure 1 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 2 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 3 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 4 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Viaarxiv icon

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Add code
Nov 30, 2023
Figure 1 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 2 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 3 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 4 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Viaarxiv icon

The False Promise of Imitating Proprietary LLMs

Add code
May 25, 2023
Figure 1 for The False Promise of Imitating Proprietary LLMs
Figure 2 for The False Promise of Imitating Proprietary LLMs
Figure 3 for The False Promise of Imitating Proprietary LLMs
Figure 4 for The False Promise of Imitating Proprietary LLMs
Viaarxiv icon

Learning by Distilling Context

Add code
Sep 30, 2022
Figure 1 for Learning by Distilling Context
Figure 2 for Learning by Distilling Context
Figure 3 for Learning by Distilling Context
Figure 4 for Learning by Distilling Context
Viaarxiv icon

Active Programming by Example with a Natural Language Prior

Add code
May 25, 2022
Figure 1 for Active Programming by Example with a Natural Language Prior
Figure 2 for Active Programming by Example with a Natural Language Prior
Figure 3 for Active Programming by Example with a Natural Language Prior
Figure 4 for Active Programming by Example with a Natural Language Prior
Viaarxiv icon

Context-Aware Language Modeling for Goal-Oriented Dialogue Systems

Add code
Apr 22, 2022
Figure 1 for Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Figure 2 for Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Figure 3 for Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Figure 4 for Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Viaarxiv icon

Summarizing Differences between Text Distributions with Natural Language

Add code
Jan 28, 2022
Figure 1 for Summarizing Differences between Text Distributions with Natural Language
Figure 2 for Summarizing Differences between Text Distributions with Natural Language
Figure 3 for Summarizing Differences between Text Distributions with Natural Language
Figure 4 for Summarizing Differences between Text Distributions with Natural Language
Viaarxiv icon

Approximating How Single Head Attention Learns

Add code
Mar 13, 2021
Figure 1 for Approximating How Single Head Attention Learns
Figure 2 for Approximating How Single Head Attention Learns
Figure 3 for Approximating How Single Head Attention Learns
Figure 4 for Approximating How Single Head Attention Learns
Viaarxiv icon