Picture for David Dohan

David Dohan

Shammie

Training Chain-of-Thought via Latent-Variable Inference

Add code
Nov 28, 2023
Viaarxiv icon

Large Language Models Can Be Easily Distracted by Irrelevant Context

Add code
Feb 13, 2023
Viaarxiv icon

Language Model Cascades

Add code
Jul 28, 2022
Figure 1 for Language Model Cascades
Figure 2 for Language Model Cascades
Figure 3 for Language Model Cascades
Figure 4 for Language Model Cascades
Viaarxiv icon

Solving Quantitative Reasoning Problems with Language Models

Add code
Jul 01, 2022
Figure 1 for Solving Quantitative Reasoning Problems with Language Models
Figure 2 for Solving Quantitative Reasoning Problems with Language Models
Figure 3 for Solving Quantitative Reasoning Problems with Language Models
Figure 4 for Solving Quantitative Reasoning Problems with Language Models
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Towards Learning Universal Hyperparameter Optimizers with Transformers

Add code
May 26, 2022
Figure 1 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 2 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 3 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 4 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Viaarxiv icon

PaLM: Scaling Language Modeling with Pathways

Add code
Apr 19, 2022
Figure 1 for PaLM: Scaling Language Modeling with Pathways
Figure 2 for PaLM: Scaling Language Modeling with Pathways
Figure 3 for PaLM: Scaling Language Modeling with Pathways
Figure 4 for PaLM: Scaling Language Modeling with Pathways
Viaarxiv icon

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Add code
Nov 30, 2021
Figure 1 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 2 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 3 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 4 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Viaarxiv icon

Program Synthesis with Large Language Models

Add code
Aug 16, 2021
Figure 1 for Program Synthesis with Large Language Models
Figure 2 for Program Synthesis with Large Language Models
Figure 3 for Program Synthesis with Large Language Models
Figure 4 for Program Synthesis with Large Language Models
Viaarxiv icon

Latent Programmer: Discrete Latent Codes for Program Synthesis

Add code
Dec 01, 2020
Figure 1 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 2 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 3 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 4 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Viaarxiv icon