Picture for Scott Gray

Scott Gray

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Evaluating Large Language Models Trained on Code

Add code
Jul 14, 2021
Figure 1 for Evaluating Large Language Models Trained on Code
Figure 2 for Evaluating Large Language Models Trained on Code
Figure 3 for Evaluating Large Language Models Trained on Code
Figure 4 for Evaluating Large Language Models Trained on Code
Viaarxiv icon

Zero-Shot Text-to-Image Generation

Add code
Feb 26, 2021
Figure 1 for Zero-Shot Text-to-Image Generation
Figure 2 for Zero-Shot Text-to-Image Generation
Figure 3 for Zero-Shot Text-to-Image Generation
Figure 4 for Zero-Shot Text-to-Image Generation
Viaarxiv icon

Scaling Laws for Autoregressive Generative Modeling

Add code
Nov 06, 2020
Figure 1 for Scaling Laws for Autoregressive Generative Modeling
Figure 2 for Scaling Laws for Autoregressive Generative Modeling
Figure 3 for Scaling Laws for Autoregressive Generative Modeling
Figure 4 for Scaling Laws for Autoregressive Generative Modeling
Viaarxiv icon

Language Models are Few-Shot Learners

Add code
Jun 05, 2020
Figure 1 for Language Models are Few-Shot Learners
Figure 2 for Language Models are Few-Shot Learners
Figure 3 for Language Models are Few-Shot Learners
Figure 4 for Language Models are Few-Shot Learners
Viaarxiv icon

Scaling Laws for Neural Language Models

Add code
Jan 23, 2020
Figure 1 for Scaling Laws for Neural Language Models
Figure 2 for Scaling Laws for Neural Language Models
Figure 3 for Scaling Laws for Neural Language Models
Figure 4 for Scaling Laws for Neural Language Models
Viaarxiv icon

Dota 2 with Large Scale Deep Reinforcement Learning

Add code
Dec 13, 2019
Figure 1 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 2 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 3 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 4 for Dota 2 with Large Scale Deep Reinforcement Learning
Viaarxiv icon

Generating Long Sequences with Sparse Transformers

Add code
Apr 23, 2019
Figure 1 for Generating Long Sequences with Sparse Transformers
Figure 2 for Generating Long Sequences with Sparse Transformers
Figure 3 for Generating Long Sequences with Sparse Transformers
Figure 4 for Generating Long Sequences with Sparse Transformers
Viaarxiv icon

Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks

Add code
Dec 02, 2017
Figure 1 for Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Figure 2 for Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Figure 3 for Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Figure 4 for Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Viaarxiv icon

Fast Algorithms for Convolutional Neural Networks

Add code
Nov 10, 2015
Figure 1 for Fast Algorithms for Convolutional Neural Networks
Figure 2 for Fast Algorithms for Convolutional Neural Networks
Figure 3 for Fast Algorithms for Convolutional Neural Networks
Figure 4 for Fast Algorithms for Convolutional Neural Networks
Viaarxiv icon