Picture for Long Ouyang

Long Ouyang

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Self-critiquing models for assisting human evaluators

Add code
Jun 14, 2022
Figure 1 for Self-critiquing models for assisting human evaluators
Figure 2 for Self-critiquing models for assisting human evaluators
Figure 3 for Self-critiquing models for assisting human evaluators
Figure 4 for Self-critiquing models for assisting human evaluators
Viaarxiv icon

Training language models to follow instructions with human feedback

Add code
Mar 04, 2022
Figure 1 for Training language models to follow instructions with human feedback
Figure 2 for Training language models to follow instructions with human feedback
Figure 3 for Training language models to follow instructions with human feedback
Figure 4 for Training language models to follow instructions with human feedback
Viaarxiv icon

WebGPT: Browser-assisted question-answering with human feedback

Add code
Dec 17, 2021
Figure 1 for WebGPT: Browser-assisted question-answering with human feedback
Figure 2 for WebGPT: Browser-assisted question-answering with human feedback
Figure 3 for WebGPT: Browser-assisted question-answering with human feedback
Figure 4 for WebGPT: Browser-assisted question-answering with human feedback
Viaarxiv icon

Recursively Summarizing Books with Human Feedback

Add code
Sep 27, 2021
Figure 1 for Recursively Summarizing Books with Human Feedback
Figure 2 for Recursively Summarizing Books with Human Feedback
Figure 3 for Recursively Summarizing Books with Human Feedback
Figure 4 for Recursively Summarizing Books with Human Feedback
Viaarxiv icon

Learning to summarize from human feedback

Add code
Sep 02, 2020
Figure 1 for Learning to summarize from human feedback
Figure 2 for Learning to summarize from human feedback
Figure 3 for Learning to summarize from human feedback
Figure 4 for Learning to summarize from human feedback
Viaarxiv icon

Bayesian Inference of Regular Expressions from Human-Generated Example Strings

Add code
Sep 26, 2018
Figure 1 for Bayesian Inference of Regular Expressions from Human-Generated Example Strings
Figure 2 for Bayesian Inference of Regular Expressions from Human-Generated Example Strings
Viaarxiv icon

Pedagogical learning

Add code
Nov 30, 2017
Figure 1 for Pedagogical learning
Figure 2 for Pedagogical learning
Figure 3 for Pedagogical learning
Figure 4 for Pedagogical learning
Viaarxiv icon

Practical optimal experiment design with probabilistic programs

Add code
Aug 17, 2016
Figure 1 for Practical optimal experiment design with probabilistic programs
Figure 2 for Practical optimal experiment design with probabilistic programs
Figure 3 for Practical optimal experiment design with probabilistic programs
Figure 4 for Practical optimal experiment design with probabilistic programs
Viaarxiv icon