Picture for Joshua Achiam

Joshua Achiam

Tony

Rule Based Rewards for Language Model Safety

Add code
Nov 02, 2024
Viaarxiv icon

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

A Hazard Analysis Framework for Code Synthesis Large Language Models

Add code
Jul 25, 2022
Figure 1 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Figure 2 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Figure 3 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Viaarxiv icon

Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Add code
Jul 08, 2020
Figure 1 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 2 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 3 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 4 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Viaarxiv icon

Towards Characterizing Divergence in Deep Q-Learning

Add code
Mar 21, 2019
Figure 1 for Towards Characterizing Divergence in Deep Q-Learning
Figure 2 for Towards Characterizing Divergence in Deep Q-Learning
Figure 3 for Towards Characterizing Divergence in Deep Q-Learning
Figure 4 for Towards Characterizing Divergence in Deep Q-Learning
Viaarxiv icon

On First-Order Meta-Learning Algorithms

Add code
Oct 22, 2018
Figure 1 for On First-Order Meta-Learning Algorithms
Figure 2 for On First-Order Meta-Learning Algorithms
Figure 3 for On First-Order Meta-Learning Algorithms
Figure 4 for On First-Order Meta-Learning Algorithms
Viaarxiv icon

Variational Option Discovery Algorithms

Add code
Jul 26, 2018
Figure 1 for Variational Option Discovery Algorithms
Figure 2 for Variational Option Discovery Algorithms
Figure 3 for Variational Option Discovery Algorithms
Figure 4 for Variational Option Discovery Algorithms
Viaarxiv icon

Constrained Policy Optimization

Add code
May 30, 2017
Figure 1 for Constrained Policy Optimization
Viaarxiv icon

Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning

Add code
Mar 06, 2017
Figure 1 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 2 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 3 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 4 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Viaarxiv icon

Easy Monotonic Policy Iteration

Add code
Feb 29, 2016
Viaarxiv icon