Picture for David Farhi

David Farhi

Tony

Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation

Add code
Mar 14, 2025
Viaarxiv icon

OpenAI o1 System Card

Add code
Dec 21, 2024
Figure 1 for OpenAI o1 System Card
Figure 2 for OpenAI o1 System Card
Figure 3 for OpenAI o1 System Card
Figure 4 for OpenAI o1 System Card
Viaarxiv icon

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Add code
Mar 28, 2022
Figure 1 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 2 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 3 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 4 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Viaarxiv icon

Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

Add code
Jun 28, 2021
Figure 1 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 2 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 3 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 4 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Viaarxiv icon

Dota 2 with Large Scale Deep Reinforcement Learning

Add code
Dec 13, 2019
Figure 1 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 2 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 3 for Dota 2 with Large Scale Deep Reinforcement Learning
Figure 4 for Dota 2 with Large Scale Deep Reinforcement Learning
Viaarxiv icon