Picture for Allan Dafoe

Allan Dafoe

Holistic Safety and Responsibility Evaluations of Advanced AI Models

Add code
Apr 22, 2024
Viaarxiv icon

Evaluating Frontier Models for Dangerous Capabilities

Add code
Mar 20, 2024
Figure 1 for Evaluating Frontier Models for Dangerous Capabilities
Figure 2 for Evaluating Frontier Models for Dangerous Capabilities
Figure 3 for Evaluating Frontier Models for Dangerous Capabilities
Figure 4 for Evaluating Frontier Models for Dangerous Capabilities
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Levels of AGI: Operationalizing Progress on the Path to AGI

Add code
Nov 04, 2023
Viaarxiv icon

Model evaluation for extreme risks

Add code
May 24, 2023
Figure 1 for Model evaluation for extreme risks
Figure 2 for Model evaluation for extreme risks
Figure 3 for Model evaluation for extreme risks
Figure 4 for Model evaluation for extreme risks
Viaarxiv icon

Democratising AI: Multiple Meanings, Goals, and Methods

Add code
Mar 27, 2023
Viaarxiv icon

Normative Disagreement as a Challenge for Cooperative AI

Add code
Nov 27, 2021
Figure 1 for Normative Disagreement as a Challenge for Cooperative AI
Figure 2 for Normative Disagreement as a Challenge for Cooperative AI
Figure 3 for Normative Disagreement as a Challenge for Cooperative AI
Figure 4 for Normative Disagreement as a Challenge for Cooperative AI
Viaarxiv icon

Institutionalising Ethics in AI through Broader Impact Requirements

Add code
May 30, 2021
Figure 1 for Institutionalising Ethics in AI through Broader Impact Requirements
Figure 2 for Institutionalising Ethics in AI through Broader Impact Requirements
Viaarxiv icon

Open Problems in Cooperative AI

Add code
Dec 15, 2020
Figure 1 for Open Problems in Cooperative AI
Figure 2 for Open Problems in Cooperative AI
Figure 3 for Open Problems in Cooperative AI
Figure 4 for Open Problems in Cooperative AI
Viaarxiv icon

The Windfall Clause: Distributing the Benefits of AI for the Common Good

Add code
Jan 24, 2020
Figure 1 for The Windfall Clause: Distributing the Benefits of AI for the Common Good
Viaarxiv icon