David Manheim

The Necessity of AI Audit Standards Boards

Apr 11, 2024
Via arXiv

Modeling Transformative AI Risks (MTAIR) Project -- Summary Report

Jun 19, 2022
Via arXiv

Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety

Jan 09, 2022
Via arXiv

Forecasting AI Progress: A Research Agenda

Aug 04, 2020
Via arXiv

Oversight of Unsafe Systems via Dynamic Safety Envelopes

Nov 22, 2018
Via arXiv

Overoptimization Failures and Specification Gaming in Multi-agent Systems

Oct 31, 2018
Via arXiv

Categorizing Variants of Goodhart's Law

Apr 09, 2018
Via arXiv