Picture for Peter Barnett

Peter Barnett

What AI evaluations for preventing catastrophic risks can and cannot do

Add code
Nov 26, 2024
Figure 1 for What AI evaluations for preventing catastrophic risks can and cannot do
Figure 2 for What AI evaluations for preventing catastrophic risks can and cannot do
Viaarxiv icon

Declare and Justify: Explicit assumptions in AI evaluations are necessary for effective regulation

Add code
Nov 19, 2024
Viaarxiv icon

Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance

Add code
Sep 04, 2024
Figure 1 for Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance
Figure 2 for Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance
Figure 3 for Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance
Figure 4 for Governing dual-use technologies: Case studies of international security agreements and lessons for AI governance
Viaarxiv icon

Verification methods for international AI agreements

Add code
Aug 28, 2024
Figure 1 for Verification methods for international AI agreements
Figure 2 for Verification methods for international AI agreements
Figure 3 for Verification methods for international AI agreements
Figure 4 for Verification methods for international AI agreements
Viaarxiv icon

Active Reward Learning from Multiple Teachers

Add code
Mar 02, 2023
Viaarxiv icon