Picture for Anka Reuel

Anka Reuel

Michael Pokorny

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

More than Marketing? On the Information Value of AI Benchmarks for Practitioners

Add code
Dec 07, 2024
Viaarxiv icon

The Reality of AI and Biorisk

Add code
Dec 02, 2024
Viaarxiv icon

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Add code
Nov 20, 2024
Figure 1 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 2 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 3 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 4 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Viaarxiv icon

Artificial Intelligence Index Report 2024

Add code
May 29, 2024
Viaarxiv icon

Fairness in Reinforcement Learning: A Survey

Add code
May 11, 2024
Viaarxiv icon

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

Add code
Jan 07, 2024
Viaarxiv icon

International Governance of Civilian AI: A Jurisdictional Certification Approach

Add code
Sep 11, 2023
Figure 1 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 2 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 3 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 4 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Viaarxiv icon

Analyzing And Editing Inner Mechanisms Of Backdoored Language Models

Add code
Feb 24, 2023
Figure 1 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 2 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 3 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Figure 4 for Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
Viaarxiv icon