Yonadav Shavit

Frontier AI Regulation: Managing Emerging Risks to Public Safety

Jul 11, 2023

Tools for Verifying Neural Models' Training Data

Jul 02, 2023

What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring

Mar 20, 2023

Strengthening Subcommunities: Towards Sustainable Growth in AI Research

Apr 18, 2022

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Feb 24, 2020

Extracting Incentives from Black-Box Decisions

Oct 13, 2019