Picture for Alessandro Abate

Alessandro Abate

University of Oxford

Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols

Add code
Dec 17, 2024
Figure 1 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 2 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 3 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 4 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Viaarxiv icon

Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

Add code
Dec 15, 2024
Viaarxiv icon

Risk-Averse Certification of Bayesian Neural Networks

Add code
Nov 29, 2024
Viaarxiv icon

Partial Identifiability and Misspecification in Inverse Reinforcement Learning

Add code
Nov 24, 2024
Viaarxiv icon

Temporal-Difference Variational Continual Learning

Add code
Oct 10, 2024
Figure 1 for Temporal-Difference Variational Continual Learning
Figure 2 for Temporal-Difference Variational Continual Learning
Figure 3 for Temporal-Difference Variational Continual Learning
Figure 4 for Temporal-Difference Variational Continual Learning
Viaarxiv icon

DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications

Add code
Oct 06, 2024
Viaarxiv icon

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

Add code
Sep 12, 2024
Viaarxiv icon

Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation

Add code
Aug 21, 2024
Viaarxiv icon

Learning Provably Robust Policies in Uncertain Parametric Environments

Add code
Aug 06, 2024
Figure 1 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 2 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 3 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 4 for Learning Provably Robust Policies in Uncertain Parametric Environments
Viaarxiv icon

Walking the Values in Bayesian Inverse Reinforcement Learning

Add code
Jul 15, 2024
Figure 1 for Walking the Values in Bayesian Inverse Reinforcement Learning
Figure 2 for Walking the Values in Bayesian Inverse Reinforcement Learning
Figure 3 for Walking the Values in Bayesian Inverse Reinforcement Learning
Figure 4 for Walking the Values in Bayesian Inverse Reinforcement Learning
Viaarxiv icon