Picture for Alessandro Abate

Alessandro Abate

University of Oxford

Modular Training of Neural Networks aids Interpretability

Add code
Feb 04, 2025
Figure 1 for Modular Training of Neural Networks aids Interpretability
Figure 2 for Modular Training of Neural Networks aids Interpretability
Figure 3 for Modular Training of Neural Networks aids Interpretability
Figure 4 for Modular Training of Neural Networks aids Interpretability
Viaarxiv icon

Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols

Add code
Dec 17, 2024
Figure 1 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 2 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 3 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Figure 4 for Subversion Strategy Eval: Evaluating AI's stateless strategic capabilities against control protocols
Viaarxiv icon

Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

Add code
Dec 15, 2024
Viaarxiv icon

Risk-Averse Certification of Bayesian Neural Networks

Add code
Nov 29, 2024
Figure 1 for Risk-Averse Certification of Bayesian Neural Networks
Figure 2 for Risk-Averse Certification of Bayesian Neural Networks
Figure 3 for Risk-Averse Certification of Bayesian Neural Networks
Figure 4 for Risk-Averse Certification of Bayesian Neural Networks
Viaarxiv icon

Partial Identifiability and Misspecification in Inverse Reinforcement Learning

Add code
Nov 24, 2024
Viaarxiv icon

Temporal-Difference Variational Continual Learning

Add code
Oct 10, 2024
Figure 1 for Temporal-Difference Variational Continual Learning
Figure 2 for Temporal-Difference Variational Continual Learning
Figure 3 for Temporal-Difference Variational Continual Learning
Figure 4 for Temporal-Difference Variational Continual Learning
Viaarxiv icon

DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications

Add code
Oct 06, 2024
Viaarxiv icon

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

Add code
Sep 12, 2024
Viaarxiv icon

Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation

Add code
Aug 21, 2024
Figure 1 for Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation
Figure 2 for Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation
Figure 3 for Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation
Figure 4 for Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation
Viaarxiv icon

Learning Provably Robust Policies in Uncertain Parametric Environments

Add code
Aug 06, 2024
Figure 1 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 2 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 3 for Learning Provably Robust Policies in Uncertain Parametric Environments
Figure 4 for Learning Provably Robust Policies in Uncertain Parametric Environments
Viaarxiv icon