Picture for Markus Kunesch

Markus Kunesch

Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity

Add code
May 29, 2023
Figure 1 for Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Figure 2 for Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Figure 3 for Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Viaarxiv icon

Beyond Bayes-optimality: meta-learning what you know you don't know

Add code
Oct 12, 2022
Figure 1 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 2 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 3 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 4 for Beyond Bayes-optimality: meta-learning what you know you don't know
Viaarxiv icon

Your Policy Regularizer is Secretly an Adversary

Add code
Apr 01, 2022
Figure 1 for Your Policy Regularizer is Secretly an Adversary
Figure 2 for Your Policy Regularizer is Secretly an Adversary
Figure 3 for Your Policy Regularizer is Secretly an Adversary
Figure 4 for Your Policy Regularizer is Secretly an Adversary
Viaarxiv icon

Model-Free Risk-Sensitive Reinforcement Learning

Add code
Nov 04, 2021
Figure 1 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 2 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 3 for Model-Free Risk-Sensitive Reinforcement Learning
Figure 4 for Model-Free Risk-Sensitive Reinforcement Learning
Viaarxiv icon

Shaking the foundations: delusions in sequence models for interaction and control

Add code
Oct 20, 2021
Figure 1 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 2 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 3 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 4 for Shaking the foundations: delusions in sequence models for interaction and control
Viaarxiv icon

Causal Analysis of Agent Behavior for AI Safety

Add code
Mar 05, 2021
Figure 1 for Causal Analysis of Agent Behavior for AI Safety
Figure 2 for Causal Analysis of Agent Behavior for AI Safety
Figure 3 for Causal Analysis of Agent Behavior for AI Safety
Figure 4 for Causal Analysis of Agent Behavior for AI Safety
Viaarxiv icon

Human-interpretable model explainability on high-dimensional data

Add code
Oct 14, 2020
Figure 1 for Human-interpretable model explainability on high-dimensional data
Figure 2 for Human-interpretable model explainability on high-dimensional data
Figure 3 for Human-interpretable model explainability on high-dimensional data
Figure 4 for Human-interpretable model explainability on high-dimensional data
Viaarxiv icon