Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chris van Merwijk

A Complete Criterion for Value of Information in Soluble Influence Diagrams

Feb 23, 2022

Chris van Merwijk, Ryan Carey, Tom Everitt

Figure 1 for A Complete Criterion for Value of Information in Soluble Influence Diagrams

Figure 2 for A Complete Criterion for Value of Information in Soluble Influence Diagrams

Figure 3 for A Complete Criterion for Value of Information in Soluble Influence Diagrams

Figure 4 for A Complete Criterion for Value of Information in Soluble Influence Diagrams

Abstract:Influence diagrams have recently been used to analyse the safety and fairness properties of AI systems. A key building block for this analysis is a graphical criterion for value of information (VoI). This paper establishes the first complete graphical criterion for VoI in influence diagrams with multiple decisions. Along the way, we establish two important techniques for proving properties of multi-decision influence diagrams: ID homomorphisms are structure-preserving transformations of influence diagrams, while a Tree of Systems is collection of paths that captures how information and control can flow in an influence diagram.

* In Proceedings of the AAAI 2022 Conference

Via

Access Paper or Ask Questions

Risks from Learned Optimization in Advanced Machine Learning Systems

Jun 11, 2019

Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, Scott Garrabrant

Figure 1 for Risks from Learned Optimization in Advanced Machine Learning Systems

Figure 2 for Risks from Learned Optimization in Advanced Machine Learning Systems

Figure 3 for Risks from Learned Optimization in Advanced Machine Learning Systems

Abstract:We analyze the type of learned optimization that occurs when a learned model (such as a neural network) is itself an optimizer - a situation we refer to as mesa-optimization, a neologism we introduce in this paper. We believe that the possibility of mesa-optimization raises two important questions for the safety and transparency of advanced machine learning systems. First, under what circumstances will learned models be optimizers, including when they should not be? Second, when a learned model is an optimizer, what will its objective be - how will it differ from the loss function it was trained under - and how can it be aligned? In this paper, we provide an in-depth analysis of these two primary questions and provide an overview of topics for future research.

Via

Access Paper or Ask Questions