Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pedro A Ortega

Agent Incentives: A Causal Perspective

Feb 02, 2021

Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg

Figure 1 for Agent Incentives: A Causal Perspective

Figure 2 for Agent Incentives: A Causal Perspective

Figure 3 for Agent Incentives: A Causal Perspective

Figure 4 for Agent Incentives: A Causal Perspective

Abstract:We present a framework for analysing agent incentives using causal influence diagrams. We establish that a well-known criterion for value of information is complete. We propose a new graphical criterion for value of control, establishing its soundness and completeness. We also introduce two new concepts for incentive analysis: response incentives indicate which changes in the environment affect an optimal decision, while instrumental control incentives establish whether an agent can influence its utility via a variable X. For both new concepts, we provide sound and complete graphical criteria. We show by example how these results can help with evaluating the safety and fairness of an AI system.

* In Proceedings of the AAAI 2021 Conference. Supersedes arXiv:1902.09980, arXiv:2001.07118

Via

Access Paper or Ask Questions

Metabolic cost as an organizing principle for cooperative learning

Feb 09, 2013

David Balduzzi, Pedro A Ortega, Michel Besserve

Figure 1 for Metabolic cost as an organizing principle for cooperative learning

Figure 2 for Metabolic cost as an organizing principle for cooperative learning

Abstract:This paper investigates how neurons can use metabolic cost to facilitate learning at a population level. Although decision-making by individual neurons has been extensively studied, questions regarding how neurons should behave to cooperate effectively remain largely unaddressed. Under assumptions that capture a few basic features of cortical neurons, we show that constraining reward maximization by metabolic cost aligns the information content of actions with their expected reward. Thus, metabolic cost provides a mechanism whereby neurons encode expected reward into their outputs. Further, aside from reducing energy expenditures, imposing a tight metabolic constraint also increases the accuracy of empirical estimates of rewards, increasing the robustness of distributed learning. Finally, we present two implementations of metabolically constrained learning that confirm our theoretical finding. These results suggest that metabolic cost may be an organizing principle underlying the neural code, and may also provide a useful guide to the design and analysis of other cooperating populations.

* 14 pages, 2 figures, to appear in Advances in Complex Systems

Via

Access Paper or Ask Questions