Picture for Rohan Subramani

Rohan Subramani

Higher-Order Belief in Incomplete Information MAIDs

Add code
Mar 08, 2025
Viaarxiv icon

Will an AI with Private Information Allow Itself to Be Switched Off?

Add code
Nov 25, 2024
Figure 1 for Will an AI with Private Information Allow Itself to Be Switched Off?
Figure 2 for Will an AI with Private Information Allow Itself to Be Switched Off?
Figure 3 for Will an AI with Private Information Allow Itself to Be Switched Off?
Figure 4 for Will an AI with Private Information Allow Itself to Be Switched Off?
Viaarxiv icon

Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains

Add code
Nov 19, 2023
Viaarxiv icon

On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning

Add code
Oct 18, 2023
Figure 1 for On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Figure 2 for On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Figure 3 for On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Figure 4 for On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Viaarxiv icon