Anca D. Dragan

Quantifying Assistive Robustness Via the Natural-Adversarial Frontier

Oct 16, 2023
Via arXiv

Confronting Reward Model Overoptimization with Constrained RLHF

Oct 10, 2023
Via arXiv

Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

Sep 07, 2023
Via arXiv

Contextual Reliability: When Different Features Matter in Different Contexts

Jul 19, 2023
Via arXiv

Aligning Robot and Human Representations

Feb 03, 2023
Via arXiv

SIRL: Similarity-based Implicit Representation Learning

Jan 03, 2023
Via arXiv

Benchmarks and Algorithms for Offline Preference-Based Reward Learning

Jan 03, 2023
Via arXiv

Learning Representations that Enable Generalization in Assistive Tasks

Dec 05, 2022
Via arXiv

The Effect of Modeling Human Rationality Level on Learning Rewards from Multiple Feedback Types

Aug 23, 2022
Via arXiv

First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization

May 24, 2022
Via arXiv