Honesty Is the Best Policy: Defining and Mitigating AI Deception

Dec 03, 2023


