Sydney Levine

Imagining and building wise machines: The centrality of AI metacognition

Nov 04, 2024

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation

Oct 22, 2024

Intuitions of Compromise: Utilitarianism vs. Contractualism

Oct 07, 2024

Can Language Models Reason about Individualistic Human Values and Preferences?

Oct 04, 2024

Multilingual Trolley Problems for Language Models

Jul 02, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties

Sep 02, 2023

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

Oct 04, 2022

When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgement Based on Empirical Data

Jan 19, 2022

Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation

Mar 21, 2018