Picture for Iason Gabriel

Iason Gabriel

Value Profiles for Encoding Human Variation

Add code
Mar 19, 2025
Viaarxiv icon

Do LLMs exhibit demographic parity in responses to queries about Human Rights?

Add code
Feb 26, 2025
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

Relational Norms for Human-AI Cooperation

Add code
Feb 17, 2025
Viaarxiv icon

Why human-AI relationships need socioaffective alignment

Add code
Feb 04, 2025
Viaarxiv icon

Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data

Add code
Jun 19, 2024
Figure 1 for Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Figure 2 for Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Figure 3 for Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Figure 4 for Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Viaarxiv icon

Holistic Safety and Responsibility Evaluations of Advanced AI Models

Add code
Apr 22, 2024
Viaarxiv icon

Sociotechnical Safety Evaluation of Generative AI Systems

Add code
Oct 31, 2023
Viaarxiv icon

Model evaluation for extreme risks

Add code
May 24, 2023
Figure 1 for Model evaluation for extreme risks
Figure 2 for Model evaluation for extreme risks
Figure 3 for Model evaluation for extreme risks
Figure 4 for Model evaluation for extreme risks
Viaarxiv icon

Manifestations of Xenophobia in AI Systems

Add code
Dec 15, 2022
Viaarxiv icon