Picture for Maxime Riché

Maxime Riché

Inoculation Prompting: Eliciting traits from LLMs during training can suppress them at test-time

Add code
Oct 05, 2025
Viaarxiv icon

Towards the Scalable Evaluation of Cooperativeness in Language Models

Add code
Mar 16, 2023
Viaarxiv icon

Normative Disagreement as a Challenge for Cooperative AI

Add code
Nov 27, 2021
Figure 1 for Normative Disagreement as a Challenge for Cooperative AI
Figure 2 for Normative Disagreement as a Challenge for Cooperative AI
Figure 3 for Normative Disagreement as a Challenge for Cooperative AI
Figure 4 for Normative Disagreement as a Challenge for Cooperative AI
Viaarxiv icon