Picture for Seong Hah Cho

Seong Hah Cho

Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering

Add code
Mar 17, 2025
Viaarxiv icon

Inducing Human-like Biases in Moral Reasoning Language Models

Add code
Nov 23, 2024
Figure 1 for Inducing Human-like Biases in Moral Reasoning Language Models
Figure 2 for Inducing Human-like Biases in Moral Reasoning Language Models
Figure 3 for Inducing Human-like Biases in Moral Reasoning Language Models
Figure 4 for Inducing Human-like Biases in Moral Reasoning Language Models
Viaarxiv icon