Picture for Ryan A. Chi

Ryan A. Chi

Persona Features Control Emergent Misalignment

Add code
Jun 24, 2025
Figure 1 for Persona Features Control Emergent Misalignment
Figure 2 for Persona Features Control Emergent Misalignment
Figure 3 for Persona Features Control Emergent Misalignment
Figure 4 for Persona Features Control Emergent Misalignment
Viaarxiv icon

modeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models

Add code
Jun 24, 2024
Viaarxiv icon

Premise Order Matters in Reasoning with Large Language Models

Add code
Feb 14, 2024
Figure 1 for Premise Order Matters in Reasoning with Large Language Models
Figure 2 for Premise Order Matters in Reasoning with Large Language Models
Figure 3 for Premise Order Matters in Reasoning with Large Language Models
Figure 4 for Premise Order Matters in Reasoning with Large Language Models
Viaarxiv icon