Picture for Kyle Fish

Kyle Fish

The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Add code
Jan 15, 2026
Viaarxiv icon

The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models

Add code
Sep 05, 2025
Figure 1 for The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
Figure 2 for The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
Figure 3 for The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
Figure 4 for The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
Viaarxiv icon

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Add code
May 20, 2025
Viaarxiv icon

Taking AI Welfare Seriously

Add code
Nov 04, 2024
Viaarxiv icon