Johannes Scholl

Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models

Jan 30, 2025
Via arXiv