Picture for Marc Carauleanu

Marc Carauleanu

Towards Safe and Honest AI Agents with Neural Self-Other Overlap

Add code
Dec 20, 2024
Viaarxiv icon