Warren Xia

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Feb 09, 2025
Via arXiv