Shuyue Hu

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Mar 12, 2025

Nature-Inspired Population-Based Evolution of Large Language Models

Mar 03, 2025

If Multi-Agent Debate is the Answer, What is the Question?

Feb 12, 2025

EvoFlow: Evolving Diverse Agentic Workflows On The Fly

Feb 11, 2025

Understanding When and Why Graph Attention Mechanisms Work via Node Classification

Dec 20, 2024

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Nov 26, 2024

OASIS: Open Agents Social Interaction Simulations on One Million Agents

Nov 21, 2024

Configurable Mirror Descent: Towards a Unification of Decision Making

May 20, 2024

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

Apr 30, 2024

Emergence of Social Norms in Large Language Model-based Agent Societies

Mar 13, 2024