Wei Qiu

Prompt Optimization Is a Coin Flip: Diagnosing When It Helps in Compound AI Systems

Apr 16, 2026

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

Apr 13, 2026

Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning

Feb 07, 2023

Learning to Maximize Mutual Information for Dynamic Feature Selection

Jan 02, 2023

CELLS: A Parallel Corpus for Biomedical Lay Language Generation

Nov 07, 2022

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Oct 18, 2022

Off-Beat Multi-Agent Reinforcement Learning

May 27, 2022

Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning

Aug 09, 2021

Contingency-Aware Influence Maximization: A Reinforcement Learning Approach

Jun 13, 2021

RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents

Feb 17, 2021