Picture for Junjie Sheng

Junjie Sheng

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Add code
Dec 06, 2023
Viaarxiv icon

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

Add code
Jun 08, 2023
Viaarxiv icon

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

Add code
Nov 29, 2022
Viaarxiv icon

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning

Add code
Nov 21, 2022
Figure 1 for Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning
Figure 2 for Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning
Figure 3 for Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning
Figure 4 for Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning
Viaarxiv icon

Obtaining Dyadic Fairness by Optimal Transport

Add code
Feb 09, 2022
Figure 1 for Obtaining Dyadic Fairness by Optimal Transport
Figure 2 for Obtaining Dyadic Fairness by Optimal Transport
Figure 3 for Obtaining Dyadic Fairness by Optimal Transport
Figure 4 for Obtaining Dyadic Fairness by Optimal Transport
Viaarxiv icon

VMAgent: Scheduling Simulator for Reinforcement Learning

Add code
Dec 09, 2021
Figure 1 for VMAgent: Scheduling Simulator for Reinforcement Learning
Figure 2 for VMAgent: Scheduling Simulator for Reinforcement Learning
Viaarxiv icon

Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition

Add code
Feb 21, 2021
Figure 1 for Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition
Figure 2 for Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition
Figure 3 for Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition
Figure 4 for Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition
Viaarxiv icon

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

Add code
Feb 09, 2021
Figure 1 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 2 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 3 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 4 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Viaarxiv icon

Learning Structured Communication for Multi-agent Reinforcement Learning

Add code
Feb 11, 2020
Figure 1 for Learning Structured Communication for Multi-agent Reinforcement Learning
Figure 2 for Learning Structured Communication for Multi-agent Reinforcement Learning
Figure 3 for Learning Structured Communication for Multi-agent Reinforcement Learning
Figure 4 for Learning Structured Communication for Multi-agent Reinforcement Learning
Viaarxiv icon