Picture for Zifan Wu

Zifan Wu

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

Off-Policy Primal-Dual Safe Reinforcement Learning

Add code
Jan 26, 2024
Viaarxiv icon

Policy-regularized Offline Multi-objective Reinforcement Learning

Add code
Jan 04, 2024
Viaarxiv icon

Safe Offline Reinforcement Learning with Real-Time Budget Constraints

Add code
Jun 01, 2023
Viaarxiv icon

Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning

Add code
Jan 20, 2023
Viaarxiv icon

Coordinated Proximal Policy Optimization

Add code
Nov 07, 2021
Figure 1 for Coordinated Proximal Policy Optimization
Figure 2 for Coordinated Proximal Policy Optimization
Figure 3 for Coordinated Proximal Policy Optimization
Figure 4 for Coordinated Proximal Policy Optimization
Viaarxiv icon