Picture for Qizhen Zhang

Qizhen Zhang

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

Add code
Aug 28, 2024
Viaarxiv icon

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Add code
Aug 15, 2024
Figure 1 for BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Figure 2 for BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Figure 3 for BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Figure 4 for BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Viaarxiv icon

PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition

Add code
May 14, 2024
Viaarxiv icon

Analysing the Sample Complexity of Opponent Shaping

Add code
Feb 08, 2024
Viaarxiv icon

Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native

Add code
Jan 17, 2024
Viaarxiv icon

Centralized Model and Exploration Policy for Multi-Agent RL

Add code
Jul 14, 2021
Figure 1 for Centralized Model and Exploration Policy for Multi-Agent RL
Figure 2 for Centralized Model and Exploration Policy for Multi-Agent RL
Figure 3 for Centralized Model and Exploration Policy for Multi-Agent RL
Figure 4 for Centralized Model and Exploration Policy for Multi-Agent RL
Viaarxiv icon