Picture for Kamil Ciebiera

Kamil Ciebiera

Scaling Laws for Fine-Grained Mixture of Experts

Add code
Feb 12, 2024
Figure 1 for Scaling Laws for Fine-Grained Mixture of Experts
Figure 2 for Scaling Laws for Fine-Grained Mixture of Experts
Figure 3 for Scaling Laws for Fine-Grained Mixture of Experts
Figure 4 for Scaling Laws for Fine-Grained Mixture of Experts
Viaarxiv icon

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Add code
Jan 08, 2024
Figure 1 for MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Figure 2 for MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Figure 3 for MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Figure 4 for MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Viaarxiv icon