Picture for Carlos Riquelme Ruiz

Carlos Riquelme Ruiz

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
May 29, 2023
Viaarxiv icon

Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints

Add code
Dec 09, 2022
Figure 1 for Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Figure 2 for Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Figure 3 for Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Figure 4 for Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Viaarxiv icon

Sparse MoEs meet Efficient Ensembles

Add code
Oct 07, 2021
Figure 1 for Sparse MoEs meet Efficient Ensembles
Figure 2 for Sparse MoEs meet Efficient Ensembles
Figure 3 for Sparse MoEs meet Efficient Ensembles
Figure 4 for Sparse MoEs meet Efficient Ensembles
Viaarxiv icon