Picture for Jan Ludziejewski

Jan Ludziejewski

Scaling Laws for Fine-Grained Mixture of Experts

Add code
Feb 12, 2024
Viaarxiv icon

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Add code
Jan 08, 2024
Viaarxiv icon

Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation

Add code
Oct 24, 2023
Viaarxiv icon