Pedram Akbarian

Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture of Experts

Oct 16, 2024 (via arXiv)

Quadratic Gating Functions in Mixture of Experts: A Statistical Insight

Oct 15, 2024 (via arXiv)

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

May 23, 2024 (via arXiv)

Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?

Jan 25, 2024 (via arXiv)

A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts

Oct 22, 2023 (via arXiv)

Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts

Sep 25, 2023 (via arXiv)