Picture for Huy Nguyen

Huy Nguyen

Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture of Experts

Add code
Oct 16, 2024
Viaarxiv icon

Quadratic Gating Functions in Mixture of Experts: A Statistical Insight

Add code
Oct 15, 2024
Viaarxiv icon

On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions

Add code
Oct 03, 2024
Viaarxiv icon

Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts

Add code
Oct 03, 2024
Viaarxiv icon

Deep-Wide Learning Assistance for Insect Pest Classification

Add code
Sep 16, 2024
Viaarxiv icon

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

Add code
May 23, 2024
Figure 1 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 2 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 3 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 4 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Viaarxiv icon

Mixture of Experts Meets Prompt-Based Continual Learning

Add code
May 23, 2024
Viaarxiv icon

Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts

Add code
May 22, 2024
Viaarxiv icon

On Parameter Estimation in Deviated Gaussian Mixture of Experts

Add code
Feb 07, 2024
Viaarxiv icon

FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion

Add code
Feb 05, 2024
Viaarxiv icon