Huy Nguyen

On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation

Feb 05, 2025
Via arXiv

RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts

Feb 05, 2025
Via arXiv

Adaptive Prompt: Unlocking the Power of Visual Prompt Tuning

Jan 31, 2025
Via arXiv

RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval

Jan 27, 2025
Via arXiv

Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture of Experts

Oct 16, 2024
Via arXiv

Quadratic Gating Functions in Mixture of Experts: A Statistical Insight

Oct 15, 2024
Via arXiv

Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts

Oct 03, 2024
Via arXiv

On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions

Oct 03, 2024
Via arXiv

Deep-Wide Learning Assistance for Insect Pest Classification

Sep 16, 2024
Via arXiv

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

May 23, 2024
Via arXiv