Huy Nguyen

Rethinking Multinomial Logistic Mixture of Experts with Sigmoid Gating Function

Feb 01, 2026
Via arXiv

A Statistical Theory of Gated Attention through the Lens of Hierarchical Mixture of Experts

Feb 01, 2026
Via arXiv

Improving Minimax Estimation Rates for Contaminated Mixture of Multinomial Logistic Experts via Expert Heterogeneity

Jan 31, 2026
Via arXiv

Cite-While-You-Generate: Training-Free Evidence Attribution for Multimodal Clinical Summarization

Jan 23, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

DoRAN: Stabilizing Weight-Decomposed Low-Rank Adaptation via Noise Injection and Auxiliary Networks

Oct 05, 2025
Via arXiv

HoRA: Cross-Head Low-Rank Adaptation with Joint Hypernetworks

Oct 05, 2025
Via arXiv

AG-VPReID.VIR: Bridging Aerial and Ground Platforms for Video-based Visible-Infrared Person Re-ID

Jul 24, 2025
Via arXiv

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

May 24, 2025
Via arXiv

CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition

May 19, 2025
Via arXiv