Picture for Tan M. Nguyen

Tan M. Nguyen

Distance-Based Tree-Sliced Wasserstein Distance

Add code
Mar 14, 2025
Viaarxiv icon

Spherical Tree-Sliced Wasserstein Distance

Add code
Mar 14, 2025
Viaarxiv icon

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling

Add code
Mar 14, 2025
Viaarxiv icon

CAMEx: Curvature-aware Merging of Experts

Add code
Feb 26, 2025
Viaarxiv icon

Tight Clusters Make Specialized Experts

Add code
Feb 21, 2025
Viaarxiv icon

An Attention-based Framework for Fair Contrastive Learning

Add code
Nov 22, 2024
Viaarxiv icon

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts

Add code
Oct 18, 2024
Viaarxiv icon

Tree-Sliced Wasserstein Distance on a System of Lines

Add code
Jun 19, 2024
Figure 1 for Tree-Sliced Wasserstein Distance on a System of Lines
Figure 2 for Tree-Sliced Wasserstein Distance on a System of Lines
Figure 3 for Tree-Sliced Wasserstein Distance on a System of Lines
Figure 4 for Tree-Sliced Wasserstein Distance on a System of Lines
Viaarxiv icon

A Primal-Dual Framework for Transformers and Neural Networks

Add code
Jun 19, 2024
Viaarxiv icon

Elliptical Attention

Add code
Jun 19, 2024
Viaarxiv icon