Tan M. Nguyen

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts

Oct 18, 2024, via arXiv

Tree-Sliced Wasserstein Distance on a System of Lines

Jun 19, 2024, via arXiv

A Primal-Dual Framework for Transformers and Neural Networks

Jun 19, 2024, via arXiv

Elliptical Attention

Jun 19, 2024, via arXiv

Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis

Jun 19, 2024, via arXiv

PIDformer: Transformer Meets Control Theory

Feb 25, 2024, via arXiv

Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals

Dec 01, 2023, via arXiv

p-Laplacian Transformer

Nov 06, 2023, via arXiv

From Coupled Oscillators to Graph Neural Networks: Reducing Over-smoothing via a Kuramoto Model-based Approach

Nov 06, 2023, via arXiv

ARIST: An Effective API Argument Recommendation Approach

Jun 11, 2023, via arXiv