Picture for Shuche Wang

Shuche Wang

Muon Learns More Robust and Transferable Features than Adam

Add code
Jun 08, 2026
Viaarxiv icon

Why Muon Outperforms Adam: A Curvature Perspective

Add code
Jun 03, 2026
Viaarxiv icon

Online Learning with Gradient-Variation Interval Regret

Add code
Jun 02, 2026
Viaarxiv icon

Bandit Convex Optimization with Gradient Prediction Adaptivity

Add code
May 21, 2026
Viaarxiv icon

Parameter-free Algorithms for the Stochastically Extended Adversarial Model

Add code
Oct 06, 2025
Figure 1 for Parameter-free Algorithms for the Stochastically Extended Adversarial Model
Figure 2 for Parameter-free Algorithms for the Stochastically Extended Adversarial Model
Viaarxiv icon

A Mirror Descent-Based Algorithm for Corruption-Tolerant Distributed Gradient Descent

Add code
Jul 19, 2024
Viaarxiv icon