Picture for Kim-Chuan Toh

Kim-Chuan Toh

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

Add code
Oct 15, 2024
Figure 1 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 2 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 3 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 4 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Viaarxiv icon

Optimization Hyper-parameter Laws for Large Language Models

Add code
Sep 07, 2024
Viaarxiv icon

LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

Add code
Jul 05, 2024
Viaarxiv icon

Vertex Exchange Method for a Class of Quadratic Programming Problems

Add code
Jul 03, 2024
Viaarxiv icon

Developing Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

Add code
Apr 15, 2024
Viaarxiv icon

An Inexact Halpern Iteration with Application to Distributionally Robust Optimization

Add code
Feb 12, 2024
Viaarxiv icon

On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods

Add code
Dec 22, 2023
Figure 1 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 2 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 3 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 4 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Viaarxiv icon

Adam-family Methods with Decoupled Weight Decay in Deep Learning

Add code
Oct 13, 2023
Viaarxiv icon

Convergence Guarantees for Stochastic Subgradient Methods in Nonsmooth Nonconvex Optimization

Add code
Jul 19, 2023
Viaarxiv icon

Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning

Add code
Jun 29, 2023
Figure 1 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 2 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 3 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 4 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Viaarxiv icon