Picture for Kim-Chuan Toh

Kim-Chuan Toh

Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability

Add code
Mar 12, 2026
Viaarxiv icon

Towards Understanding Why Data Augmentation Improves Generalization

Add code
Feb 13, 2025
Figure 1 for Towards Understanding Why Data Augmentation Improves Generalization
Figure 2 for Towards Understanding Why Data Augmentation Improves Generalization
Figure 3 for Towards Understanding Why Data Augmentation Improves Generalization
Figure 4 for Towards Understanding Why Data Augmentation Improves Generalization
Viaarxiv icon

Memory-Efficient 4-bit Preconditioned Stochastic Optimization

Add code
Dec 14, 2024
Viaarxiv icon

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

Add code
Oct 15, 2024
Figure 1 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 2 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 3 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 4 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Viaarxiv icon

Optimization Hyper-parameter Laws for Large Language Models

Add code
Sep 07, 2024
Figure 1 for Optimization Hyper-parameter Laws for Large Language Models
Figure 2 for Optimization Hyper-parameter Laws for Large Language Models
Figure 3 for Optimization Hyper-parameter Laws for Large Language Models
Figure 4 for Optimization Hyper-parameter Laws for Large Language Models
Viaarxiv icon

LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

Add code
Jul 05, 2024
Viaarxiv icon

Vertex Exchange Method for a Class of Quadratic Programming Problems

Add code
Jul 03, 2024
Viaarxiv icon

Developing Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

Add code
Apr 15, 2024
Viaarxiv icon

An Inexact Halpern Iteration with Application to Distributionally Robust Optimization

Add code
Feb 12, 2024
Viaarxiv icon

On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods

Add code
Dec 22, 2023
Figure 1 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 2 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 3 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Figure 4 for On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods
Viaarxiv icon