Picture for Mark Schmidt

Mark Schmidt

SIERRA, LIENS

Don't Be So Positive: Negative Step Sizes in Second-Order Methods

Add code
Nov 18, 2024
Viaarxiv icon

BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks

Add code
Jun 25, 2024
Figure 1 for BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks
Figure 2 for BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks
Figure 3 for BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks
Figure 4 for BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks
Viaarxiv icon

Why Line Search when you can Plane Search? SO-Friendly Neural Networks allow Per-Iteration Optimization of Learning and Momentum Rates for Every Layer

Add code
Jun 25, 2024
Viaarxiv icon

Enhancing Policy Gradient with the Polyak Step-Size Adaption

Add code
Apr 11, 2024
Figure 1 for Enhancing Policy Gradient with the Polyak Step-Size Adaption
Figure 2 for Enhancing Policy Gradient with the Polyak Step-Size Adaption
Figure 3 for Enhancing Policy Gradient with the Polyak Step-Size Adaption
Figure 4 for Enhancing Policy Gradient with the Polyak Step-Size Adaption
Viaarxiv icon

Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation

Add code
Apr 03, 2024
Viaarxiv icon

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Analyzing and Improving Greedy 2-Coordinate Updates for Equality-Constrained Optimization via Steepest Descent in the 1-Norm

Add code
Jul 03, 2023
Viaarxiv icon

Don't be so Monotone: Relaxing Stochastic Line Search in Over-Parameterized Models

Add code
Jun 22, 2023
Viaarxiv icon

Searching for Optimal Per-Coordinate Step-sizes with Multidimensional Backtracking

Add code
Jun 05, 2023
Viaarxiv icon

BiSLS/SPS: Auto-tune Step Sizes for Stable Bi-level Optimization

Add code
May 30, 2023
Viaarxiv icon