Alexander Hägele

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Jan 31, 2025
Via arXiv

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

May 29, 2024
Via arXiv

BaCaDI: Bayesian Causal Discovery with Unknown Interventions

Jun 03, 2022
Via arXiv