Picture for Dale Schuurmans

Dale Schuurmans

University of Alberta

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates

Add code
Feb 11, 2025
Viaarxiv icon

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Add code
Jan 28, 2025
Figure 1 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 2 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 3 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 4 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Viaarxiv icon

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

Add code
Dec 03, 2024
Viaarxiv icon

Toward Understanding In-context vs. In-weight Learning

Add code
Oct 30, 2024
Viaarxiv icon

Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment

Add code
Oct 28, 2024
Viaarxiv icon

Plastic Learning with Deep Fourier Features

Add code
Oct 27, 2024
Viaarxiv icon

Autoregressive Large Language Models are Computationally Universal

Add code
Oct 04, 2024
Viaarxiv icon

Generative Hierarchical Materials Search

Add code
Sep 10, 2024
Figure 1 for Generative Hierarchical Materials Search
Figure 2 for Generative Hierarchical Materials Search
Figure 3 for Generative Hierarchical Materials Search
Figure 4 for Generative Hierarchical Materials Search
Viaarxiv icon

Exploring and Benchmarking the Planning Capabilities of Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Learning Continually by Spectral Regularization

Add code
Jun 10, 2024
Figure 1 for Learning Continually by Spectral Regularization
Figure 2 for Learning Continually by Spectral Regularization
Figure 3 for Learning Continually by Spectral Regularization
Figure 4 for Learning Continually by Spectral Regularization
Viaarxiv icon