Picture for Keith Ross

Keith Ross

On the Limits of Layer Pruning for Generative Reasoning in LLMs

Add code
Feb 02, 2026
Viaarxiv icon

Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning

Add code
Jan 28, 2026
Viaarxiv icon

Is Optimal Transport Necessary for Inverse Reinforcement Learning?

Add code
Jun 07, 2025
Figure 1 for Is Optimal Transport Necessary for Inverse Reinforcement Learning?
Figure 2 for Is Optimal Transport Necessary for Inverse Reinforcement Learning?
Figure 3 for Is Optimal Transport Necessary for Inverse Reinforcement Learning?
Figure 4 for Is Optimal Transport Necessary for Inverse Reinforcement Learning?
Viaarxiv icon

Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning

Add code
May 20, 2025
Figure 1 for Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning
Figure 2 for Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning
Figure 3 for Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning
Figure 4 for Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning
Viaarxiv icon

Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings

Add code
May 19, 2025
Viaarxiv icon

Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model

Add code
May 14, 2025
Viaarxiv icon

Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges

Add code
Feb 12, 2025
Viaarxiv icon

Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

Add code
Oct 03, 2024
Viaarxiv icon

The Prevalence of Neural Collapse in Neural Multivariate Regression

Add code
Sep 06, 2024
Figure 1 for The Prevalence of Neural Collapse in Neural Multivariate Regression
Figure 2 for The Prevalence of Neural Collapse in Neural Multivariate Regression
Figure 3 for The Prevalence of Neural Collapse in Neural Multivariate Regression
Figure 4 for The Prevalence of Neural Collapse in Neural Multivariate Regression
Viaarxiv icon

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Add code
Feb 07, 2024
Viaarxiv icon