Amrith Setlur

What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?

Nov 12, 2024

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Oct 10, 2024

RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

Jun 20, 2024

Leveraging Public Representations for Private Transfer Learning

Jan 16, 2024

Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift

Dec 06, 2023

Multitask Learning Can Improve Worst-Group Outcomes

Dec 05, 2023

Deep Neural Networks Tend To Extrapolate Predictably

Oct 02, 2023

Contextual Reliability: When Different Features Matter in Different Contexts

Jul 19, 2023

Confidence-Based Model Selection: When to Take Shortcuts for Subpopulation Shifts

Jun 19, 2023

Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features

Feb 10, 2023