Picture for Amrith Setlur

Amrith Setlur

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Add code
Mar 10, 2025
Viaarxiv icon

Scaling Test-Time Compute Without Verification or RL is Suboptimal

Add code
Feb 18, 2025
Viaarxiv icon

What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?

Add code
Nov 12, 2024
Figure 1 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 2 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 3 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 4 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

Add code
Jun 20, 2024
Viaarxiv icon

Leveraging Public Representations for Private Transfer Learning

Add code
Jan 16, 2024
Viaarxiv icon

Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift

Add code
Dec 06, 2023
Viaarxiv icon

Multitask Learning Can Improve Worst-Group Outcomes

Add code
Dec 05, 2023
Viaarxiv icon

Deep Neural Networks Tend To Extrapolate Predictably

Add code
Oct 02, 2023
Figure 1 for Deep Neural Networks Tend To Extrapolate Predictably
Figure 2 for Deep Neural Networks Tend To Extrapolate Predictably
Figure 3 for Deep Neural Networks Tend To Extrapolate Predictably
Figure 4 for Deep Neural Networks Tend To Extrapolate Predictably
Viaarxiv icon

Contextual Reliability: When Different Features Matter in Different Contexts

Add code
Jul 19, 2023
Viaarxiv icon