Picture for Micah Rentschler

Micah Rentschler

Reinforcement Learning from Meta-Evaluation: Aligning Language Models Without Ground-Truth Labels

Add code
Jan 29, 2026
Viaarxiv icon

RL + Transformer = A General-Purpose Problem Solver

Add code
Jan 24, 2025
Figure 1 for RL + Transformer = A General-Purpose Problem Solver
Figure 2 for RL + Transformer = A General-Purpose Problem Solver
Figure 3 for RL + Transformer = A General-Purpose Problem Solver
Figure 4 for RL + Transformer = A General-Purpose Problem Solver
Viaarxiv icon