Picture for Nathan Lile

Nathan Lile

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Add code
Mar 03, 2025
Figure 1 for Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Figure 2 for Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Figure 3 for Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Figure 4 for Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Viaarxiv icon

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Add code
Feb 24, 2025
Viaarxiv icon

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Add code
Jan 08, 2025
Viaarxiv icon

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Add code
Jul 24, 2024
Viaarxiv icon

Suppressing Pink Elephants with Direct Principle Feedback

Add code
Feb 13, 2024
Viaarxiv icon