Picture for Julia Kempe

Julia Kempe

Likelihood-Based Reward Designs for General LLM Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Add code
Jan 26, 2026
Viaarxiv icon

Outcome-based Exploration for LLM Reasoning

Add code
Sep 08, 2025
Viaarxiv icon

Tuning without Peeking: Provable Privacy and Generalization Bounds for LLM Post-Training

Add code
Jul 02, 2025
Viaarxiv icon

PILAF: Optimal Human Preference Sampling for Reward Modeling

Add code
Feb 06, 2025
Figure 1 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 2 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 3 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 4 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Viaarxiv icon

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Add code
Oct 29, 2024
Viaarxiv icon

On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds

Add code
Oct 21, 2024
Viaarxiv icon

Emergent properties with repeated examples

Add code
Oct 09, 2024
Figure 1 for Emergent properties with repeated examples
Figure 2 for Emergent properties with repeated examples
Figure 3 for Emergent properties with repeated examples
Figure 4 for Emergent properties with repeated examples
Viaarxiv icon

Strong Model Collapse

Add code
Oct 07, 2024
Figure 1 for Strong Model Collapse
Figure 2 for Strong Model Collapse
Figure 3 for Strong Model Collapse
Figure 4 for Strong Model Collapse
Viaarxiv icon

Mission Impossible: A Statistical Perspective on Jailbreaking LLMs

Add code
Aug 02, 2024
Viaarxiv icon