Picture for Sanjeev Arora

Sanjeev Arora

What Makes a Reward Model a Good Teacher? An Optimization Perspective

Add code
Mar 19, 2025
Viaarxiv icon

Weak-to-Strong Generalization Even in Random Feature Networks, Provably

Add code
Mar 04, 2025
Viaarxiv icon

On the Power of Context-Enhanced Learning in LLMs

Add code
Mar 03, 2025
Viaarxiv icon

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

Add code
Feb 11, 2025
Viaarxiv icon

Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set

Add code
Feb 05, 2025
Viaarxiv icon

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Add code
Jan 05, 2025
Figure 1 for Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Figure 2 for Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Figure 3 for Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Figure 4 for Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Viaarxiv icon

Provable unlearning in topic modeling and downstream tasks

Add code
Nov 20, 2024
Viaarxiv icon

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Add code
Oct 11, 2024
Viaarxiv icon

Can Models Learn Skill Composition from Examples?

Add code
Sep 29, 2024
Figure 1 for Can Models Learn Skill Composition from Examples?
Figure 2 for Can Models Learn Skill Composition from Examples?
Figure 3 for Can Models Learn Skill Composition from Examples?
Figure 4 for Can Models Learn Skill Composition from Examples?
Viaarxiv icon

Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning

Add code
Aug 27, 2024
Viaarxiv icon