Picture for Elias Stengel-Eskin

Elias Stengel-Eskin

Johns Hopkins University

Teaching Models to Balance Resisting and Accepting Persuasion

Add code
Oct 18, 2024
Figure 1 for Teaching Models to Balance Resisting and Accepting Persuasion
Figure 2 for Teaching Models to Balance Resisting and Accepting Persuasion
Figure 3 for Teaching Models to Balance Resisting and Accepting Persuasion
Figure 4 for Teaching Models to Balance Resisting and Accepting Persuasion
Viaarxiv icon

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

Add code
Oct 08, 2024
Viaarxiv icon

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits

Add code
Oct 02, 2024
Viaarxiv icon

MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning

Add code
Sep 18, 2024
Viaarxiv icon

AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge

Add code
Sep 11, 2024
Viaarxiv icon

System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Add code
Jul 19, 2024
Viaarxiv icon

Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?

Add code
Jun 27, 2024
Viaarxiv icon

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

Add code
Jun 17, 2024
Figure 1 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 2 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 3 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 4 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Viaarxiv icon

Are language models rational? The case of coherence norms and belief revision

Add code
Jun 05, 2024
Viaarxiv icon

LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models

Add code
May 31, 2024
Viaarxiv icon