Picture for Lujain Ibrahim

Lujain Ibrahim

Offloading Score: Measuring AI Reliance Through Counterfactual Workflows

Add code
May 28, 2026
Viaarxiv icon

What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct

Add code
May 20, 2026
Viaarxiv icon

Verbalizing LLMs' assumptions to explain and control sycophancy

Add code
Apr 03, 2026
Viaarxiv icon

Evaluating Language Models for Harmful Manipulation

Add code
Mar 26, 2026
Viaarxiv icon

Training language models to be warm and empathetic makes them less reliable and more sycophantic

Add code
Jul 30, 2025
Viaarxiv icon

Social Sycophancy: A Broader Understanding of LLM Sycophancy

Add code
May 20, 2025
Viaarxiv icon

Thinking beyond the anthropomorphic paradigm benefits LLM research

Add code
Feb 13, 2025
Figure 1 for Thinking beyond the anthropomorphic paradigm benefits LLM research
Figure 2 for Thinking beyond the anthropomorphic paradigm benefits LLM research
Viaarxiv icon

Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models

Add code
Feb 10, 2025
Viaarxiv icon

Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks

Add code
May 17, 2024
Figure 1 for Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks
Figure 2 for Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks
Figure 3 for Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks
Figure 4 for Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks
Viaarxiv icon

Characterizing and modeling harms from interactions with design patterns in AI interfaces

Add code
Apr 17, 2024
Viaarxiv icon