Picture for Dani Roytburg

Dani Roytburg

Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations

Add code
Jan 30, 2026
Viaarxiv icon

Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research

Add code
Dec 10, 2025
Figure 1 for Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research
Figure 2 for Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research
Figure 3 for Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research
Figure 4 for Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research
Viaarxiv icon

Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators

Add code
Sep 03, 2025
Viaarxiv icon

Words and Action: Modeling Linguistic Leadership in #BlackLivesMatter Communities

Add code
Dec 03, 2024
Figure 1 for Words and Action: Modeling Linguistic Leadership in #BlackLivesMatter Communities
Figure 2 for Words and Action: Modeling Linguistic Leadership in #BlackLivesMatter Communities
Figure 3 for Words and Action: Modeling Linguistic Leadership in #BlackLivesMatter Communities
Figure 4 for Words and Action: Modeling Linguistic Leadership in #BlackLivesMatter Communities
Viaarxiv icon