Picture for Kenneth Holstein

Kenneth Holstein

Validating LLM-as-a-Judge Systems in the Absence of Gold Labels

Add code
Mar 07, 2025
Viaarxiv icon

Intent Tagging: Exploring Micro-Prompting Interactions for Supporting Granular Human-GenAI Co-Creation Workflows

Add code
Feb 26, 2025
Viaarxiv icon

AI Mismatches: Identifying Potential Algorithmic Harms Before AI Development

Add code
Feb 25, 2025
Viaarxiv icon

AI Policy Projector: Grounding LLM Policy Design in Iterative Mapmaking

Add code
Sep 26, 2024
Viaarxiv icon

Studying Up Public Sector AI: How Networks of Power Relations Shape Agency Decisions Around AI Design and Use

Add code
May 21, 2024
Viaarxiv icon

Predictive Performance Comparison of Decision Policies Under Confounding

Add code
Apr 01, 2024
Figure 1 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 2 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 3 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 4 for Predictive Performance Comparison of Decision Policies Under Confounding
Viaarxiv icon

Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia

Add code
Feb 21, 2024
Viaarxiv icon

Training Towards Critical Use: Learning to Situate AI Predictions Relative to Human Knowledge

Add code
Aug 30, 2023
Viaarxiv icon

Recentering Validity Considerations through Early-Stage Deliberations Around AI and Policy Design

Add code
Mar 26, 2023
Viaarxiv icon

Understanding Frontline Workers' and Unhoused Individuals' Perspectives on AI Used in Homeless Services

Add code
Mar 17, 2023
Viaarxiv icon