Diyi Yang

Stanford University

Towards Execution-Grounded Automated AI Research

Jan 20, 2026
Via arXiv

CooperBench: Why Coding Agents Cannot be Your Teammates Yet

Jan 19, 2026
Via arXiv

DECEPTICON: How Dark Patterns Manipulate Web Agents

Dec 28, 2025
Via arXiv

AutoMetrics: Approximate Human Judgements with Automatically Generated Evaluators

Dec 19, 2025
Via arXiv

Real-Time Reasoning Agents in Evolving Environments

Nov 07, 2025
Via arXiv

Culture Cartography: Mapping the Landscape of Cultural Knowledge

Oct 31, 2025
Via arXiv

How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations

Oct 26, 2025
Via arXiv

Generative Interfaces for Language Models

Aug 26, 2025
Via arXiv

OpenCUA: Open Foundations for Computer-Use Agents

Aug 12, 2025
Via arXiv

The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas

Jun 25, 2025
Via arXiv