Tianyi Zhou

LLMs Struggle to Measure What Distinguishes Students of Different Proficiency Levels: A Study of Item Discrimination in Reading Comprehension Assessment

Jun 17, 2026
Via arXiv

Guava: An Effective and Universal Harness for Embodied Manipulation

Jun 16, 2026
Via arXiv

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Jun 15, 2026
Via arXiv

Self-Evolving Visual Questioner

Jun 11, 2026
Via arXiv

ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation

Jun 10, 2026
Via arXiv

When is Your LLM Steerable?

Jun 10, 2026
Via arXiv

Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

Jun 04, 2026
Via arXiv

Sandboxed Coding Agents are Competitive Omni-modal Task Solvers

May 30, 2026
Via arXiv

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

May 29, 2026
Via arXiv

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

May 28, 2026
Via arXiv