Picture for Cihang Xie

Cihang Xie

University of California, Santa Cruz

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

Add code
Apr 23, 2026
Viaarxiv icon

Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

Add code
Apr 22, 2026
Viaarxiv icon

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Add code
Apr 06, 2026
Viaarxiv icon

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Add code
Apr 05, 2026
Viaarxiv icon

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Add code
Apr 02, 2026
Viaarxiv icon

Omni-MMSI: Toward Identity-attributed Social Interaction Understanding

Add code
Mar 31, 2026
Viaarxiv icon

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Add code
Mar 17, 2026
Viaarxiv icon

Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation

Add code
Mar 17, 2026
Viaarxiv icon

In-Context Reinforcement Learning for Tool Use in Large Language Models

Add code
Mar 09, 2026
Viaarxiv icon