Boxuan Zhang

TRACES: Proactive Safety Auditing for Multi-Turn LLM Agents via Trajectory-State Modeling

May 26, 2026
Via arXiv

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

May 14, 2026
Via arXiv

Micro-Defects Expose Macro-Fakes: Detecting AI-Generated Images via Local Distributional Shifts

May 10, 2026
Via arXiv

OptiVerse: A Comprehensive Benchmark towards Optimization Problem Solving

Apr 23, 2026
Via arXiv

Dual-Cluster Memory Agent: Resolving Multi-Paradigm Ambiguity in Optimization Problem Solving

Apr 22, 2026
Via arXiv

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Apr 13, 2026
Via arXiv

Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Mar 18, 2026
Via arXiv

Differentiable Geometric Indexing for End-to-End Generative Retrieval

Mar 11, 2026
Via arXiv

Data Augmentation for High-Fidelity Generation of CAR-T/NK Immunological Synapse Images

Feb 03, 2026
Via arXiv

Mixture-of-World Models: Scaling Multi-Task Reinforcement Learning with Modular Latent Dynamics

Feb 01, 2026
Via arXiv