Picture for Cheng Chen

Cheng Chen

Generalized Recognition of Basic Surgical Actions Enables Skill Assessment and Vision-Language-Model-based Surgical Planning

Add code
Mar 13, 2026
Viaarxiv icon

EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

Add code
Mar 04, 2026
Viaarxiv icon

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Add code
Mar 03, 2026
Viaarxiv icon

OpAgent: Operator Agent for Web Navigation

Add code
Feb 14, 2026
Viaarxiv icon

Fully First-Order Algorithms for Online Bilevel Optimization

Add code
Feb 12, 2026
Viaarxiv icon

Achieving Better Local Regret Bound for Online Non-Convex Bilevel Optimization

Add code
Feb 06, 2026
Viaarxiv icon

ScDiVa: Masked Discrete Diffusion for Joint Modeling of Single-Cell Identity and Expression

Add code
Feb 03, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

Autonomous Chain-of-Thought Distillation for Graph-Based Fraud Detection

Add code
Jan 30, 2026
Viaarxiv icon

SAMTok: Representing Any Mask with Two Words

Add code
Jan 22, 2026
Viaarxiv icon