Picture for Zeyuan Chen

Zeyuan Chen

CADGrasp: Learning Contact and Collision Aware General Dexterous Grasping in Cluttered Scenes

Add code
Jan 21, 2026
Viaarxiv icon

Soft Tail-dropping for Adaptive Visual Tokenization

Add code
Jan 20, 2026
Viaarxiv icon

APEX: Academic Poster Editing Agentic Expert

Add code
Jan 08, 2026
Viaarxiv icon

CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning

Add code
Dec 09, 2025
Viaarxiv icon

C3Editor: Achieving Controllable Consistency in 2D Model for 3D Editing

Add code
Oct 06, 2025
Viaarxiv icon

WALT: Web Agents that Learn Tools

Add code
Oct 01, 2025
Viaarxiv icon

SCUBA: Salesforce Computer Use Benchmark

Add code
Sep 30, 2025
Figure 1 for SCUBA: Salesforce Computer Use Benchmark
Figure 2 for SCUBA: Salesforce Computer Use Benchmark
Figure 3 for SCUBA: Salesforce Computer Use Benchmark
Figure 4 for SCUBA: Salesforce Computer Use Benchmark
Viaarxiv icon

CoAct-1: Computer-using Agents with Coding as Actions

Add code
Aug 05, 2025
Figure 1 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 2 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 3 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 4 for CoAct-1: Computer-using Agents with Coding as Actions
Viaarxiv icon

YOLO-Count: Differentiable Object Counting for Text-to-Image Generation

Add code
Aug 01, 2025
Figure 1 for YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Figure 2 for YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Figure 3 for YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Figure 4 for YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
Viaarxiv icon

DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion

Add code
Jul 30, 2025
Viaarxiv icon