Picture for Yang Dai

Yang Dai

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

Add code
May 28, 2026
Viaarxiv icon

EgoCoT-Bench: Benchmarking Grounded and Verifiable Operation-Centric Chain of Thought Reasoning for MLLMs

Add code
May 19, 2026
Viaarxiv icon

CaptchaMind: Training CAPTCHA Solvers via Reinforcement Learning with Explicit Reasoning Supervision

Add code
May 19, 2026
Viaarxiv icon

Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization

Add code
Apr 01, 2026
Viaarxiv icon

VIVID-Med: LLM-Supervised Structured Pretraining for Deployable Medical ViTs

Add code
Mar 11, 2026
Viaarxiv icon

The Power of Decaying Steps: Enhancing Attack Stability and Transferability for Sign-based Optimizers

Add code
Feb 22, 2026
Viaarxiv icon

CtrlCoT: Dual-Granularity Chain-of-Thought Compression for Controllable Reasoning

Add code
Jan 28, 2026
Viaarxiv icon

SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning

Add code
Jan 08, 2026
Viaarxiv icon