Picture for Yongtao Wang

Yongtao Wang

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

Add code
Jun 15, 2026
Viaarxiv icon

DrivingAgent: Design and Scheduling Agents for Autonomous Driving Systems

Add code
Jun 11, 2026
Viaarxiv icon

Feat2Go: Visual Feature-Grounded Value Estimation for Embodied Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

3DVLA: Enhancing Vision-Language-Action Models via 3D Spatial and Instance Understanding

Add code
May 28, 2026
Viaarxiv icon

HiDrive: A Closed-Loop Benchmark for High-Level Autonomous Driving

Add code
May 11, 2026
Viaarxiv icon

VL-SAM-v3: Memory-Guided Visual Priors for Open-World Object Detection

Add code
May 05, 2026
Viaarxiv icon

QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models

Add code
Apr 03, 2026
Viaarxiv icon

ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Add code
Mar 25, 2026
Viaarxiv icon

R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection

Add code
Mar 12, 2026
Viaarxiv icon

YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search

Add code
Mar 10, 2026
Viaarxiv icon