Picture for Yang Zhou

Yang Zhou

Yahoo! Labs

NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving

Add code
Apr 07, 2025
Viaarxiv icon

Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis

Add code
Apr 04, 2025
Viaarxiv icon

Time-optimal Convexified Reeds-Shepp Paths on a Sphere

Add code
Apr 01, 2025
Viaarxiv icon

Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly Detection

Add code
Mar 27, 2025
Viaarxiv icon

Video Motion Graphs

Add code
Mar 26, 2025
Viaarxiv icon

Aether: Geometric-Aware Unified World Modeling

Add code
Mar 25, 2025
Viaarxiv icon

Visual Persona: Foundation Model for Full-Body Human Customization

Add code
Mar 19, 2025
Viaarxiv icon

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation

Add code
Mar 19, 2025
Viaarxiv icon

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Add code
Mar 18, 2025
Viaarxiv icon

Introducing Unbiased Depth into 2D Gaussian Splatting for High-accuracy Surface Reconstruction

Add code
Mar 09, 2025
Viaarxiv icon