Picture for Yuwei Wu

Yuwei Wu

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon

Hyperbolic Dual Feature Augmentation for Open-Environment

Add code
Jun 10, 2025
Viaarxiv icon

Large Language Models are Demonstration Pre-Selectors for Themselves

Add code
Jun 06, 2025
Viaarxiv icon

Multi-Sourced Compositional Generalization in Visual Question Answering

Add code
May 29, 2025
Viaarxiv icon

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

Add code
May 21, 2025
Viaarxiv icon

Memory-Centric Embodied Question Answer

Add code
May 20, 2025
Viaarxiv icon

Multi-Label Stereo Matching for Transparent Scene Depth Estimation

Add code
May 20, 2025
Viaarxiv icon

Diving into the Fusion of Monocular Priors for Generalized Stereo Matching

Add code
May 20, 2025
Viaarxiv icon

3D Visual Illusion Depth Estimation

Add code
May 19, 2025
Viaarxiv icon

LLM-Land: Large Language Models for Context-Aware Drone Landing

Add code
May 09, 2025
Viaarxiv icon