Picture for Yang Zhou

Yang Zhou

Yahoo! Labs

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation

Add code
Mar 19, 2025
Viaarxiv icon

Visual Persona: Foundation Model for Full-Body Human Customization

Add code
Mar 19, 2025
Viaarxiv icon

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Add code
Mar 18, 2025
Viaarxiv icon

Introducing Unbiased Depth into 2D Gaussian Splatting for High-accuracy Surface Reconstruction

Add code
Mar 09, 2025
Viaarxiv icon

TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction

Add code
Mar 06, 2025
Viaarxiv icon

V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors

Add code
Mar 04, 2025
Viaarxiv icon

Attention Distillation: A Unified Approach to Visual Characteristics Transfer

Add code
Feb 27, 2025
Viaarxiv icon

Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios

Add code
Feb 27, 2025
Viaarxiv icon

StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models

Add code
Feb 13, 2025
Viaarxiv icon

GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?

Add code
Feb 07, 2025
Viaarxiv icon