Yepeng Tang

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities

Apr 02, 2025
Via arXiv

Image Difference Grounding with Natural Language

Apr 02, 2025
Via arXiv

VRoPE: Rotary Position Embedding for Video Large Language Models

Feb 17, 2025
Via arXiv

Diffusion Feedback Helps CLIP See Better

Jul 29, 2024
Via arXiv