Picture for Wenwen Yu

Wenwen Yu

ClickTrack: Towards Real-time Interactive Single Object Tracking

Add code
Nov 24, 2024
Viaarxiv icon

Click; Single Object Tracking; Video Object Segmentation; Real-time Interaction

Add code
Nov 20, 2024
Viaarxiv icon

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Add code
Mar 28, 2024
Viaarxiv icon

P2Seg: Pointly-supervised Segmentation via Mutual Distillation

Add code
Jan 18, 2024
Viaarxiv icon

P2RBox: A Single Point is All You Need for Oriented Object Detection

Add code
Nov 22, 2023
Figure 1 for P2RBox: A Single Point is All You Need for Oriented Object Detection
Figure 2 for P2RBox: A Single Point is All You Need for Oriented Object Detection
Figure 3 for P2RBox: A Single Point is All You Need for Oriented Object Detection
Figure 4 for P2RBox: A Single Point is All You Need for Oriented Object Detection
Viaarxiv icon

Turning a CLIP Model into a Scene Text Spotter

Add code
Aug 21, 2023
Figure 1 for Turning a CLIP Model into a Scene Text Spotter
Figure 2 for Turning a CLIP Model into a Scene Text Spotter
Figure 3 for Turning a CLIP Model into a Scene Text Spotter
Figure 4 for Turning a CLIP Model into a Scene Text Spotter
Viaarxiv icon

Looking and Listening: Audio Guided Text Recognition

Add code
Jun 06, 2023
Figure 1 for Looking and Listening: Audio Guided Text Recognition
Figure 2 for Looking and Listening: Audio Guided Text Recognition
Figure 3 for Looking and Listening: Audio Guided Text Recognition
Figure 4 for Looking and Listening: Audio Guided Text Recognition
Viaarxiv icon

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Add code
Jun 05, 2023
Viaarxiv icon

On the Hidden Mystery of OCR in Large Multimodal Models

Add code
May 13, 2023
Viaarxiv icon

ICDAR 2023 Competition on Reading the Seal Title

Add code
Apr 24, 2023
Viaarxiv icon