Qing Jiang

ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Dec 02, 2024
Via arXiv

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Nov 21, 2024
Via arXiv

Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection

May 16, 2024
Via arXiv

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Mar 21, 2024
Via arXiv

Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks

Jan 25, 2024
Via arXiv

T-Rex: Counting by Visual Prompting

Nov 22, 2023
Via arXiv

Visual In-Context Prompting

Nov 22, 2023
Via arXiv

Revisiting Scene Text Recognition: A Data Perspective

Jul 19, 2023
Via arXiv