Picture for Shilong Zhang

Shilong Zhang

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Viaarxiv icon

Zero-shot Image Editing with Reference Imitation

Add code
Jun 11, 2024
Viaarxiv icon

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Add code
Jun 10, 2024
Viaarxiv icon

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Add code
Mar 25, 2024
Viaarxiv icon

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Add code
Jul 07, 2023
Viaarxiv icon

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

Add code
May 09, 2023
Figure 1 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 2 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 3 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 4 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Viaarxiv icon

Dense Distinct Query for End-to-End Object Detection

Add code
Mar 22, 2023
Figure 1 for Dense Distinct Query for End-to-End Object Detection
Figure 2 for Dense Distinct Query for End-to-End Object Detection
Figure 3 for Dense Distinct Query for End-to-End Object Detection
Figure 4 for Dense Distinct Query for End-to-End Object Detection
Viaarxiv icon

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Add code
Dec 16, 2022
Viaarxiv icon

Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection

Add code
Sep 04, 2022
Figure 1 for Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection
Figure 2 for Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection
Figure 3 for Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection
Figure 4 for Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection
Viaarxiv icon

What Are Expected Queries in End-to-End Object Detection?

Add code
Jun 02, 2022
Figure 1 for What Are Expected Queries in End-to-End Object Detection?
Figure 2 for What Are Expected Queries in End-to-End Object Detection?
Figure 3 for What Are Expected Queries in End-to-End Object Detection?
Figure 4 for What Are Expected Queries in End-to-End Object Detection?
Viaarxiv icon