Shilong Zhang

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Dec 19, 2024
Via arXiv

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Jul 10, 2024
Via arXiv

Zero-shot Image Editing with Reference Imitation

Jun 11, 2024
Via arXiv

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Jun 10, 2024
Via arXiv

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Mar 25, 2024
Via arXiv

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Jul 07, 2023
Via arXiv

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

May 09, 2023
Via arXiv

Dense Distinct Query for End-to-End Object Detection

Mar 22, 2023
Via arXiv

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Dec 16, 2022
Via arXiv

Consistent Teacher Provides Better Supervision in Semi-supervised Object Detection

Sep 04, 2022
Via arXiv