Picture for Shilong Zhang

Shilong Zhang

Goku: Flow Based Video Generative Foundation Models

Add code
Feb 10, 2025
Viaarxiv icon

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Add code
Feb 07, 2025
Viaarxiv icon

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Add code
Dec 19, 2024
Viaarxiv icon

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Viaarxiv icon

Zero-shot Image Editing with Reference Imitation

Add code
Jun 11, 2024
Viaarxiv icon

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Add code
Jun 10, 2024
Viaarxiv icon

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Add code
Mar 25, 2024
Figure 1 for FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Figure 2 for FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Figure 3 for FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Figure 4 for FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Viaarxiv icon

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Add code
Jul 07, 2023
Viaarxiv icon

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

Add code
May 09, 2023
Figure 1 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 2 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 3 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Figure 4 for MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
Viaarxiv icon

Dense Distinct Query for End-to-End Object Detection

Add code
Mar 22, 2023
Figure 1 for Dense Distinct Query for End-to-End Object Detection
Figure 2 for Dense Distinct Query for End-to-End Object Detection
Figure 3 for Dense Distinct Query for End-to-End Object Detection
Figure 4 for Dense Distinct Query for End-to-End Object Detection
Viaarxiv icon