Picture for Weifeng Chen

Weifeng Chen

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Add code
Dec 19, 2024
Figure 1 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 2 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 3 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 4 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Viaarxiv icon

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Add code
Dec 19, 2024
Viaarxiv icon

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Viaarxiv icon

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Add code
Apr 23, 2024
Viaarxiv icon

Magic Clothing: Controllable Garment-Driven Image Synthesis

Add code
Apr 15, 2024
Viaarxiv icon

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Add code
Mar 07, 2024
Viaarxiv icon

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Add code
Feb 09, 2024
Viaarxiv icon

DiffusionGPT: LLM-Driven Text-to-Image Generation System

Add code
Jan 18, 2024
Viaarxiv icon

AffordanceLLM: Grounding Affordance from Vision Language Models

Add code
Jan 12, 2024
Figure 1 for AffordanceLLM: Grounding Affordance from Vision Language Models
Figure 2 for AffordanceLLM: Grounding Affordance from Vision Language Models
Figure 3 for AffordanceLLM: Grounding Affordance from Vision Language Models
Figure 4 for AffordanceLLM: Grounding Affordance from Vision Language Models
Viaarxiv icon

Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search

Add code
Nov 15, 2023
Viaarxiv icon