Picture for Weilin Huang

Weilin Huang

Fast Prompt Alignment for Text-to-Image Generation

Add code
Dec 11, 2024
Viaarxiv icon

SeedEdit: Align Image Re-Generation to Image Editing

Add code
Nov 11, 2024
Viaarxiv icon

UniFL: Improve Stable Diffusion via Unified Feedback Learning

Add code
Apr 08, 2024
Figure 1 for UniFL: Improve Stable Diffusion via Unified Feedback Learning
Figure 2 for UniFL: Improve Stable Diffusion via Unified Feedback Learning
Figure 3 for UniFL: Improve Stable Diffusion via Unified Feedback Learning
Figure 4 for UniFL: Improve Stable Diffusion via Unified Feedback Learning
Viaarxiv icon

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

Add code
Dec 12, 2023
Figure 1 for Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models
Figure 2 for Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models
Figure 3 for Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models
Figure 4 for Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models
Viaarxiv icon

Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation

Add code
Nov 30, 2023
Figure 1 for Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation
Figure 2 for Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation
Figure 3 for Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation
Figure 4 for Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation
Viaarxiv icon

Forgedit: Text Guided Image Editing via Learning and Forgetting

Add code
Sep 19, 2023
Viaarxiv icon

Cross-domain Augmentation Networks for Click-Through Rate Prediction

Add code
May 09, 2023
Viaarxiv icon

Mixer: Image to Multi-Modal Retrieval Learning for Industrial Application

Add code
May 06, 2023
Viaarxiv icon

Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding

Add code
Sep 28, 2022
Figure 1 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 2 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 3 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 4 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Viaarxiv icon

Cross-Architecture Self-supervised Video Representation Learning

Add code
May 26, 2022
Figure 1 for Cross-Architecture Self-supervised Video Representation Learning
Figure 2 for Cross-Architecture Self-supervised Video Representation Learning
Figure 3 for Cross-Architecture Self-supervised Video Representation Learning
Figure 4 for Cross-Architecture Self-supervised Video Representation Learning
Viaarxiv icon