Picture for Weifeng Lin

Weifeng Lin

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Add code
Aug 05, 2024
Viaarxiv icon

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Add code
Apr 01, 2024
Viaarxiv icon

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Add code
Feb 08, 2024
Viaarxiv icon

Hierarchical Side-Tuning for Vision Transformers

Add code
Oct 10, 2023
Viaarxiv icon

Scale-Aware Modulation Meet Transformer

Add code
Jul 26, 2023
Viaarxiv icon