Picture for Quan Sun

Quan Sun

Emu3: Next-Token Prediction is All You Need

Add code
Sep 27, 2024
Viaarxiv icon

Diffusion Feedback Helps CLIP See Better

Add code
Jul 29, 2024
Viaarxiv icon

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Add code
Feb 06, 2024
Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Add code
Dec 20, 2023
Viaarxiv icon

CapsFusion: Rethinking Image-Text Data at Scale

Add code
Nov 02, 2023
Viaarxiv icon

Generative Pretraining in Multimodality

Add code
Jul 11, 2023
Viaarxiv icon

EVA-CLIP: Improved Training Techniques for CLIP at Scale

Add code
Mar 27, 2023
Viaarxiv icon

EVA-02: A Visual Representation for Neon Genesis

Add code
Mar 22, 2023
Viaarxiv icon

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Add code
Nov 14, 2022
Viaarxiv icon

Thermal Infrared Image Inpainting via Edge-Aware Guidance

Add code
Oct 28, 2022
Viaarxiv icon