Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation

Add code
Apr 23, 2024

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: