Shitian Zhao

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Jan 23, 2025
Via arXiv

Boosting Open-Domain Continual Learning via Leveraging Intra-domain Category-aware Prototype

Aug 19, 2024
Via arXiv

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Aug 05, 2024
Via arXiv

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Feb 08, 2024
Via arXiv

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

Dec 09, 2023
Via arXiv