Picture for Chia-Chih Chen

Chia-Chih Chen

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Add code
Aug 16, 2024
Figure 1 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 2 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 3 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 4 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Viaarxiv icon

HIVE: Harnessing Human Feedback for Instructional Visual Editing

Add code
Mar 16, 2023
Viaarxiv icon

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Add code
Dec 19, 2022
Figure 1 for LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Figure 2 for LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Figure 3 for LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Figure 4 for LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
Viaarxiv icon