Picture for Changyou Chen

Changyou Chen

ANU & NICTA

A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation

Add code
Dec 20, 2024
Viaarxiv icon

Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements

Add code
Nov 15, 2024
Viaarxiv icon

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Add code
Nov 02, 2024
Viaarxiv icon

TextLap: Customizing Language Models for Text-to-Layout Planning

Add code
Oct 09, 2024
Viaarxiv icon

MMR: Evaluating Reading Ability of Large Multimodal Models

Add code
Aug 26, 2024
Figure 1 for MMR: Evaluating Reading Ability of Large Multimodal Models
Figure 2 for MMR: Evaluating Reading Ability of Large Multimodal Models
Figure 3 for MMR: Evaluating Reading Ability of Large Multimodal Models
Figure 4 for MMR: Evaluating Reading Ability of Large Multimodal Models
Viaarxiv icon

LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models

Add code
Jul 27, 2024
Figure 1 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 2 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 3 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 4 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Viaarxiv icon

Diffusion Models for Multi-Task Generative Modeling

Add code
Jul 24, 2024
Figure 1 for Diffusion Models for Multi-Task Generative Modeling
Figure 2 for Diffusion Models for Multi-Task Generative Modeling
Figure 3 for Diffusion Models for Multi-Task Generative Modeling
Figure 4 for Diffusion Models for Multi-Task Generative Modeling
Viaarxiv icon

Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning

Add code
Jul 24, 2024
Figure 1 for Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Figure 2 for Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Figure 3 for Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Figure 4 for Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Viaarxiv icon

TRINS: Towards Multimodal Language Models that Can Read

Add code
Jun 10, 2024
Viaarxiv icon

LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs

Add code
May 02, 2024
Viaarxiv icon