Picture for Jieyu Zhang

Jieyu Zhang

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Add code
Dec 11, 2024
Viaarxiv icon

Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming

Add code
Dec 11, 2024
Viaarxiv icon

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Add code
Dec 10, 2024
Viaarxiv icon

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models

Add code
Dec 09, 2024
Viaarxiv icon

EcoAct: Economic Agent Determines When to Register What Action

Add code
Nov 03, 2024
Viaarxiv icon

Language Model Preference Evaluation with Multiple Weak Evaluators

Add code
Oct 14, 2024
Figure 1 for Language Model Preference Evaluation with Multiple Weak Evaluators
Figure 2 for Language Model Preference Evaluation with Multiple Weak Evaluators
Figure 3 for Language Model Preference Evaluation with Multiple Weak Evaluators
Figure 4 for Language Model Preference Evaluation with Multiple Weak Evaluators
Viaarxiv icon

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Add code
Aug 16, 2024
Figure 1 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 2 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 3 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Figure 4 for xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
Viaarxiv icon

Rethinking LLM-based Preference Evaluation

Add code
Jul 01, 2024
Viaarxiv icon

Biomedical Visual Instruction Tuning with Clinician Preference Alignment

Add code
Jun 19, 2024
Figure 1 for Biomedical Visual Instruction Tuning with Clinician Preference Alignment
Figure 2 for Biomedical Visual Instruction Tuning with Clinician Preference Alignment
Figure 3 for Biomedical Visual Instruction Tuning with Clinician Preference Alignment
Figure 4 for Biomedical Visual Instruction Tuning with Clinician Preference Alignment
Viaarxiv icon

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon