Picture for Jiuhai Chen

Jiuhai Chen

Multi-Objective Linguistic Control of Large Language Models

Add code
Jun 23, 2024
Viaarxiv icon

GenQA: Generating Millions of Instructions from a Handful of Prompts

Add code
Jun 14, 2024
Figure 1 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 2 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 3 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 4 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Viaarxiv icon

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

Add code
May 29, 2024
Viaarxiv icon

Automated Data Curation for Robust Language Model Fine-Tuning

Add code
Mar 19, 2024
Viaarxiv icon

Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

Add code
Feb 16, 2024
Viaarxiv icon

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Add code
Feb 15, 2024
Figure 1 for Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Figure 2 for Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Figure 3 for Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Figure 4 for Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Viaarxiv icon

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Add code
Feb 11, 2024
Viaarxiv icon

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

Add code
Oct 18, 2023
Viaarxiv icon

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

Add code
Sep 08, 2023
Viaarxiv icon