Picture for Haoye Zhang

Haoye Zhang

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Add code
Aug 03, 2024
Figure 1 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 2 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 3 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 4 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Viaarxiv icon

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Add code
May 27, 2024
Figure 1 for RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Figure 2 for RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Figure 3 for RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Figure 4 for RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Viaarxiv icon

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Add code
Dec 01, 2023
Viaarxiv icon

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants

Add code
Oct 01, 2023
Viaarxiv icon

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Add code
Aug 23, 2023
Viaarxiv icon