Picture for Kaifu Zhang

Kaifu Zhang

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Add code
Dec 05, 2024
Viaarxiv icon

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Add code
Nov 21, 2024
Viaarxiv icon

PMMT: Preference Alignment in Multilingual Machine Translation via LLM Distillation

Add code
Oct 15, 2024
Viaarxiv icon

Building Decision Making Models Through Language Model Regime

Add code
Aug 12, 2024
Viaarxiv icon

BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training

Add code
Aug 12, 2024
Figure 1 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 2 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 3 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 4 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Viaarxiv icon

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Add code
Jun 11, 2024
Viaarxiv icon

Wings: Learning Multimodal LLMs without Text-only Forgetting

Add code
Jun 05, 2024
Figure 1 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 2 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 3 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 4 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Viaarxiv icon

Parrot: Multilingual Visual Instruction Tuning

Add code
Jun 04, 2024
Figure 1 for Parrot: Multilingual Visual Instruction Tuning
Figure 2 for Parrot: Multilingual Visual Instruction Tuning
Figure 3 for Parrot: Multilingual Visual Instruction Tuning
Figure 4 for Parrot: Multilingual Visual Instruction Tuning
Viaarxiv icon

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Add code
May 31, 2024
Figure 1 for Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Figure 2 for Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Figure 3 for Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Figure 4 for Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Viaarxiv icon