Picture for Kaifu Zhang

Kaifu Zhang

Evaluating Image Caption via Cycle-consistent Text-to-Image Generation

Add code
Jan 08, 2025
Figure 1 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 2 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 3 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 4 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Viaarxiv icon

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Add code
Jan 06, 2025
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Viaarxiv icon

Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models

Add code
Dec 23, 2024
Viaarxiv icon

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Add code
Dec 05, 2024
Viaarxiv icon

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Add code
Nov 21, 2024
Viaarxiv icon

PMMT: Preference Alignment in Multilingual Machine Translation via LLM Distillation

Add code
Oct 15, 2024
Viaarxiv icon

Building Decision Making Models Through Language Model Regime

Add code
Aug 12, 2024
Viaarxiv icon

BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training

Add code
Aug 12, 2024
Figure 1 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 2 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 3 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Figure 4 for BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Viaarxiv icon

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Add code
Jun 11, 2024
Viaarxiv icon