Picture for Wanggui He

Wanggui He

T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts

Add code
Dec 05, 2024
Figure 1 for T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Figure 2 for T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Figure 3 for T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Figure 4 for T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Viaarxiv icon

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

Add code
Dec 04, 2024
Viaarxiv icon

A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization

Add code
Oct 21, 2024
Viaarxiv icon

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Add code
Aug 28, 2024
Viaarxiv icon

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

Add code
Aug 19, 2024
Figure 1 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 2 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 3 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Figure 4 for TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
Viaarxiv icon

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Add code
Jul 11, 2024
Figure 1 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 2 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 3 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 4 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Viaarxiv icon

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Add code
Jun 11, 2024
Viaarxiv icon

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

Add code
Apr 22, 2024
Viaarxiv icon

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Add code
Nov 23, 2023
Viaarxiv icon

Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training

Add code
Apr 19, 2021
Figure 1 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 2 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 3 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 4 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Viaarxiv icon