Picture for Qifan Yu

Qifan Yu

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Add code
Nov 01, 2024
Viaarxiv icon

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Add code
Sep 30, 2024
Viaarxiv icon

A high-accuracy multi-model mixing retrosynthetic method

Add code
Sep 06, 2024
Figure 1 for A high-accuracy multi-model mixing retrosynthetic method
Figure 2 for A high-accuracy multi-model mixing retrosynthetic method
Figure 3 for A high-accuracy multi-model mixing retrosynthetic method
Figure 4 for A high-accuracy multi-model mixing retrosynthetic method
Viaarxiv icon

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

Add code
Nov 22, 2023
Viaarxiv icon

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model

Add code
Aug 15, 2023
Viaarxiv icon

Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration

Add code
May 22, 2023
Viaarxiv icon

Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World

Add code
Mar 23, 2023
Viaarxiv icon