Qifan Yu

SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models

Mar 10, 2025

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning

Feb 12, 2025

Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness

Dec 09, 2024

STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training

Nov 29, 2024

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Nov 24, 2024

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Nov 01, 2024

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Sep 30, 2024

A high-accuracy multi-model mixing retrosynthetic method

Sep 06, 2024

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

Nov 22, 2023

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model

Aug 15, 2023