Siming Fu

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Feb 10, 2025
Via arXiv

A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences

Jan 19, 2025
Via arXiv

RestorerID: Towards Tuning-Free Face Restoration with ID Preservation

Nov 21, 2024
Via arXiv

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Sep 26, 2024
Via arXiv

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Aug 28, 2024
Via arXiv

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Jul 11, 2024
Via arXiv

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Jun 11, 2024
Via arXiv

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Nov 23, 2023
Via arXiv

Uniformly Distributed Category Prototype-Guided Vision-Language Framework for Long-Tail Recognition

Aug 24, 2023
Via arXiv

Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-tailed Learning

Aug 22, 2022
Via arXiv