Qingdong He

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Dec 04, 2024

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Nov 26, 2024

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation

Nov 25, 2024

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Nov 22, 2024

Typicalness-Aware Learning for Failure Detection

Nov 04, 2024

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Sep 16, 2024

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Aug 24, 2024

Effective Motion Modeling for UAV-platform Multiple Object Tracking with Re-Margin Loss

Jul 15, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Jun 06, 2024

NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

May 31, 2024