Picture for Haofan Wang

Haofan Wang

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Add code
Nov 15, 2024
Viaarxiv icon

Training-free Regional Prompting for Diffusion Transformers

Add code
Nov 04, 2024
Viaarxiv icon

InstantIR: Blind Image Restoration with Instant Generative Reference

Add code
Oct 09, 2024
Figure 1 for InstantIR: Blind Image Restoration with Instant Generative Reference
Figure 2 for InstantIR: Blind Image Restoration with Instant Generative Reference
Figure 3 for InstantIR: Blind Image Restoration with Instant Generative Reference
Figure 4 for InstantIR: Blind Image Restoration with Instant Generative Reference
Viaarxiv icon

Image Watermarks are Removable Using Controllable Regeneration from Clean Noise

Add code
Oct 07, 2024
Viaarxiv icon

CSGO: Content-Style Composition in Text-to-Image Generation

Add code
Sep 04, 2024
Figure 1 for CSGO: Content-Style Composition in Text-to-Image Generation
Figure 2 for CSGO: Content-Style Composition in Text-to-Image Generation
Figure 3 for CSGO: Content-Style Composition in Text-to-Image Generation
Figure 4 for CSGO: Content-Style Composition in Text-to-Image Generation
Viaarxiv icon

Multi-scale Multi-instance Visual Sound Localization and Segmentation

Add code
Aug 31, 2024
Viaarxiv icon

InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation

Add code
Jun 30, 2024
Viaarxiv icon

Unified Video-Language Pre-training with Synchronized Audio

Add code
May 12, 2024
Viaarxiv icon

Multimodal Sense-Informed Prediction of 3D Human Motions

Add code
May 05, 2024
Figure 1 for Multimodal Sense-Informed Prediction of 3D Human Motions
Figure 2 for Multimodal Sense-Informed Prediction of 3D Human Motions
Figure 3 for Multimodal Sense-Informed Prediction of 3D Human Motions
Figure 4 for Multimodal Sense-Informed Prediction of 3D Human Motions
Viaarxiv icon

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation

Add code
Apr 04, 2024
Viaarxiv icon