Picture for Hongtao Xie

Hongtao Xie

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

Add code
Oct 14, 2024
Viaarxiv icon

How Control Information Influences Multilingual Text Image Generation and Editing?

Add code
Jul 16, 2024
Viaarxiv icon

Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition

Add code
Jul 08, 2024
Viaarxiv icon

Pistis-RAG: A Scalable Cascading Framework Towards Trustworthy Retrieval-Augmented Generation

Add code
Jun 21, 2024
Viaarxiv icon

Hallucination Mitigation Prompts Long-term Video Understanding

Add code
Jun 17, 2024
Viaarxiv icon

DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection

Add code
May 16, 2024
Viaarxiv icon

Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

Add code
May 11, 2024
Viaarxiv icon

Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing

Add code
May 07, 2024
Viaarxiv icon

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

Add code
Apr 08, 2024
Viaarxiv icon

DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations

Add code
Mar 12, 2024
Figure 1 for DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
Figure 2 for DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
Figure 3 for DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
Figure 4 for DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
Viaarxiv icon