Xinyan Xiao

Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training

Oct 06, 2024
Via arXiv

MonoFormer: One Transformer for Both Diffusion and Autoregression

Sep 24, 2024
Via arXiv

UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion

Jan 25, 2024
Via arXiv

UniVG: Towards UNIfied-modal Video Generation

Jan 17, 2024
Via arXiv

HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models

Jan 11, 2024
Via arXiv

Exploiting Diffusion Priors for All-in-One Image Restoration

Dec 02, 2023
Via arXiv

UNIMO-3: Multi-granularity Interaction for Vision-Language Representation Learning

May 23, 2023
Via arXiv

WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning

Dec 20, 2022
Via arXiv

UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance

Nov 03, 2022
Via arXiv

FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness

Nov 01, 2022
Via arXiv