Picture for Yalong Bai

Yalong Bai

Learning User Preferences for Image Generation Model

Add code
Aug 11, 2025
Viaarxiv icon

V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation

Add code
Mar 10, 2025
Viaarxiv icon

Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing

Add code
Nov 29, 2024
Figure 1 for Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Figure 2 for Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Figure 3 for Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Figure 4 for Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Viaarxiv icon

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

Add code
Jun 16, 2024
Figure 1 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 2 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 3 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Figure 4 for STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
Viaarxiv icon

Dynamic Prompt Optimizing for Text-to-Image Generation

Add code
Apr 05, 2024
Figure 1 for Dynamic Prompt Optimizing for Text-to-Image Generation
Figure 2 for Dynamic Prompt Optimizing for Text-to-Image Generation
Figure 3 for Dynamic Prompt Optimizing for Text-to-Image Generation
Figure 4 for Dynamic Prompt Optimizing for Text-to-Image Generation
Viaarxiv icon

StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models

Add code
Jan 25, 2024
Figure 1 for StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
Figure 2 for StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
Figure 3 for StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
Figure 4 for StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
Viaarxiv icon

Learning and Evaluating Human Preferences for Conversational Head Generation

Add code
Aug 02, 2023
Figure 1 for Learning and Evaluating Human Preferences for Conversational Head Generation
Figure 2 for Learning and Evaluating Human Preferences for Conversational Head Generation
Figure 3 for Learning and Evaluating Human Preferences for Conversational Head Generation
Figure 4 for Learning and Evaluating Human Preferences for Conversational Head Generation
Viaarxiv icon

Interactive Conversational Head Generation

Add code
Jul 05, 2023
Figure 1 for Interactive Conversational Head Generation
Figure 2 for Interactive Conversational Head Generation
Figure 3 for Interactive Conversational Head Generation
Figure 4 for Interactive Conversational Head Generation
Viaarxiv icon

Deep Equilibrium Multimodal Fusion

Add code
Jun 29, 2023
Figure 1 for Deep Equilibrium Multimodal Fusion
Figure 2 for Deep Equilibrium Multimodal Fusion
Figure 3 for Deep Equilibrium Multimodal Fusion
Figure 4 for Deep Equilibrium Multimodal Fusion
Viaarxiv icon

Visual-Aware Text-to-Speech

Add code
Jun 21, 2023
Figure 1 for Visual-Aware Text-to-Speech
Figure 2 for Visual-Aware Text-to-Speech
Figure 3 for Visual-Aware Text-to-Speech
Figure 4 for Visual-Aware Text-to-Speech
Viaarxiv icon