Picture for Yinxiao Li

Yinxiao Li

Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation

Add code
Jan 11, 2025
Viaarxiv icon

DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes

Add code
Dec 15, 2024
Viaarxiv icon

A Simple Approach to Unifying Diffusion-based Conditional Generation

Add code
Oct 15, 2024
Figure 1 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 2 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 3 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 4 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Viaarxiv icon

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

Add code
Aug 14, 2024
Figure 1 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 2 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 3 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 4 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Viaarxiv icon

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Add code
Jan 11, 2024
Figure 1 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 2 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 3 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 4 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Viaarxiv icon

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

Add code
Mar 22, 2023
Viaarxiv icon

MaxViT: Multi-Axis Vision Transformer

Add code
Apr 04, 2022
Figure 1 for MaxViT: Multi-Axis Vision Transformer
Figure 2 for MaxViT: Multi-Axis Vision Transformer
Figure 3 for MaxViT: Multi-Axis Vision Transformer
Figure 4 for MaxViT: Multi-Axis Vision Transformer
Viaarxiv icon

MAXIM: Multi-Axis MLP for Image Processing

Add code
Jan 09, 2022
Figure 1 for MAXIM: Multi-Axis MLP for Image Processing
Figure 2 for MAXIM: Multi-Axis MLP for Image Processing
Figure 3 for MAXIM: Multi-Axis MLP for Image Processing
Figure 4 for MAXIM: Multi-Axis MLP for Image Processing
Viaarxiv icon

COMISR: Compression-Informed Video Super-Resolution

Add code
May 04, 2021
Figure 1 for COMISR: Compression-Informed Video Super-Resolution
Figure 2 for COMISR: Compression-Informed Video Super-Resolution
Figure 3 for COMISR: Compression-Informed Video Super-Resolution
Figure 4 for COMISR: Compression-Informed Video Super-Resolution
Viaarxiv icon

PERF-Net: Pose Empowered RGB-Flow Net

Add code
Sep 28, 2020
Figure 1 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 2 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 3 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 4 for PERF-Net: Pose Empowered RGB-Flow Net
Viaarxiv icon