Picture for Ji Li

Ji Li

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Add code
Aug 11, 2024
Figure 1 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 2 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 3 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 4 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Viaarxiv icon

Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

Add code
Jun 14, 2024
Viaarxiv icon

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Figure 1 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 2 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 3 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 4 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Viaarxiv icon

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Add code
Jun 12, 2024
Viaarxiv icon

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Add code
Jun 06, 2024
Figure 1 for Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Figure 2 for Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Figure 3 for Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Figure 4 for Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Viaarxiv icon

Data-driven Energy Consumption Modelling for Electric Micromobility using an Open Dataset

Add code
Mar 26, 2024
Viaarxiv icon

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing

Add code
Mar 21, 2024
Figure 1 for DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Figure 2 for DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Figure 3 for DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Figure 4 for DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Viaarxiv icon

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Add code
Mar 14, 2024
Viaarxiv icon

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Add code
Mar 01, 2024
Viaarxiv icon

Privacy-Aware Energy Consumption Modeling of Connected Battery Electric Vehicles using Federated Learning

Add code
Dec 12, 2023
Viaarxiv icon