Picture for Zeyu Lu

Zeyu Lu

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Add code
Oct 17, 2024
Viaarxiv icon

Diffusion Models Need Visual Priors for Image Generation

Add code
Oct 11, 2024
Figure 1 for Diffusion Models Need Visual Priors for Image Generation
Figure 2 for Diffusion Models Need Visual Priors for Image Generation
Figure 3 for Diffusion Models Need Visual Priors for Image Generation
Figure 4 for Diffusion Models Need Visual Priors for Image Generation
Viaarxiv icon

GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI

Add code
Sep 02, 2024
Viaarxiv icon

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Add code
Jul 11, 2024
Viaarxiv icon

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Add code
May 13, 2024
Figure 1 for Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Figure 2 for Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Figure 3 for Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Figure 4 for Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Viaarxiv icon

A Survey on Long Video Generation: Challenges, Methods, and Prospects

Add code
Mar 25, 2024
Figure 1 for A Survey on Long Video Generation: Challenges, Methods, and Prospects
Figure 2 for A Survey on Long Video Generation: Challenges, Methods, and Prospects
Figure 3 for A Survey on Long Video Generation: Challenges, Methods, and Prospects
Figure 4 for A Survey on Long Video Generation: Challenges, Methods, and Prospects
Viaarxiv icon

FiT: Flexible Vision Transformer for Diffusion Model

Add code
Feb 19, 2024
Figure 1 for FiT: Flexible Vision Transformer for Diffusion Model
Figure 2 for FiT: Flexible Vision Transformer for Diffusion Model
Figure 3 for FiT: Flexible Vision Transformer for Diffusion Model
Figure 4 for FiT: Flexible Vision Transformer for Diffusion Model
Viaarxiv icon

LLaMA Pro: Progressive LLaMA with Block Expansion

Add code
Jan 04, 2024
Viaarxiv icon

$π$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation

Add code
Apr 28, 2023
Viaarxiv icon

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

Add code
Apr 25, 2023
Viaarxiv icon