Zhongang Qi

Taming Rectified Flow for Inversion and Editing

Nov 07, 2024
Via arXiv

E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding

Sep 26, 2024
Via arXiv

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

Aug 23, 2024
Via arXiv

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Aug 07, 2024
Via arXiv

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Jul 10, 2024
Via arXiv

EA-VTR: Event-Aware Video-Text Retrieval

Jul 10, 2024
Via arXiv

PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM

Jun 05, 2024
Via arXiv

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

Mar 15, 2024
Via arXiv

RecDCL: Dual Contrastive Learning for Recommendation

Jan 28, 2024
Via arXiv

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Dec 07, 2023
Via arXiv