Picture for Cuiling Lan

Cuiling Lan

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Add code
Dec 13, 2024
Viaarxiv icon

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Add code
Jul 18, 2024
Viaarxiv icon

Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis

Add code
May 13, 2024
Figure 1 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 2 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 3 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Figure 4 for Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Viaarxiv icon

Slot-VLM: SlowFast Slots for Video-Language Modeling

Add code
Feb 20, 2024
Viaarxiv icon

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

Add code
Feb 15, 2024
Viaarxiv icon

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Add code
Dec 08, 2023
Viaarxiv icon

Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer

Add code
Oct 04, 2023
Viaarxiv icon

Shatter and Gather: Learning Referring Image Segmentation with Text Supervision

Add code
Aug 29, 2023
Viaarxiv icon

Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey

Add code
Aug 18, 2023
Viaarxiv icon

Adaptive Frequency Filters As Efficient Global Token Mixers

Add code
Jul 26, 2023
Viaarxiv icon