Cuiling Lan

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response

Jan 10, 2025

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Dec 22, 2024

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Dec 13, 2024

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Jul 18, 2024

Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis

May 13, 2024

Slot-VLM: SlowFast Slots for Video-Language Modeling

Feb 20, 2024

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

Feb 15, 2024

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Dec 08, 2023

Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer

Oct 04, 2023

Shatter and Gather: Learning Referring Image Segmentation with Text Supervision

Aug 29, 2023