Cuiling Lan

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response

Jan 10, 2025
Via arXiv

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Dec 22, 2024
Via arXiv

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Dec 13, 2024
Via arXiv

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Jul 18, 2024
Via arXiv

Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis

May 13, 2024
Via arXiv

Slot-VLM: SlowFast Slots for Video-Language Modeling

Feb 20, 2024
Via arXiv

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

Feb 15, 2024
Via arXiv

Retrieval-based Video Language Model for Efficient Long Video Question Answering

Dec 08, 2023
Via arXiv

Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer

Oct 04, 2023
Via arXiv

Shatter and Gather: Learning Referring Image Segmentation with Text Supervision

Aug 29, 2023
Via arXiv