Picture for Fabian Caba Heilbron

Fabian Caba Heilbron

ResidualViT for Efficient Temporally Dense Video Encoding

Add code
Sep 16, 2025
Figure 1 for ResidualViT for Efficient Temporally Dense Video Encoding
Figure 2 for ResidualViT for Efficient Temporally Dense Video Encoding
Figure 3 for ResidualViT for Efficient Temporally Dense Video Encoding
Figure 4 for ResidualViT for Efficient Temporally Dense Video Encoding
Viaarxiv icon

Discovering Divergent Representations between Text-to-Image Models

Add code
Sep 10, 2025
Figure 1 for Discovering Divergent Representations between Text-to-Image Models
Figure 2 for Discovering Divergent Representations between Text-to-Image Models
Figure 3 for Discovering Divergent Representations between Text-to-Image Models
Figure 4 for Discovering Divergent Representations between Text-to-Image Models
Viaarxiv icon

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Add code
Jun 11, 2025
Viaarxiv icon

Generative Timelines for Instructed Visual Assembly

Add code
Nov 19, 2024
Viaarxiv icon

Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets

Add code
Sep 02, 2024
Figure 1 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 2 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 3 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Figure 4 for Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
Viaarxiv icon

Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval

Add code
May 06, 2024
Figure 1 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 2 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 3 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 4 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Viaarxiv icon

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Add code
Apr 05, 2024
Figure 1 for Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Figure 2 for Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Figure 3 for Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Figure 4 for Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Viaarxiv icon

Scaling Up Video Summarization Pretraining with Large Language Models

Add code
Apr 04, 2024
Viaarxiv icon

Towards Automated Movie Trailer Generation

Add code
Apr 04, 2024
Figure 1 for Towards Automated Movie Trailer Generation
Figure 2 for Towards Automated Movie Trailer Generation
Figure 3 for Towards Automated Movie Trailer Generation
Figure 4 for Towards Automated Movie Trailer Generation
Viaarxiv icon

Long-range Multimodal Pretraining for Movie Understanding

Add code
Aug 18, 2023
Viaarxiv icon