Picture for Cristian Rodriguez-Opazo

Cristian Rodriguez-Opazo

Knowledge Composition using Task Vectors with Learned Anisotropic Scaling

Add code
Jul 03, 2024
Viaarxiv icon

Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling

Add code
May 27, 2024
Viaarxiv icon

Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances

Add code
Dec 22, 2023
Viaarxiv icon

LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach

Add code
Dec 19, 2021
Figure 1 for LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach
Figure 2 for LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach
Figure 3 for LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach
Figure 4 for LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach
Viaarxiv icon

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

Add code
Aug 09, 2021
Figure 1 for Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Figure 2 for Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Figure 3 for Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Figure 4 for Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Viaarxiv icon

A Recurrent Vision-and-Language BERT for Navigation

Add code
Nov 26, 2020
Figure 1 for A Recurrent Vision-and-Language BERT for Navigation
Figure 2 for A Recurrent Vision-and-Language BERT for Navigation
Figure 3 for A Recurrent Vision-and-Language BERT for Navigation
Figure 4 for A Recurrent Vision-and-Language BERT for Navigation
Viaarxiv icon

Language and Visual Entity Relationship Graph for Agent Navigation

Add code
Oct 19, 2020
Figure 1 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 2 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 3 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 4 for Language and Visual Entity Relationship Graph for Agent Navigation
Viaarxiv icon

DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video

Add code
Oct 13, 2020
Figure 1 for DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
Figure 2 for DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
Figure 3 for DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
Figure 4 for DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
Viaarxiv icon

The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose

Add code
Jul 01, 2020
Figure 1 for The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose
Figure 2 for The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose
Figure 3 for The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose
Figure 4 for The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose
Viaarxiv icon

A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews

Add code
May 28, 2020
Figure 1 for A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Figure 2 for A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Figure 3 for A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Figure 4 for A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Viaarxiv icon