Lluis Castrejon

HAMMR: HierArchical MultiModal React agents for generic VQA

Apr 08, 2024

How (not) to ensemble LVLMs for VQA

Oct 10, 2023

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

Jun 15, 2023

Cascaded Video Generation for Videos In-the-Wild

Jun 01, 2022

Hierarchical Video Generation for Complex Data

Jun 04, 2021

Recovering Petaflops in Contrastive Semi-Supervised Learning of Visual Representations

Jun 18, 2020

Improved Conditional VRNNs for Video Prediction

Apr 27, 2019

MovieGraphs: Towards Understanding Human-Centric Situations from Videos

Apr 15, 2018

Annotating Object Instances with a Polygon-RNN

Apr 18, 2017

Cross-Modal Scene Networks

Oct 27, 2016