Picture for Vittorio Ferrari

Vittorio Ferrari

MaskInversion: Localized Embeddings via Optimization of Explainability Maps

Add code
Jul 29, 2024
Viaarxiv icon

HAMMR: HierArchical MultiModal React agents for generic VQA

Add code
Apr 08, 2024
Viaarxiv icon

Grounding Everything: Emerging Localization Properties in Vision-Language Transformers

Add code
Dec 05, 2023
Viaarxiv icon

StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

Add code
Aug 22, 2023
Figure 1 for StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Figure 2 for StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Figure 3 for StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Figure 4 for StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Viaarxiv icon

CAD-Estate: Large-scale CAD Model Annotation in RGB Videos

Add code
Jun 15, 2023
Viaarxiv icon

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

Add code
Jun 15, 2023
Viaarxiv icon

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Add code
Jun 15, 2023
Figure 1 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 2 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 3 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 4 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Viaarxiv icon

Estimating Generic 3D Room Structures from 2D Annotations

Add code
Jun 15, 2023
Viaarxiv icon

Tracking by 3D Model Estimation of Unknown Objects in Videos

Add code
Apr 13, 2023
Viaarxiv icon

Connecting Vision and Language with Video Localized Narratives

Add code
Mar 15, 2023
Viaarxiv icon