Picture for Fabio Carrara

Fabio Carrara

ISTI CNR, Pisa, Italy

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

Add code
Nov 28, 2024
Viaarxiv icon

Is CLIP the main roadblock for fine-grained open-world perception?

Add code
Apr 04, 2024
Viaarxiv icon

The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding

Add code
Nov 29, 2023
Figure 1 for The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding
Figure 2 for The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding
Figure 3 for The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding
Figure 4 for The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding
Viaarxiv icon

The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation

Add code
Apr 28, 2023
Figure 1 for The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation
Figure 2 for The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation
Figure 3 for The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation
Figure 4 for The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation
Viaarxiv icon

Deep learning for structural health monitoring: An application to heritage structures

Add code
Nov 04, 2022
Viaarxiv icon

Recurrent Vision Transformer for Solving Visual Reasoning Problems

Add code
Nov 29, 2021
Figure 1 for Recurrent Vision Transformer for Solving Visual Reasoning Problems
Figure 2 for Recurrent Vision Transformer for Solving Visual Reasoning Problems
Figure 3 for Recurrent Vision Transformer for Solving Visual Reasoning Problems
Figure 4 for Recurrent Vision Transformer for Solving Visual Reasoning Problems
Viaarxiv icon

Multi-Camera Vehicle Counting Using Edge-AI

Add code
Jun 05, 2021
Figure 1 for Multi-Camera Vehicle Counting Using Edge-AI
Figure 2 for Multi-Camera Vehicle Counting Using Edge-AI
Figure 3 for Multi-Camera Vehicle Counting Using Edge-AI
Figure 4 for Multi-Camera Vehicle Counting Using Edge-AI
Viaarxiv icon

Solving the Same-Different Task with Convolutional Neural Networks

Add code
Jan 22, 2021
Figure 1 for Solving the Same-Different Task with Convolutional Neural Networks
Figure 2 for Solving the Same-Different Task with Convolutional Neural Networks
Figure 3 for Solving the Same-Different Task with Convolutional Neural Networks
Figure 4 for Solving the Same-Different Task with Convolutional Neural Networks
Viaarxiv icon

Combining GANs and AutoEncoders for Efficient Anomaly Detection

Add code
Nov 26, 2020
Figure 1 for Combining GANs and AutoEncoders for Efficient Anomaly Detection
Figure 2 for Combining GANs and AutoEncoders for Efficient Anomaly Detection
Figure 3 for Combining GANs and AutoEncoders for Efficient Anomaly Detection
Figure 4 for Combining GANs and AutoEncoders for Efficient Anomaly Detection
Viaarxiv icon

The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval

Add code
Aug 06, 2020
Figure 1 for The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval
Figure 2 for The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval
Figure 3 for The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval
Figure 4 for The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval
Viaarxiv icon