Picture for Francesco Barbato

Francesco Barbato

SalFormer360: a transformer-based saliency estimation model for 360-degree videos

Add code
Feb 04, 2026
Viaarxiv icon

Split&Splat: Zero-Shot Panoptic Segmentation via Explicit Instance Modeling and 3D Gaussian Splatting

Add code
Feb 01, 2026
Viaarxiv icon

MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment

Add code
Sep 17, 2025
Viaarxiv icon

Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection

Add code
Jul 01, 2024
Viaarxiv icon

When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather

Add code
Mar 20, 2024
Figure 1 for When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather
Figure 2 for When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather
Figure 3 for When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather
Figure 4 for When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather
Viaarxiv icon

A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation

Add code
Feb 29, 2024
Viaarxiv icon

RECALL+: Adversarial Web-based Replay for Continual Learning in Semantic Segmentation

Add code
Sep 19, 2023
Viaarxiv icon

SynDrone -- Multi-modal UAV Dataset for Urban Scenarios

Add code
Aug 21, 2023
Figure 1 for SynDrone -- Multi-modal UAV Dataset for Urban Scenarios
Figure 2 for SynDrone -- Multi-modal UAV Dataset for Urban Scenarios
Figure 3 for SynDrone -- Multi-modal UAV Dataset for Urban Scenarios
Figure 4 for SynDrone -- Multi-modal UAV Dataset for Urban Scenarios
Viaarxiv icon

Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network

Add code
Aug 09, 2023
Figure 1 for Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network
Figure 2 for Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network
Figure 3 for Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network
Figure 4 for Continual Road-Scene Semantic Segmentation via Feature-Aligned Symmetric Multi-Modal Network
Viaarxiv icon

DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks

Add code
Nov 08, 2022
Figure 1 for DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks
Figure 2 for DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks
Figure 3 for DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks
Figure 4 for DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks
Viaarxiv icon