Picture for Fabio Galasso

Fabio Galasso

ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation

Add code
Jan 03, 2025
Figure 1 for ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation
Figure 2 for ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation
Figure 3 for ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation
Figure 4 for ANTHROPOS-V: benchmarking the novel task of Crowd Volume Estimation
Viaarxiv icon

Social EgoMesh Estimation

Add code
Nov 07, 2024
Figure 1 for Social EgoMesh Estimation
Figure 2 for Social EgoMesh Estimation
Figure 3 for Social EgoMesh Estimation
Figure 4 for Social EgoMesh Estimation
Viaarxiv icon

TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos

Add code
Nov 04, 2024
Figure 1 for TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Figure 2 for TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Figure 3 for TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Figure 4 for TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Viaarxiv icon

Compositional Entailment Learning for Hyperbolic Vision-Language Models

Add code
Oct 09, 2024
Figure 1 for Compositional Entailment Learning for Hyperbolic Vision-Language Models
Figure 2 for Compositional Entailment Learning for Hyperbolic Vision-Language Models
Figure 3 for Compositional Entailment Learning for Hyperbolic Vision-Language Models
Figure 4 for Compositional Entailment Learning for Hyperbolic Vision-Language Models
Viaarxiv icon

OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras

Add code
Aug 18, 2024
Figure 1 for OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras
Figure 2 for OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras
Figure 3 for OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras
Figure 4 for OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras
Viaarxiv icon

Hyperbolic Learning with Multimodal Large Language Models

Add code
Aug 09, 2024
Viaarxiv icon

Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation

Add code
Jul 19, 2024
Figure 1 for Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Figure 2 for Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Figure 3 for Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Figure 4 for Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Viaarxiv icon

Length-Aware Motion Synthesis via Latent Diffusion

Add code
Jul 16, 2024
Viaarxiv icon

MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization

Add code
May 06, 2024
Figure 1 for MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Figure 2 for MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Figure 3 for MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Figure 4 for MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Viaarxiv icon

Following the Human Thread in Social Navigation

Add code
Apr 17, 2024
Viaarxiv icon