Rita Cucchiara

Is Multiple Object Tracking a Matter of Specialization?

Nov 01, 2024
Via arXiv

TPP-Gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes

Oct 30, 2024
Via arXiv

Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments

Oct 23, 2024
Via arXiv

Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training

Oct 09, 2024
Via arXiv

Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection

Sep 16, 2024
Via arXiv

KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction

Sep 09, 2024
Via arXiv

Fluent and Accurate Image Captioning with a Self-Trained Reward Model

Aug 29, 2024
Via arXiv

μgat: Improving Single-Page Document Parsing by Providing Multi-Page Context

Aug 28, 2024
Via arXiv

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas

Aug 28, 2024
Via arXiv

Alfie: Democratising RGBA Image Generation With No $$$

Aug 27, 2024
Via arXiv