Rita Cucchiara

Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models

Feb 02, 2026

A Unified Masked Jigsaw Puzzle Framework for Vision and Language Models

Jan 17, 2026

CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models

Jan 08, 2026

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Dec 17, 2025

Recurrence Meets Transformers for Universal Multimodal Retrieval

Sep 10, 2025

Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation

Aug 23, 2025

Quo Vadis Handwritten Text Generation for Handwritten Text Recognition?

Aug 13, 2025

BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images

Jul 16, 2025

RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors

Jun 09, 2025

Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals

May 27, 2025