
Emanuele Vivoli

Media Integration and Communication Center, UNIFI, Department of Information Engineering

ComicsPAP: understanding comic strips by picking the correct panel

Mar 11, 2025
Via arXiv

HoloMine: A Synthetic Dataset for Buried Landmines Recognition using Microwave Holographic Imaging

Feb 28, 2025
Via arXiv

ComiCap: A VLMs pipeline for dense captioning of Comic Panels

Sep 24, 2024
Via arXiv

One missing piece in Vision and Language: A Survey on Comics Understanding

Sep 14, 2024
Via arXiv

Towards Generative Class Prompt Learning for Few-shot Visual Recognition

Sep 03, 2024
Via arXiv

CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding

Jul 04, 2024
Via arXiv

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Jul 03, 2024
Via arXiv

Multimodal Transformer for Comics Text-Cloze

Mar 06, 2024
Via arXiv

Error assessment of microwave holography inversion for shallow buried objects

Mar 27, 2023
Via arXiv

CTE: A Dataset for Contextualized Table Extraction

Feb 13, 2023
Via arXiv