Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Simone Bonechi

Who Made This? Fake Detection and Source Attribution with Diffusion Features

Oct 31, 2025

Simone Bonechi, Paolo Andreini, Barbara Toniella Corradini

Figure 1 for Who Made This? Fake Detection and Source Attribution with Diffusion Features

Figure 2 for Who Made This? Fake Detection and Source Attribution with Diffusion Features

Figure 3 for Who Made This? Fake Detection and Source Attribution with Diffusion Features

Figure 4 for Who Made This? Fake Detection and Source Attribution with Diffusion Features

Abstract:The rapid progress of generative diffusion models has enabled the creation of synthetic images that are increasingly difficult to distinguish from real ones, raising concerns about authenticity, copyright, and misinformation. Existing supervised detectors often struggle to generalize across unseen generators, requiring extensive labeled data and frequent retraining. We introduce FRIDA (Fake-image Recognition and source Identification via Diffusion-features Analysis), a lightweight framework that leverages internal activations from a pre-trained diffusion model for deepfake detection and source generator attribution. A k-nearest-neighbor classifier applied to diffusion features achieves state-of-the-art cross-generator performance without fine-tuning, while a compact neural model enables accurate source attribution. These results show that diffusion representations inherently encode generator-specific patterns, providing a simple and interpretable foundation for synthetic image forensics.

Via

Access Paper or Ask Questions

Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

Nov 19, 2019

Simone Bonechi, Paolo Andreini, Monica Bianchini, Franco Scarselli

Figure 1 for Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

Figure 2 for Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

Figure 3 for Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

Figure 4 for Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation

Abstract:Providing pixel-level supervisions for scene text segmentation is inherently difficult and costly, so that only few small datasets are available for this task. To face the scarcity of training data, previous approaches based on Convolutional Neural Networks (CNNs) rely on the use of a synthetic dataset for pre-training. However, synthetic data cannot reproduce the complexity and variability of natural images. In this work, we propose to use a weakly supervised learning approach to reduce the domain-shift between synthetic and real data. Leveraging the bounding-box supervision of the COCO-Text and the MLT datasets, we generate weak pixel-level supervisions of real images. In particular, the COCO-Text-Segmentation (COCO_TS) and the MLT-Segmentation (MLT_S) datasets are created and released. These two datasets are used to train a CNN, the Segmentation Multiscale Attention Network (SMANet), which is specifically designed to face some peculiarities of the scene text segmentation task. The SMANet is trained end-to-end on the proposed datasets, and the experiments show that COCO_TS and MLT_S are a valid alternative to synthetic images, allowing to use only a fraction of the training samples and improving significantly the performances.

* arXiv admin note: substantial text overlap with arXiv:1904.00818

Via

Access Paper or Ask Questions

A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation

Jul 29, 2019

Paolo Andreini, Simone Bonechi, Monica Bianchini, Alessandro Mecocci, Franco Scarselli, Andrea Sodi

Figure 1 for A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation

Figure 2 for A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation

Figure 3 for A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation

Figure 4 for A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation

Abstract:In recent years, the use of deep learning is becoming increasingly popular in computer vision. However, the effective training of deep architectures usually relies on huge sets of annotated data. This is critical in the medical field where it is difficult and expensive to obtain annotated images. In this paper, we use Generative Adversarial Networks (GANs) for synthesizing high quality retinal images, along with the corresponding semantic label-maps, to be used instead of real images during the training process. Differently from other previous proposals, we suggest a two step approach: first, a progressively growing GAN is trained to generate the semantic label-maps, which describe the blood vessel structure (i.e. vasculature); second, an image-to-image translation approach is used to obtain realistic retinal images from the generated vasculature. By using only a handful of training samples, our approach generates realistic high resolution images, that can be effectively used to enlarge small available datasets. Comparable results have been obtained employing the generated images in place of real data during training. The practical viability of the proposed approach has been demonstrated by applying it on two well established benchmark sets for retinal vessel segmentation, both containing a very small number of training samples. Our method obtained better performances with respect to state-of-the-art techniques.

Via

Access Paper or Ask Questions

COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

Apr 02, 2019

Simone Bonechi, Paolo Andreini, Monica Bianchini, Franco Scarselli

Figure 1 for COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

Figure 2 for COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

Figure 3 for COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

Figure 4 for COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation

Abstract:The absence of large scale datasets with pixel-level supervisions is a significant obstacle for the training of deep convolutional networks for scene text segmentation. For this reason, synthetic data generation is normally employed to enlarge the training dataset. Nonetheless, synthetic data cannot reproduce the complexity and variability of natural images. In this paper, a weakly supervised learning approach is used to reduce the shift between training on real and synthetic data. Pixel-level supervisions for a text detection dataset (i.e. where only bounding-box annotations are available) are generated. In particular, the COCO-Text-Segmentation (COCO_TS) dataset, which provides pixel-level supervisions for the COCO-Text dataset, is created and released. The generated annotations are used to train a deep convolutional neural network for semantic segmentation. Experiments show that the proposed dataset can be used instead of synthetic data, allowing us to use only a fraction of the training samples and significantly improving the performances.

Via

Access Paper or Ask Questions