Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Mar 26, 2025

Andrii Yermakov, Jan Cech, Jiri Matas

Figure 1 for Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Figure 2 for Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Figure 3 for Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Figure 4 for Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Share this with someone who'll enjoy it:

Abstract:This paper tackles the challenge of detecting partially manipulated facial deepfakes, which involve subtle alterations to specific facial features while retaining the overall context, posing a greater detection difficulty than fully synthetic faces. We leverage the Contrastive Language-Image Pre-training (CLIP) model, specifically its ViT-L/14 visual encoder, to develop a generalizable detection method that performs robustly across diverse datasets and unknown forgery techniques with minimal modifications to the original model. The proposed approach utilizes parameter-efficient fine-tuning (PEFT) techniques, such as LN-tuning, to adjust a small subset of the model's parameters, preserving CLIP's pre-trained knowledge and reducing overfitting. A tailored preprocessing pipeline optimizes the method for facial images, while regularization strategies, including L2 normalization and metric learning on a hyperspherical manifold, enhance generalization. Trained on the FaceForensics++ dataset and evaluated in a cross-dataset fashion on Celeb-DF-v2, DFDC, FFIW, and others, the proposed method achieves competitive detection accuracy comparable to or outperforming much more complex state-of-the-art techniques. This work highlights the efficacy of CLIP's visual encoder in facial deepfake detection and establishes a simple, powerful baseline for future research, advancing the field of generalizable deepfake detection. The code is available at: https://github.com/yermandy/deepfake-detection

View paper on

Share this with someone who'll enjoy it:

Title:Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Paper and Code