Francesco Pinto

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration

Mar 20, 2025
Via arXiv

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Mar 19, 2025
Via arXiv

SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations

Dec 09, 2024
Via arXiv

Copyright-Protected Language Generation via Adaptive Model Fusion

Dec 09, 2024
Via arXiv

Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models

Nov 09, 2024
Via arXiv

Focus On This, Not That! Steering LLMs With Adaptive Feature Specification

Oct 30, 2024
Via arXiv

Strong Copyright Protection for Language Models via Adaptive Model Fusion

Jul 29, 2024
Via arXiv

Extracting Training Data from Document-Based VQA Models

Jul 11, 2024
Via arXiv

Towards Certification of Uncertainty Calibration under Adversarial Attacks

May 22, 2024
Via arXiv

As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?

Mar 19, 2024
Via arXiv