Picture for Giulio Zizzo

Giulio Zizzo

MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming

Add code
Mar 08, 2025
Viaarxiv icon

Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs

Add code
Feb 21, 2025
Viaarxiv icon

Granite Guardian

Add code
Dec 10, 2024
Figure 1 for Granite Guardian
Figure 2 for Granite Guardian
Figure 3 for Granite Guardian
Figure 4 for Granite Guardian
Viaarxiv icon

HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment

Add code
Nov 11, 2024
Viaarxiv icon

Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems

Add code
Oct 31, 2024
Figure 1 for Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems
Figure 2 for Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems
Figure 3 for Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems
Figure 4 for Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems
Viaarxiv icon

Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation

Add code
Oct 10, 2024
Figure 1 for Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation
Figure 2 for Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation
Figure 3 for Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation
Figure 4 for Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation
Viaarxiv icon

Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs

Add code
Oct 04, 2024
Figure 1 for Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs
Figure 2 for Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs
Figure 3 for Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs
Viaarxiv icon

Towards Assuring EU AI Act Compliance and Adversarial Robustness of LLMs

Add code
Oct 04, 2024
Viaarxiv icon

Knowledge-Augmented Reasoning for EUAIA Compliance and Adversarial Robustness of LLMs

Add code
Oct 04, 2024
Viaarxiv icon

MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks

Add code
Sep 27, 2024
Figure 1 for MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
Figure 2 for MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
Figure 3 for MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
Figure 4 for MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
Viaarxiv icon